
ANCIENT LINES PARSED WITH TECH | Mathematicians at the University of the Philippines (UP) conduct a demonstration (left) of the software translating text written in the precolonial alphabet known as Baybayin. A renewed interest in Baybayin also shows on a portion of a road inside UP Diliman campus, in a photo taken on July 11, 2023/ (Photo by KRIXIA SUBINGSUBING and GRIG C. MONTEGRANDE / Philippine Daily Inquirer)
MANILA, Philippines 鈥 The world has become significantly smaller as communication barriers continue to be torn down by translator tools, Artificial Intelligence (AI), and other technology.
These tools, as developed by tech giants like Microsoft and Google, are applicable to major languages like English, Chinese, and even Filipino. Recently, they have also served to revive ancient languages like Aramaic and ancient Hebrew.
With these developments, mathematician Renier Mendoza 鈥 who also considers himself a keen advocate of preserving languages 鈥 had realized that 鈥渢here [still] weren鈥檛 any, if at all, tools like that for Baybayin,鈥 the country鈥檚 precolonial writing alphabet.
Database
鈥淚n comparison, tools like that for Japanese, Korean, and Chinese are already highly developed,鈥 said the associate professor at the Institute of Mathematics of the University of the Philippines.
鈥淪o we wanted to develop our own tool using machine-learning algorithms,鈥 he said.
Mendoza, fellow associate professor Rachelle Sambayan and master鈥檚 student Rodney Pino began developing in 2021 a software that they now call the first artificial intelligence (AI)-powered Baybayin translator tool, which can convert entire paragraphs and even pages of Baybayin text into something readable for Filipinos today.
Pino, whose thesis is this project, said the team hopes to contribute to efforts to decode pre-colonial texts and preserve the country鈥檚 ancestral alphabet. Two years ago, he started collating images for each of the 17 characters in Baybayin 鈥 鈥渁nything I could find that had Baybayin text on it,鈥 he said.
After more than three months, the team was able to organize a Baybayin database of 17,000 images 鈥 each of the 17 characters represented by 1,000 images and their variations 鈥 then tested a support vector machine (SVM) on that assortment.
Sambayan described the SVM as an algorithm that is part of the translator tool and is able to recognize whether a character fed to that software is in Baybayin or Latin.
The team converted all the images into black and white to make their processing easier for the tool鈥檚 power and memory.
Demonstration
The software鈥檚 translations are still fairly straightforward, as a demonstration by the team to the Inquirer showed.
In translating 鈥渁lab ng puso鈥 (heart aflame) from Baybayin, for example, the tool showed all possible translations, including variations on the spelling 鈥減uso,鈥 such as 鈥減oso.鈥
Mendoza said the tool still had 鈥渋ssues鈥 in transliterating similar-sounding vowels like 鈥渆鈥 and 鈥渋,鈥 and 鈥渙鈥 and 鈥渦.鈥
But the software will not translate anything outside the Filipino dictionary, Sambayan said.
Although the tool is still in its initial stage, she said it is the first such software that is able to translate large chunks of Baybayin text into Filipino.
The team is working on cutting down the tool鈥檚 speed to increase its volume capacity. Mendoza said it is also currently unable to translate from Filipino to Baybayin, as this would require a higher level of AI.
He said the software is also being developed further 鈥渙n how to choose the better word for translation, depending on the syntax and context of the sentence 鈥 and that is where we would need greater linguistic expertise.鈥
Furthermore, the team hopes to develop a mobile version soon, 鈥渟o there is a portable version for the public,鈥 Mendoza said.
But it has made the Baybayin database public, for now, so that more researchers could use it in their studies on Baybayin and AI.
鈥楩or modern times鈥
Many precolonial documents are written in Baybayin, and some historians may not necessarily know or understand this alphabet.
鈥淪o while it is a bit [of a] niche [field], this tool still has a lot of possible uses in other aspects,鈥 Mendoza said.
He said the team is not necessarily advocating Baybayin as the primary writing system at this age, yet they hope to preserve the alphabet and help future scholars.
鈥淲e鈥檙e really hoping to inspire more efforts in preserving the Baybayin,鈥 Pino said. 鈥淲e should be proud of our own alphabet and at the same time, digitize it for modern times.鈥