Saturday, July 8, 2023

USC researchers use AI to help translate Bible into very rare languages - The Washington Post - Translation

Out of the 7,100 languages that exist, the Bible has been translated into more than 700, making it the most-translated book in the world. Yet, those remaining languages — many of them extremely rare — have vexed Bible translators for decades. Two scientists are looking to new advancements in artificial intelligence to help close the gap.

“We want to reach all the languages on Earth; the goal is to reach everyone,” said Joel Mathew, a research engineer who alongside Ulf Hermjakob recently launched the Greek Room, an AI-powered technology to help streamline the highly technical process of biblical translation.

Combining Hermjakob’s long experience with natural language processing and Mathew’s field knowledge of Bible translation, the two researchers at the University of Southern California’s Information Sciences Institute developed the technology to target “very low-resource languages that are not even in the top 500,” Mathew said.

The Greek Room includes three main tools: spell-checking; world alignment, which ensures consistency in translation; and Wildebeest, used to detect improper characters in a script.

The two scientists met in 2015 when Mathew joined USC to complete a master’s degree in computer science. There, he encountered Hermjakob in the AI division of the Information Sciences Institute. They bonded over a shared passion for languages and their Christian faith.

Mathew, the son of two Bible translators, has observed firsthand the difficulties that come with manual translation by local church members. In his hometown, New Delhi, he took notes on all the tasks that technology could accomplish.

Spell-checking usually requires many people and time, he explained. In the context of translation into rare languages, only local church members are qualified, and they don’t have technology to back up their work.

“These are not trivial problems; these are very hard problems. But big companies are not interested in solving them; it’s not their business model to target very rare languages,” he said.

When Mathew shared with Hermjakob some of the problems Indian translators faced on the ground, he jumped at the opportunity.

“I always had this feeling to know how, at some point, I could apply my skills to my faith,” said Hermjakob, who earned a PhD in computer science at the University of Texas.

With their project, Mathew and Hermjakob want to work on languages that don’t even have a written system, grammar codes, dictionaries or spell-checkers.

“We are thinking of languages like Uyghur or Oromo,” said Hermjakob. Oromo is spoken in Ethiopia and northern Kenya.

Recently, they have been approached by an Indian consultant interested in the spell-checking and world-alignment tool for Bible translation in Kolami, a language in western India that counts 130,000 native speakers.

The Greek Room also aims to change the traditional model of Bible translation. Historically, translations were done by Western missionaries, who could work on only two languages at most in their lifetime, explained Hermjakob. With the Greek Room, the two researchers encourage a local church-driven model.

“Local churches and local language communities are asking for translations of the Bible in their heart language,” explained Mathew, adding that in a multilingual context, the heart language is the one in which people express their deepest feelings and is usually their native language.

This first version of the Greek Room focuses on quality control so that translators can prioritize other tasks requiring more judgment, such as finding a way to translate a concept that doesn’t exist in a given language. In the next version, the two researchers want the tool to suggest better translations.

Now that their codes and data are available on GitHub, they hope other users will integrate their research into the tools and innovate further.

Their initiative, supported by the Wycliffe Bible Translators USA organization, is part of a broader program directed by Every Tribe, Every Nation that hopes to make the Scripture available in every language by 2033.

Religion News Service

Adblock test (Why?)

No comments:

Post a Comment