Tuesday, January 23, 2024

Walang charot-charot 'to siz: This dictionary is your guide to the inventive language of Pinoy gay communities - PhilStar Life - Dictionary

"Nakakalurkey itech, neuro akesh, ay bongga pak na pak!"

Did you read the sentence above slowly, tongue-twisted, and wondered what that could mean? You are not alone, as even some LGBTQ+ individuals can have a hard time understanding such words even though they're popular in the queer community. Turns out, the sentence means: "I'm shocked, I'm blown away, and it's extravagant and great."

A Google search online might not provide a proper translation of such phrases but a new dictionary may just define any Pinoy beki slang you hear and see. 

In case you missed it, Vibal Foundation launched the Philippine Slang and Gay Dictionary on Jan. 12 to explore the "endlessly inventive counter language of Filipino gay and street communities," offering a comprehensive introduction to the vibrant, influential, and historically significant terms still in use today.

Speaking to PhilSTAR L!fe, Sally Eugenio a.k.a. Lady S, one of the authors of the book, believed that "Beki Language" plays a vital role in developing the Filipino language and the country's history. Thus, a beki dictionary was born.

"We want the readers to learn that this language exists and must be accepted. They need to discover the beauty and the relevance of every word created by the extraordinary gay people," Eugenio told L!fe.

Understanding the syntax and semiotics of 'Beki language'

LGBTQ+ language in the Philippines remains relatively distinct, with its rapidly shifting linguistic code called "Swardspeak," influenced by Spanish and English loan words. This code mainly consists of lexical items but includes sound changes, as if you are deciphering a Da Vinci Code.

But for Eugenio, indulging themselves in the community makes them fully understand the language with the help of research and analysis of its syntax and semiotics.

"We also did many interviews, especially with the community of gays in parlors, barberyas, in the marketplace, schools, and with ordinary and not-so-ordinary human beings. We consulted different dictionaries, verified facts and definitions, and interviews with the right group of people—from linguistics to scholars to students, to the old and new version of Bekis."

Through this, they decided on what to include or not in the over 2,000-word list, not to mention their judgment as certified "beks by heart."

'Gay Lingo is beautiful'

Believing that gay lingo is beautiful, funny, daring, expressive, and mostly makes everyone happy, Eugenio and their team were amazed at how the public received the book, including their straight allies.

"Somehow, this book represents and recognizes the community and the language that has evolved from it," Eugenio shared with L!fe

The book also features scholarly talks on the study of slang and homosexual language, a comprehensive lexicon, vibrant pictures, and the origin of each phrase, as well as usage notes and sample sentences.

In addition to commonly-used slang, the dictionary also includes definitions of counter language—a dialectical form created in opposition to more formal or "official" languages—providing information about LGBT+ customs and culture, collectively contributed by author and Palanca awardee Christopher Dela Cruz, veteran columnist of Philippine Online Chronicles Lady S, language expert Ojo Gonzales, and drag queen Randiva Datinguinoo.

Although the book has a disclaimer for its use of explicit and sexual language that may offend some readers, Eugenio said, "It's 2024, and representation matters, and it gladdens our hearts to see the overwhelming excitement around this title."

Philippine Slang and Gay Dictionary is available for P610 on the Vibal online shop. (with reports from John Patrick Ranara)

Adblock test (Why?)

Monday, January 22, 2024

Faulty machine translations litter the web - Tech Xplore - Translation

translation
Credit: CC0 Public Domain

Near the end of the last century, Bill Gates saw the prospect of unifying citizens of nearly 200 countries, speaking more than 7,000 languages, coming together in common dialogue through the suddenly burgeoning web community.

"The Internet is becoming the town square for the global village of tomorrow," he declared.

The Internet certainly has since drawn the world closer and has enriched global communications, commerce, research and entertainment immeasurably.

But a recent report reminds us—as if we really needed reminding—that along with progress sometimes come problems.

Researchers at Amazon Web Services Artificial Intelligence Lab and the University of California, Santa Barbara, say that after examining more than 6 billion sentences across the web, they have found that more than half had been translated into two or more different languages. The translations, they found, were often poor. And with each successive translation into other languages, some up to eight or nine, the results became worse.

The report, "A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism," was uploaded to the preprint server arXiv Jan. 11.

"The low quality of these … translations indicates they were likely created using machine translation," the authors report. "Our work raises serious concerns about training models such as multilingual large language models on both monolingual and bilingual data scraped from the web."

The researchers said texts are not only being translated by artificial intelligence but are being created by AI as well. They observed rates of AI-generated translations were highest among lower-resource languages, such as Wolof and Xhosa, African languages.

"We find that highly multi-way parallel translations are significantly lower quality than two-way parallel translations," the authors continue.

That means that as trillions of bits of data are ingested for AI training operations, regions under-represented on the web, such as African nations and other countries with more obscure languages, will face greater challenges in establishing reliable—and grammatical—large language models. With few native resources to draw upon, they must heavily rely on tainted translations flooding the market.

Mehak Dhaliwal, a former applied science intern at Amazon Web Services, told Motherboard in an interview, "We actually got interested in this topic because several colleagues who work in machine training and are native speakers of low resource languages noted that much of the internet in their native language appeared to be machine training generated… Everyone should be cognizant that content they view on the web may have been generated by a machine."

The Amazon researchers found bias in selection of content used for AI training.

They state, "Machine generated, multi-way parallel translations not only dominate the total amount of translated content on the web in lower resource languages, it also constitutes a large fraction of the total web content in those languages."

Such content, they suggested, tends to be simpler, lower-quality passages "likely produced to generate ad revenue." Since fluency and accuracy are lower for machine-trained material, numerous translations will lead to even less accurate content and increase the odds of AI hallucination.

Sometimes, computer-generated translations over the years have led to unintentionally humorous or embarrassing interpretations.

Google misinterpreted a phrase "Russia is a great country" and referred instead to Mordor, a fictional village in J.R.R. Tolkien's "The Lord of the Rings." Facebook's translation software in 2019 inadvertently referred to China's President Xi Jinping as "Mr. S***hole" several times in an English article translated from Burmese text. Facebook immediately apologized and blamed the mishap on a "technical error."

And a medical prescription translation tool for Armenian speakers provided some unfortunate advice for a patient with a headache.

English: "You can take over-the-counter ibuprofen as needed for pain."

Translation to Armenian: "You may take anti-tank missile as much as you need for pain."

More information: Brian Thompson et al, A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism, arXiv (2024). DOI: 10.48550/arxiv.2401.05749

Journal information: arXiv

© 2024 Science X Network

Citation: Faulty machine translations litter the web (2024, January 22) retrieved 22 January 2024 from https://ift.tt/SqimhuO

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Adblock test (Why?)

AI-Dubbed Speech by Argentina's President Goes Viral. But How Good is the Translation? - Slator - Translation

Adblock test (Why?)

New Jersey Will Translate Crucial State Documents Into 7 Languages - Patch - Translation

NEW JERSEY — Imagine trying to navigate your way through a maze of often-cryptic paperwork and applications to get crucial social services as a recent immigrant to New Jersey. Now, imagine trying to do so with forms that you can’t read.

And this is why a new law that regulates “language access” in the state’s government offices is desperately needed, advocates say.

As part of a flurry of bill signings made earlier this month, Gov. Phil Murphy gave final approval to S-2459/A-3837 (read the full text here). It will require state offices to translate official documents into the seven most common non-English languages in New Jersey.

Find out what's happening in Montclairwith free, real-time updates from Patch.

Here’s what to know about the new law, according to the state Legislature:

“This bill requires state government entities in the executive branch that provide direct services to the public to translate vital documents and information, including public documents such as forms and instructions provided to or completed by program beneficiaries or participants, in at least the seven most common non-English languages spoken by individuals with limited-English proficiency in this state, based on United States Census Bureau American Community Survey data, and relevant to services offered by each State government entity.”

The original version of the bill called for translations into 15 languages, not seven. Read More: NJ Bill Would Make State Agencies Translate Forms Into 15 Languages

Find out what's happening in Montclairwith free, real-time updates from Patch.

The languages most commonly used by non-English speakers in New Jersey include Spanish, Chinese (Mandarin and Cantonese), Korean, Portuguese, Gujarati, Arabic, Polish, Haitian, Russian, Hindi, Tagalog, Italian, Vietnamese, Urdu and French, NJ.com previously reported.

The bill takes effect immediately. Translations must be completed within 12 months for the five most common languages, and within 23 months for the additional two most common languages.

New Jersey joins California, Washington D.C., Hawaii, Maryland, Minnesota, New York and dozens of other jurisdictions that have established similar language access laws.

How much will it cost? According to the Office of Legislative Services (OLS):

“The total annual cost of this bill will depend on what interpretation and translation services are already being provided by state government entities and the additional services needed to meet the bill’s requirements … The OLS notes that the bill’s costs for document translation would be the greatest during the first two years of implementation, within which all state government entities would be required to undertake document translations at a rate of five languages in the first year and two in the second year. Documents related to a public health emergency or state of emergency declared by the governor after the bill’s effective date would be translated immediately.”

New Jersey is home to nearly 2 million immigrants and refugees – about 22.2 percent of the its entire population, according to the state health department, which cites U.S. Census data. About 5.2 percent of New Jersey residents have “limited English proficiency.”

ADVOCATES: ‘TRUE LANGUAGE JUSTICE’

Several state legislators and immigration advocates spoke in favor of the new law after the governor signed it. They included:

Sen. Majority Leader Teresa Ruiz – “This legislation will remove the language barrier faced by so many of our communities by ensuring all state entities are prepared to assist our residents regardless of what language they speak. Now that Governor Murphy has signed this bill into law, when people are in need, and especially in crisis, they will be able to connect to the resources available to them. This was highlighted all the more during the pandemic where we saw various obstacles preventing individuals from accessing real-time government information and routine government services. This is a step to close that crucial accessibility gap.”

Sen. Nellie Pou, chair of the Legislative Latino Caucus – “Dealing with state agencies and formal applications can be confusing for native speakers, but it becomes that much more challenging when dense information is presented in a language you don’t know well. This legislation will ensure residents have access to vital documents in their first language, so they don’t have to rely on others to connect them to the services they need.”

Sen. Nilsa Cruz-Perez – “Translating for your parents is simply a part of life for many first generation children, but as one of the most diverse states in the country our residents deserve better. This law will help to ensure we aren’t relying on kids to connect families to life-saving resources.”

Assemblywoman Sadaf Jaffer – “As a state that takes pride in its rich diversity, New Jersey is committed to making sure everyone has the best possible access to government services. Expanding these documents and translation services beyond English and providing them in several of the most commonly spoken languages is extremely timely and important.”

Assemblywoman Ellen Park – “Diversity is one of the Garden State’s greatest assets and nearly a third of New Jerseyans age five and older speak a language other than English. By requiring essential documents and other information to be available in seven of the most commonly spoken non-English languages, New Jersey will ensure access to public services will be determined by eligibility, not by language proficiency, and I could not be prouder to see this legislation now signed into law.”

NJ Human Services’ Office of New Americans Director Johanna Calle – “Language barriers can make it very difficult for New Jersey’s immigrant communities to navigate government programs and access important information, services and worker protections as they make our state their home.”

Araceli Argueta, Advocacy and Organizing Director at the American Friends Service Committee NJ Immigrants Rights Program – “Language is the fundamental tool for understanding the world, culture, and interacting in different contexts and situations. Debating and passing the Language Access bill is about giving communities a voice, ensuring rights, and recognizing the challenges and complexities of the various systems that are supposed to provide care in New Jersey.”

Alejandra Sorto, Campaign Strategist of ACLU of New Jersey – “As one of the most diverse states in the country, today’s passage of S2459/A3837 represents an important and long-overdue step in ensuring that New Jersey’s immigrant communities have equitable access to information, resources, and opportunities and allow for our state government to communicate and build trust with the communities they serve.”

Kat Phan, New Jersey Alliance for Immigrant Justice – “There is a world in which true language justice exists, and we will not stop fighting until all New Jerseyans can dream and thrive in their own language, and live dignified and celebrated lives regardless of their background.”

Reverend Erich Kussman, St. Bartholomew Lutheran Church and Faith in New Jersey leader – “Language is an aspect of national origin. When folks come to this country seeking a new way of life, it is important that we provide equal access so that they can understand and get the help they need. [This bill] provides such an opportunity for those seeking assistance. This bill is love in action. Every time we choose love over hate, love over the negative voices, we are participating in the redemption of the world.”

Serges Demefack, coordinator for the American Friends Service Committee’s Black Immigrant Justice Project – “The expansion of language access demonstrates a commitment to inclusivity and respect for linguistic diversity within the community. This new law will foster trust between government institutions and immigrants, encouraging greater civic engagement and participation in public processes. By providing translation and interpreting services, New Jersey state agencies ensure that all residents, regardless of their proficiency in English, can access vital services such as health care, legal aid, education, and social services.”

IMMIGRANT RIGHTS BILLS

The new law is part of a three-bill package involving immigrant rights that was signed by Murphy earlier this month.

Other new legislation includes a law that will beef up job protections for nearly 50,000 “domestic workers” in New Jersey. Read More: NJ Domestic Workers Win Job Protections, Immigration Advocates Cheer

Another new law requires state agencies to update demographic data collection methods on Asian, Native Hawaiian, Pacific Islander, Middle Eastern, North African and South Asian and Indian Diaspora residents in New Jersey.

Send local news tips and correction requests to eric.kiefer@patch.com. Learn more about advertising on Patch here. Find out how to post announcements or events to your local Patch site.


Get more local news delivered straight to your inbox.Sign up for free Patch newsletters and alerts.
We’ve removed the ability to reply as we work to make improvements. Learn more here

Adblock test (Why?)

How to watch TLC’s ‘Love & Translation’ premiere for free Sunday, Jan. 21 - MassLive.com - Translation

The TLC dating reality series “Love & Translation” premieres this Sunday, Jan. 21 at 10 p.m. ET/9 p.m. CT on the network.

Those without cable can catch premiere of “Love & Translation” for free either on Philo, on DirecTV Stream, or on FuboTV, each of which offer a free trial to new users.

You can watch a trailer for the new series below or by clicking here to watch on TLC’s YouTube channel.

“Watch as three American bachelors travel to paradise where they will be joined by twelve women from nine different countries, who don’t speak any English,” TLC said in a description of the series.

“Without a shared language or the use of a translator, ‘Love & Translation’ explores how singles looking for love come together in the attempt to find a connection,” TLC added.

The premiere episode is titled “You Had Me at Bonjour,” and in a description TLC said “Three American men who only speak English look for love in a seaside villa with twelve international women who do not speak English, testing a social experiment on whether love can break through the language barrier.”

How can I watch TLC’s “Love & Translation” for free if I don’t have cable?

Viewers can stream the premiere through Philo, DirecTV Stream or FuboTV, which all offer a free trial for new users.

What is Philo?

Philo is an over-the-top internet live TV streaming service that offers 60+ entertainment and lifestyle channels, like AMC, BET, MTV, Comedy Central and more, for the budget-friendly price of $25/month.

What is DirecTV Stream?

The streaming platform offers a plethora of content including streaming the best of live and On Demand, starting with more than 75 live TV channels.

What is FuboTV?

FuboTV is an over-the-top internet live TV streaming service that offers more than 100 channels, such as sports, news, entertainment and local channels.

Adblock test (Why?)

This AI Paper from Johns Hopkins and Microsoft Revolutionizes Machine Translation with ALMA-R: A Smaller Sized LLM Model Outperforming GPT-4 - MarkTechPost - Translation

Machine translation, a crucial aspect of Natural Language Processing, has significantly increased. Yet, a primary challenge persists: producing translations beyond mere adequacy to reach near perfection. Traditional methods, while effective, often need to be improved by their reliance on large datasets and supervised fine-tuning (SFT), leading to limitations in the quality of the output.

Recent developments in the field have brought attention to moderate-sized large language models (LLMs), such as the ALMA models, which have shown promise in machine translation. However, the efficacy of these models is often constrained by the quality of reference data used in training. Researchers have recognized this issue and explored novel training methodologies to enhance translation performance.

Introducing Contrastive Preference Optimization (CPO), a game-changing approach to refining machine translation training. Achieve unparalleled translation accuracy with this groundbreaking technique. This method diverges from traditional supervised fine-tuning by focusing on more than just aligning model outputs with gold-standard references. Instead, CPO trains models to distinguish between just ‘adequate’ and ‘near-perfect’ translations, pushing the translation quality boundaries.

The mechanics of CPO are intriguing. It employs a contrastive learning strategy that utilizes hard negative examples, a significant shift from the usual practice of minimizing cross-entropy loss. This approach allows the model to develop a preference for generating superior translations while learning to reject high-quality but not flawless ones.

The results of implementing CPO have been nothing short of remarkable. The method has demonstrated a substantial leap in translation quality when applied to ALMA models. The enhanced model, referred to as ALMA-R, has showcased performance that matches or surpasses that of the leading models in the field, such as GPT-4. This improvement was achieved with minimal resource investment – a notable achievement in machine translation.

A detailed examination of the ALMA-R model’s performance reveals its superiority over existing methods. It excels in various test datasets, including those from the WMT competitions, setting new translation accuracy and quality standards. These results highlight the potential of CPO as a transformative tool in machine translation, offering a new direction away from traditional training methodologies that rely heavily on extensive datasets.

In conclusion, the introduction of Contrastive Preference Optimization marks a significant advancement in the field of neural machine translation. By focusing on the quality of translations rather than the quantity of training data, this novel methodology paves the way for more efficient and accurate language models. It challenges existing assumptions about machine translation, setting a new benchmark in the field and opening up possibilities for future research and development.


Check out the Paper and Github. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter. Join our 36k+ ML SubReddit, 41k+ Facebook Community, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our Telegram Channel

Muhammad Athar Ganaie, a consulting intern at MarktechPost, is a proponet of Efficient Deep Learning, with a focus on Sparse Training. Pursuing an M.Sc. in Electrical Engineering, specializing in Software Engineering, he blends advanced technical knowledge with practical applications. His current endeavor is his thesis on "Improving Efficiency in Deep Reinforcement Learning," showcasing his commitment to enhancing AI's capabilities. Athar's work stands at the intersection "Sparse Training in DNN's" and "Deep Reinforcemnt Learning".

🐝 Join the Fastest Growing AI Research Newsletter Read by Researchers from Google + NVIDIA + Meta + Stanford + MIT + Microsoft and many others...

Adblock test (Why?)

Dragon Ball Super Sparks Debate Over Chapter 101 Translation Blip - ComicBook.com - Translation

[unable to retrieve full-text content]

Dragon Ball Super Sparks Debate Over Chapter 101 Translation Blip  ComicBook.com