Tuesday, March 22, 2022

Microsoft claims new AI model architecture improves language translation - VentureBeat - Translation

Did you miss a session at the Data Summit? Watch On-Demand Here.


Coinciding with Nvidia’s March 2022 GPU Technology Conference, Microsoft today announced an update to Translator — its Azure service that can translate roughly 100 languages across call centers, chatbots, and third-party apps — that the company claims greatly improves the quality of Translator’s translations. Powered by a new family of AI models that can translate directly between certain languages, Microsoft says that an internal study found the translations to be up to 15% better compared with those generated by previous Translator models.

The models also power a new feature in Translator, multilingual document translation, that can translate documents containing text written in different languages.

Z-code Mixture of Experts

Powering Translator’s upgrades is Z-code, a part of Microsoft’s larger XYZ-code initiative to combine AI models for text, vision, audio, and language to create software that can speak, see, hear, and (hopefully) understand. The team comprises a group of scientists and engineers from Azure AI and the Project Turing research group, focusing on building multilingual, large-scale models that power various Microsoft products.

Z-code provides the framework, architecture, and models for AI-powered translation across language families. With Z-code, Microsoft says it’s using transfer learning — an AI technique that applies knowledge from one task to another, related task — to move beyond common languages, like English, and improve translation for the estimated 1,500 “low-resource” languages in the world.

Like all models, Microsoft’s learn from examples in large datasets sourced from a mixture of public and private archives (e.g., ebooks, websites such as Wikipedia, and hand-translated documents). Low-resource languages are generally defined as having under 1 million example sentences, which adds to the challenge of developing models; AI models usually perform better when given more examples.

Because many languages share linguistic elements, Microsoft develops Z-code models multilingually across different languages and that knowledge is transferred between languages. For example, a model’s translation skills might be used to improve its ability to understand natural (i.e., everyday) language.

Microsoft rolled out Z-code-powered enhancements to Translator last October, adding support for 12 new languages including Georgian, Tibetan, and Uyghur. Now, the company says that an improved version of Z-code — Z-code Mixture of Experts (MoE), which launched this week — can better understand “low-resourced” language nuances.

The AI models used in modern text translation, MoE or no, contain components called “neurons” that are organized into distinctive layers. Each neuron is a mathematical operation that plays a key role in how the model “learns” to interpret and translate languages. MoEs are made up of small clusters of neurons that are only active under special, specific circumstances. Lower layers extract certain “features” from the text to be translated — i.e., characteristics — and “experts” — i.e., clusters — are called upon to evaluate those features. For example, each expert cluster can learn to handle a separate part of speech or semantic or grammatical rule.

“Z-code MoE models are a promising way forward in the language domain since they are more efficient and need fewer systems to run. The same underlying model can be fine-tuned to perform different language understanding tasks such as translating between languages, summarizing a speech, offering ways to complete a sentence or generating suggested tweets, instead of having to develop separate models for each of those narrow purposes,” Xuedong Huang, chief technology officer at Microsoft’s Azure AI division, told VentureBeat via email. “While the Z-code MoE models learn universal representation, specific parts of the model can specialize in particular languages and linguistics characteristics to enable better translation.”

Compared with other model architectures, MoEs have some advantages. The experts can receive a mix of data, but only a few experts remain active at any one time, meaning that even a huge model needs only a small amount of processing power in order to develop or run. In fact, MoE is one of the few architectures demonstrated to scale to more than a trillion parameters. (Parameters are the part of the model that’s learned from example text data, and generally speaking — especially in language — the correlation between the number of parameters and sophistication has held up remarkably well.)

To illustrate, an MoE model containing 1.6 trillion parameters requires compute resources approximately equal to that of a 10 billion-parameter conventional model, by Microsoft’s estimation. The cost isn’t insubstantial, to be fair — a 2020 study from startup AI21 Labs pegged the expenses for developing a text-generating model with only 1.5 billion parameters at between $80,000 and $1.6 million. But it’s more efficient than other methods. Microsoft’s and Nvidia’s recently released Megatron 530B language model, which has 530 billion parameters, was originally developed across 560 Nvidia DGX A100 servers. A single DGX A100 starts at $199,000.

MoEs were first proposed in the ’90s, and research papers in recent years from companies including Google describe experiments with trillion-parameter-plus MoE language models. But Microsoft claims that Z-code MoE is the first MoE language model to reach production.

“Using an MoE approach allows us to achieve performance and quality benefits more efficiently, as it only engages a portion of the model to complete a task, as opposed to other architectures that have to activate an entire AI model to run every request. This architecture allows massive scale in the number of model parameters while keeping the amount of compute constant,” Huang continued. “For our production model deployment, the training dataset was 5 billion parameter models, which are 80 times larger than Microsoft’s currently deployed models. The models are trained on 64 GPUs. A single MoE model can replace 20 of the current translation models, increasing efficiency of training MoE models while also improving translation accuracy.”

Future work

While Microsoft says that Z-code MoE has led to great strides in improving language translation, the problem isn’t solved. Not by a long shot.

Because of biases in public example text, non-English models continued to perform worse than their English-language counterparts. For example, languages in Wikipedia-based datasets vary not only by size but in the percentage of stubs without content, the number of edits, and the total number of users (because not all speakers of a language have access to Wikipedia). Beyond Wikipedia, ebooks in some languages, like Arabic and Urdu, are more commonly available as scanned images versus text, which requires processing with optical character recognition tools that can dip to as low as 70% accuracy.

A recent piece in The Conversation points out the other flaws in AI-powered translation, including different forms of gender bias. In certain languages, Google Translate once presupposed that doctors were male while nurses were female, while Bing’s translator translated phrases like “the table is soft” as the feminine “die Tabelle” in German (which refers a table of figures). Other translations miss the meaning of the original text entirely. In one study referenced by The Conversation, the headline “UK car industry in brace position ahead of Brexit deadline” was translated by an AI system as “L’industrie automobile britannique en position de force avant l’échéance du Brexit,” which implies that the U.K. car industry is in a position of strength as opposed to weakness.

“No matter how fluent the suggested translation appears, these types of errors (incorrect terminology, omissions, mistranslations) abound in machine translation output,” Guillaume Deneufbourg, a researcher in language sciences at the Université de Lille in Lille, France, wrote for The Conversation. “Another issue with machine translation which people may be less aware of is a process known as normalization. If new translations are only ever made using existing ones, over time, the process can stifle inventiveness, creativity, and originality.”

One study from Tilburg University and the University of Maryland referred to the normalization phenomenon as “translationese,” with the coauthors finding a quantifiable loss of “linguistic richness” in AI systems’ translations. While the study points out that that this might be desirable side effect if the goal is to simplify the translation, normalization becomes problematic when it prevents systems from making grammatically correct choices and reduces diversity in “morphologically richer” languages, like Spanish and French.

Microsoft says that it continues to develop new methods to improve translation, both through architectural improvements and techniques to mitigate bias in example data.

“Today’s machine learning models need huge translation data sets with dialects for training, and there may not be enough data for all the desired languages and dialects, particularly in smaller markets,” Huang added. “The ability to share knowledge across different languages enables Z-code to produce more accurate results for underrepresented languages that don’t have a huge number of translation examples to learn from. This will help improve AI fairness and ensure that high-quality translations are not restricted to languages with rich training resources only.”

VentureBeat's mission is to be a digital town square for technical decision-makers to gain knowledge about transformative enterprise technology and transact. Learn More

Adblock test (Why?)

Monday, March 21, 2022

Reasons why you should hire professional translation services - KHTS Radio - Translation

Turning your business towards the international market is one of the biggest milestones and your very first step of going global. It might also be quite an overwhelming task at the same time. However, if you have thoroughly planned out the process then it is actually quite affordable and seamless. One of the biggest challenges you may have to face while entering the international market is dealing with the language barriers. 

Many businesses may turn towards hiring individual translators and try to keep the work within the business. However, turning towards a professional translation service is a much wiser option. 

There are a number of reasons why you should go for translational services instead of going through the hassle of hiring a translator. Here are a few reasons why you should consider hiring professional translation services. 

One of the key points of translation is precision. Translation has no edge for mistakes, a single error can change the meaning of the whole thing and make you lose credibility. Your translation needs to be highly precise and appropriate for you to be able to use it in business. In fact, the nature of translating one language to another is complicated in itself and may be relevant to some field or setting. For example, translating text that is from specific fields like law, medical or technical fields has to be used according to the set vocabulary of that particular field. Translation services can provide you with such specialized services without you having to do anything by yourself. 

When expanding your business, simply using translated content is not enough. Your translation needs to be localized. It should be able to adapt to the local culture and should give off the same sense of nativity. In order for your business to truly flourish it should definitely conform to the local culture.This does not just include translating the words but also keeping the color, images, date, time, and other features depending upon the setting. The whole point is to give off a strong impression to locals that is appropriate for the settings they live in. Nobody is better at it like professional translators. Visit JK Translate certified translation services.

 

Hiring a professional service for translation is basically hiring experts. The translators that are usually hired through the translation services generally tend to have excellent qualifications. They go through some hard screening where their experience and expertise both are tested. Moreover they are generally experts in whatever business you are in. Not only that but their expertise also allows them to deal with complicated projects as well. They can generally handle most of the work and increase your work efficiency. These professionals can obviously provide quality content and all of the intricacies of a language are dealt with professionalism. 

When it comes to translation, the quality of translation can be both an advantage or disadvantage for a business. Quality management systems are literally there to work on the final outcome and provide error free quality translation. Many companies tend to turn towards using various translational aids. This allows these translation companies to work on large projects without much room for errors. Good quality translation is obviously going into your favor as it would help with the company’s growth and moreover it would affect all the future projects of your company as well. Read more on How To Choose A Translation Service? Read The Reviews

 If you are going for a global business, then turning towards a professional translating service is one of the best decisions that you can make for yourself. These services are generally much more affordable in comparison to hiring a full time translator.

Adblock test (Why?)

Developing Machine Translation for ASL - Language Magazine - Translation

Though automatic translation models like Google Translate are far from perfect, they have become highly accurate tools to allow individuals to communicate and easily understand each other, particularly for high-demand language pairs like English and Spanish. Still, machine translation for signed languages like American Sign Language (ASL) lags far behind spoken and written languages.

That could be changing soon, though—the COVID-19 pandemic has spurred the development of artificial intelligence-based technologies that can translate sign languages into written language. Most recently, an engineering student at the Vellore Institute of Technology in Tamil Nadu, India, went viral on social media for her efforts to develop an AI model that can translate basic ASL phrases into English with high accuracy rates. In a now-viral LinkedIn post, Priyanjali Gupta shared the model, receiving more than 60,000 reactions on the platform.
Gupta’s AI model made headlines on Feb. 15 for its ability to identify simple ASL phrases with accuracy rates hovering around 90% or higher. While it doesn’t work as an all-purpose machine translation model (it can only identify six different phrases right now), Gupta’s model serves as a testament to the increased interest in developing automatic translation for signed languages. She plans to work on expanding the model to improve its ability to identify additional signs.

“The data set is manually made with a computer webcam and given annotations. The model, for now, is trained on single frames,” Gupta told Interesting Engineering. “To detect videos, the model has to be trained on multiple frames, for which I’m likely to use LSTM. I’m currently researching on it… I’m just an amateur student but I’m learning. And I believe, sooner or later, our open-source community, which is much more experienced than me, will find a solution.”

Unlike written languages, machine translation for signed languages requires a given model to be capable of identifying specific gestures—that is, the placement, shape, and movement of an individual’s hands—with high precision.

This means developers must have knowledge about computer vision in addition to their knowledge about machine translation and sign language. As a result, it’s more difficult than developing machine translation for written languages, which generally have a standardized set of already-digitized characters.
AW

Adblock test (Why?)

Best English to French dictionary - WesternSlopeNow - Dictionary

[unable to retrieve full-text content]

Best English to French dictionary  WesternSlopeNow

Genius English Translations – Red Velvet - In My Dreams (English Translation) - Genius - Translation

[unable to retrieve full-text content]

Genius English Translations – Red Velvet - In My Dreams (English Translation)  Genius

Friday, March 18, 2022

Why dog-eared dictionaries are left on the shelf - The Times - Dictionary

Ciaran Bruton from Galway sprung a new word on me this week. “First Max Hastings, now Matthew Parris,” he complained, had been “fumfering about negotiating with Putin”. What could this mean?

Fumfering is an onomatopoeic sort of word, so I could hazard a guess but I’d never come across it before and it doesn’t seem to appear in any mainstream dictionary. Thanks to some online resources — lexico.com, wiktionary.org et al — I now know that to fumfer (or phumpher) can mean any of the following: to waffle, to stutter, to mutter, to temporise, to putter aimlessly or to stall.

Putting aside Ciaran Bruton’s injustice to our distinguished columnists, I’m glad to have been introduced to the word. I wondered, given his address, if fumfer

Adblock test (Why?)