{"id":123156,"date":"2022-03-22T17:25:10","date_gmt":"2022-03-22T17:25:10","guid":{"rendered":"https:\/\/news.microsoft.com\/?p=445813"},"modified":"2022-03-22T17:25:10","modified_gmt":"2022-03-22T17:25:10","slug":"new-z-code-mixture-of-experts-models-improve-quality-efficiency-in-translator-and-azure-ai","status":"publish","type":"post","link":"https:\/\/sickgaming.net\/blog\/2022\/03\/22\/new-z-code-mixture-of-experts-models-improve-quality-efficiency-in-translator-and-azure-ai\/","title":{"rendered":"New Z-code Mixture of Experts models improve quality, efficiency in Translator and Azure AI"},"content":{"rendered":"<p>Microsoft is making upgrades to <a href=\"https:\/\/www.microsoft.com\/en-us\/translator\/\">Translator<\/a> and other <a href=\"https:\/\/azure.microsoft.com\/en-us\/overview\/ai-platform\/\">Azure AI<\/a> services powered by a new family of artificial intelligence models its researchers have developed called <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/project-zcode\/\">Z-code<\/a>, which offer the kind of performance and quality benefits that other large-scale language models have but can be run much more efficiently.<\/p>\n<p>\u201cOur goal is to help everyone and every organization on the planet to communicate better, and to achieve that goal there are really two important dimensions \u2014 we want the quality of translations to be as good as possible and we want to support as many languages as possible,\u201d said Xuedong Huang, Microsoft technical fellow and Azure AI chief technology officer.<\/p>\n<p>Z-code takes advantage of shared linguistic elements across multiple languages via transfer learning \u2014 which applies knowledge from one task to another related task \u2014 to <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-translator-enhanced-with-z-code-mixture-of-experts-models\/\">improve quality for machine translation<\/a> and other language understanding tasks. 
It also helps extend those capabilities beyond the most common languages across the globe to underrepresented languages that have less available training data.<\/p>\n<p>\u201cWith Z-code we are really making amazing progress because we are leveraging both transfer learning and multitask learning from monolingual and multilingual data to create a state-of-the-art language model that we believe has the best combination of quality, performance and efficiency that we can provide to our customers,\u201d Huang said.<\/p>\n<p>These models use a sparse \u201cMixture of Experts\u201d approach that is more efficient to run because it only needs to engage a portion of the model to complete a task, as opposed to other architectures that have to activate an entire AI model to run every request. This architecture allows massive scale in the number of model parameters while keeping the amount of compute constant.<\/p>\n<p>To put these models in production, Microsoft is using <a href=\"https:\/\/blogs.nvidia.com\/blog\/2022\/03\/22\/microsoft-translator-triton-inference\/\">NVIDIA GPUs and Triton Inference Server<\/a> to deploy and scale them efficiently for high-performance inference.<\/p>\n<p>Microsoft has recently deployed Z-code models to improve common language understanding tasks such as named entity recognition, text summarization, custom text classification and key phrase extraction across its Azure AI services. 
But this is the first time a company has publicly demonstrated that it can use this new class of Mixture of Experts models to power machine translation products.<\/p>\n<p>The new Z-code-based translation model is now available, <a href=\"https:\/\/aka.ms\/DocumentTranslationZcode\">by invitation<\/a> initially, to customers using document translation in Translator, a Microsoft <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/cognitive-services\/?OCID=AID2200277_SEM_21b10c24cce010ad95796e02b9d812b3:G:s&amp;ef_id=21b10c24cce010ad95796e02b9d812b3:G:s&amp;msclkid=21b10c24cce010ad95796e02b9d812b3\">Azure Cognitive Service<\/a> which is a part of Azure AI.<\/p>\n<p>Microsoft\u2019s Z-code models consistently improved translation quality over current production models, according to common industry metrics. In contrast with typical multilingual transfer learning approaches, which typically show AI quality gains in languages that have fewer direct translation examples available for training, the Z-code Mixture of Experts models show consistent gains even in the largest languages.<\/p>\n<figure id=\"attachment_83003\" aria-describedby=\"caption-attachment-83003\" class=\"wp-caption alignnone\"><a href=\"https:\/\/www.sickgaming.net\/blog\/wp-content\/uploads\/2022\/03\/new-z-code-mixture-of-experts-models-improve-quality-efficiency-in-translator-and-azure-ai.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"wp-image-83003 size-full\" src=\"https:\/\/www.sickgaming.net\/blog\/wp-content\/uploads\/2022\/03\/new-z-code-mixture-of-experts-models-improve-quality-efficiency-in-translator-and-azure-ai.jpg\" alt=\"A chart shows percentage improvements in translation quality across 37 different language pairs from Translator\u2019s old AI models to a new class of models called Z-code.\" width=\"2000\" height=\"1123\"><\/a><figcaption id=\"caption-attachment-83003\" class=\"wp-caption-text\">New Z-code Mixture of Experts AI models are powering improvements and 
efficiencies in Translator and other Azure AI services.<\/figcaption><\/figure>\n<p>Human evaluators in a blind test commissioned by Microsoft found that the Z-code Mixture of Experts models improved translations across languages, with an average gain of 4%. For instance, the models improved English to French translations by 3.2%, English to Turkish by 5.8%, Japanese to English by 7.6%, English to Arabic by 9.3% and English to Slovenian by 15%.<\/p>\n<h2><strong>Creating more powerful and integrative AI systems<\/strong><\/h2>\n<p>Z-code is part of Microsoft\u2019s larger <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/blog\/a-holistic-representation-toward-integrative-ai\/\">XYZ-code<\/a> initiative that seeks to combine models for text, vision, audio and multiple languages to create more powerful and integrative AI systems that can speak, hear, see and understand people better.<\/p>\n<p>Over the past five years, Microsoft has developed models that have matched human performance in <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-researchers-achieve-new-conversational-speech-recognition-milestone\/\">conversational speech recognition<\/a>, <a href=\"https:\/\/www.microsoft.com\/en-us\/translator\/blog\/2019\/06\/17\/neural-machine-translation-enabling-human-parity-innovations-in-the-cloud\/\">machine translation<\/a>, <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/blog\/novel-object-captioning-surpasses-human-performance-on-benchmarks\/\">image captioning<\/a>, <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-deberta-surpasses-human-performance-on-the-superglue-benchmark\/\">SuperGLUE natural language understanding<\/a> and <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/blog\/azure-ai-milestone-microsoft-kear-surpasses-human-performance-on-commonsenseqa-benchmark\/\">commonsense question answering<\/a>. 
These breakthroughs provide the foundation to realize more ambitious AI systems that can achieve multisensory and multilingual learning that is closer to how people learn and understand, Huang said.<\/p>\n<p>\u201cThose are the pieces, the building blocks that we are using to build a truly differentiated intelligence\u2026and to form production systems that are cost efficient,\u201d Huang said.<\/p>\n<p>Z-code models were developed as part of Microsoft\u2019s <a href=\"https:\/\/www.microsoft.com\/en-US\/ai\/ai-at-scale?activetab=pivot_1%3aprimaryr6\">AI at Scale<\/a> and <a href=\"https:\/\/turing.microsoft.com\/\">Turing<\/a> initiatives, which seek to develop large models that are pretrained on vast amounts of textual data to understand nuances of language \u2014 which can be integrated in multiple Microsoft products and also made available to customers for their own uses.<\/p>\n<p>The same underlying model can be fine-tuned to perform different language understanding tasks such as translating between languages, summarizing a speech, offering ways to complete a sentence or generating suggested tweets, instead of having to develop separate models for each of those narrow purposes.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Microsoft is making upgrades to Translator and other Azure AI services powered by a new family of artificial intelligence models its researchers have developed called Z-code, which offer the kind of performance and quality benefits that other large-scale language models have but can be run much more efficiently. 
\u201cOur goal is to help everyone and [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":123157,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[49],"tags":[135,50],"class_list":["post-123156","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-microsoft-news","tag-artificial-intelligence","tag-recent-news"],"_links":{"self":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts\/123156","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/comments?post=123156"}],"version-history":[{"count":0,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts\/123156\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/media\/123157"}],"wp:attachment":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/media?parent=123156"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/categories?post=123156"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/tags?post=123156"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}