Monday, November 28, 2022

Accelerate the in-vehicle digital experience with Azure Cognitive Services | Azure Blog and Updates


Microsoft is helping to reshape the way the automotive industry serves its drivers through in-vehicle infotainment systems. For example, Azure is partnering with XPENG to enable AI voice experiences for car manufacturers and customers. The solution gives the industry a modern take on text-to-speech and expressive voice, global languages, speaker fidelity, and self-service customization. XPENG joins a growing trend of automakers rethinking investments in ambient voice.

“This is a cutting-edge exploration of car voice interaction in the auto industry,” XPENG automotive AI product senior expert Hao Chao said. “The technology delivers a whole new level of natural speech. With a deep understanding of urban mobility, we’re finding many more scenarios to leverage AI technology for a high level of driver-machine intuition.”

XPENG tapped into Microsoft’s neural text-to-speech technology for its in-car user experience. By using Microsoft’s neural text-to-speech with emotional styles, XPENG can provide a more pleasant listening experience for its customers and combat listening fatigue. Microsoft’s neural text-to-speech offers fluency and naturalness comparable to a human voice. Coupled with multi-emotional voices, Microsoft text-to-speech is a refreshing alternative to the monotonous sound many car assistants have today.

“We’re excited to reimagine how speech and voice can improve the lives of drivers,” Azure AI Speech Product Lead Binggong Ding said. “From a technical standpoint, we really want to make this a model that can serve all auto brands and their developers. How can we best optimize the use of synthetic speech to enable a high-fidelity voice experience without compromising sound quality? XPENG is building upon this challenge to provide a voice assistant that customers have been looking for.”

Microsoft’s long-term goal is to make advanced multi-emotional, global voice capabilities the new standard for global car brands and users. The technology adopted by XPENG added dozens of voice styles, unique emotional intensity control, and deduction abilities. It covers 90 certifications worldwide, including domestic policies, regulatory data center requirements, EU GDPR, and higher data-privacy policyholder requirements. In addition to working with car manufacturers, Microsoft is creating new driving experiences with speech based on the text-to-speech and speech-to-text capabilities within Azure Cognitive Services for speech.

Accelerated speech innovation

Voice is the new interface in ambient computing technology. The quality of text-to-speech and speech-to-text has improved in recent years thanks to research and technological leaps enabled by the development of neural networks. High-quality speech-to-text and text-to-speech meet the needs of automakers creating the next-generation in-car speech experience. Microsoft speech-to-text offers robust recognition capabilities that are speaker-independent and capable of handling ambient noise while driving. Microsoft text-to-speech also features a more fluid, natural-sounding voice, which can be a differentiator for automakers and customers alike. Both speech-to-text and text-to-speech also enhance hands-free control of the car infotainment system. Microsoft text-to-speech supports multiple speaking styles, including chat, newscast, and customer service. These advancements allow drivers to have a more pleasant driving experience. For more information about recent advancements, see the speech-to-text research results, which reached human parity on the Switchboard research benchmark, and the neural text-to-speech results, which are close to human parity.
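To make the speaking styles above more concrete, here is a minimal sketch of how a style might be requested in SSML. It only builds the markup string and does not call the speech service. The function name `build_ssml`, the default voice `en-US-JennyNeural`, and the `STYLES` mapping are assumptions chosen for illustration and do not come from the article; the `mstts:express-as` element and its `style` and `styledegree` attributes reflect the Azure SSML format as we understand it, so check them against the current documentation before relying on them.

```python
from xml.sax.saxutils import escape, quoteattr

# Speaking styles named in the article, mapped to assumed SSML style values.
STYLES = {
    "chat": "chat",
    "newscast": "newscast",
    "customer service": "customerservice",
}


def build_ssml(text, style, voice="en-US-JennyNeural", lang="en-US", degree=None):
    """Return an SSML document that asks a neural voice for a speaking style.

    `voice` is an assumed example voice; `degree` is an optional style
    intensity that is written to the `styledegree` attribute when given.
    """
    if style not in STYLES:
        raise ValueError(f"unknown style: {style!r}")

    attrs = f"style={quoteattr(STYLES[style])}"
    if degree is not None:
        attrs += f" styledegree={quoteattr(str(degree))}"

    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="https://www.w3.org/2001/mstts" '
        f"xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice)}>"
        # escape() protects characters such as & and < in the spoken text.
        f"<mstts:express-as {attrs}>{escape(text)}</mstts:express-as>"
        "</voice></speak>"
    )


if __name__ == "__main__":
    print(build_ssml("Your destination is on the right.", "chat"))
```

The resulting string could then be passed to whichever speech synthesis client an application uses. Keeping the markup builder separate from the network call makes it easy to unit test and to swap styles per prompt.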

Offering global languages

Microsoft helps automakers cover their global business. It recently hit a milestone of 100 languages and now supports 119 languages and variants with 278 voices out of the box. This is aligned with our company vision to empower every person and organization on the planet to achieve more. “100 languages is a good milestone for us to achieve our ambition for everyone to be able to communicate regardless of the language they speak,” said Xuedong Huang, Microsoft Technical Fellow and Azure AI Chief Technology Officer. With more languages and their variants covered, we’re excited to be powering natural and intuitive voice experiences for automakers.

Differentiation with customization

Microsoft empowers automakers to develop a highly realistic branded voice for more natural conversational interfaces using the custom neural voice capability. Based on the neural text-to-speech technology and the multi-lingual multi-speaker universal model, custom neural voice lets you create synthetic voices that are rich in speaking styles or adaptable across languages with as little as 30 minutes of audio. The realistic and natural-sounding voice of custom neural voice can represent brands and specific personas and allow users to interact with applications naturally in a conversational style. Check out this blog for a step-by-step guide on how to create a custom neural voice.

Compliance and responsible AI

Microsoft is committed to investing in meeting regulatory standards around the globe to satisfy automakers’ compliance requirements. The speech service, part of Azure Cognitive Services, is certified by SOC, FedRAMP, PCI DSS, HIPAA, HITECH, and ISO. Backed by Azure infrastructure, the speech service also offers enterprise-grade security, availability, compliance, and manageability.


Microsoft is committed to developing AI technology in a responsible way. We use different technical and policy solutions to safeguard against misuse of the technology. For example, we’re designing and releasing Custom Neural Voice with the intention of protecting the rights of individuals and society, fostering transparent human-computer interaction, and counteracting the proliferation of harmful deepfakes and misleading content. This aligns with Microsoft’s commitment to responsible AI. That commitment includes Transparency Notes, which communicate the purpose, capabilities, and limitations of an AI system.

Learn more

Azure Cognitive Services brings AI within reach. Learn how you can accelerate innovation with breakthrough AI research.


