In the new paper Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers, a Microsoft research team presents VALL-E, the first language model-based text-to-speech (TTS) system with strong in-context learning. VALL-E achieves state-of-the-art personalized speech synthesis quality via prompting in a zero-shot setting.
Suzhou City-based AI and speech technology company AISpeech announced today it had raised CN¥500 million (US$76 million) in Series D Funding led by Oriza Holdings and China Minsheng Investment Group. Chinese media reports the company is also planning an IPO on a local stock exchange.
The McKinsey Global Institute this month released the report “Notes From the AI Frontier Insights From Hundreds of Use Cases”. The 36-page discussion paper surveys cutting-edge machine learning algorithms, and discusses how they can be integrated or transformed into practical applications across 19 selected industries.
Microsoft researchers in the US and Asia sent a shockwave through the AI community today with their paper Achieving Human Parity on Automatic Chinese to English News Translation, which introduces a neural machine translation system they say equals the performance of human experts in Chinese-to-English translation.