Google's Breakthrough in Voice Interaction Mimics the Human Voice More Realistically

Google's DeepMind has revealed a new speech synthesis generator that will be used to help computer voices, like Siri and Cortana, sound more human.

谷歌旗下的人工智能公司DeepMind近日研制出了一种新型语音合成系统, 该技术可以让如Siri和Cortana这样的计算机合成语音听起来更接近真实人声。

Named WaveNet, the model works with raw audio waveforms to make our robotic assistants sound, err, less robotic.

这项名为WaveNet的技术通过研究原始音频波形，使机器人助手的声音听起来不那么像机器人。

WaveNet doesn't control what the computer is saying; instead, it uses AI to make it sound more like a person, adding breathing noises, emotion and different emphasis to sentences.

WaveNet并不会控制计算机的说话内容，它只会应用人工智能技术在句子中添加呼吸声、情感和各种重音，从而使计算机语音听起来更像真人。

Generating speech with computers is called text-to-speech (TTS), and up until now it has worked by piecing together short pre-recorded syllables and sound fragments to form words.

用计算机合成语音的技术叫做“从文本到语音（TTS）”，现存的工作原理是将提前录制好的短音节和声音碎片合成语言。
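
The fragment-based approach described above can be pictured with a small sketch. This is only an illustration under stated assumptions, not how any real TTS product is built: the `FRAGMENTS` table, the unit names and the `synthesize` function are all hypothetical, and short lists of integers stand in for recorded audio samples.

```python
# Hypothetical fragment database: each speech unit maps to pre-recorded samples.
# Real systems store audio recordings; short integer lists stand in for them here.
FRAGMENTS = {
    "hel": [1, 2, 3],
    "lo": [4, 5],
    "wor": [6, 7, 8],
    "ld": [9],
}


def synthesize(units, database=FRAGMENTS):
    """Build an utterance by joining stored fragments in the order given."""
    waveform = []
    for unit in units:
        if unit not in database:
            # A fragment that was never recorded cannot be produced at all.
            raise KeyError(f"no recording for unit {unit!r}")
        waveform.extend(database[unit])
    return waveform


hello = synthesize(["hel", "lo"])
```

Because the output is just stored recordings placed end to end, the sketch has no way to change pitch or stress; the voice can only be as varied as the fragments in the table.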

As the words are taken from a database of speech fragments, it's very difficult to modify the voice, so adding things like intonation and emphasis is almost impossible.