Difference between revisions of "Voice imitation algorithms"
Line 8: | Line 8: | ||
China-based [https://en.wikipedia.org/wiki/Technology_company technology company] [https://en.wikipedia.org/wiki/Baidu Baidu] has used [https://en.wikipedia.org/wiki/Artificial_neural_network neural networks] and [https://en.wikipedia.org/wiki/Deep_learning deep learning] to create accurate voice imitations from thousands of collected voice samples.<ref>https://www.technologyreview.com/f/610386/a-new-algorithm-can-mimic-your-voice-with-just-snippets-of-audio/</ref><ref>http://research.baidu.com/Blog/index-view?id=91</ref> | China-based [https://en.wikipedia.org/wiki/Technology_company technology company] [https://en.wikipedia.org/wiki/Baidu Baidu] has used [https://en.wikipedia.org/wiki/Artificial_neural_network neural networks] and [https://en.wikipedia.org/wiki/Deep_learning deep learning] to create accurate voice imitations from thousands of collected voice samples.<ref>https://www.technologyreview.com/f/610386/a-new-algorithm-can-mimic-your-voice-with-just-snippets-of-audio/</ref><ref>http://research.baidu.com/Blog/index-view?id=91</ref> | ||
− | + | ===Research=== | |
+ | [https://en.wikipedia.org/wiki/University_of_Delaware University of Delaware] and [https://en.wikipedia.org/wiki/Nemours_Alfred_I._duPont_Hospital_for_Children Nemours Alfred I. duPont Hospital for Children's] jointly operated Applied Science and Engineering Laboratories (also know as ASEL), has researched and developed the [https://www.asel.udel.edu/speech/ModelTalker.html Model Talker].<ref>https://www.asel.udel.edu/</ref><ref>https://www.asel.udel.edu/speech/ModelTalker.html</ref> A software which is used with [https://en.wikipedia.org/wiki/Augmentative_and_alternative_communication AAC devices] to replicate human speech to assist those with hearing or speech impairments. | ||
== radnom == | == radnom == |
Revision as of 18:37, 13 March 2020
Voice imitation algorithms (also known as Speech synthesis[1]) are a form of Synthetic Media, used to imitate human speech. They achieve this by using machine learning and artificial intelligence techniques[2].
History
Commercial implementation
The Speak and Spell was originally introduced in 1978 by Texas Instruments. It featured a keyboard and a speech synthesizer, which was used to convert words that were typed onto the keyboard into synthesized audio that it played from speakers.
Lyrebird (also known as Lyrebird AI) was a Montreal based company founded in 2017 focused on speech synthesis and voice imitation.[3] In 2019 it was acquired by Descript, an American company focused on audio editing software, specifically tailored towards podcast creators.[4] Lyrebird AI uses artificial intelligence and voice samples to accurately replicate human speech.
China-based technology company Baidu has used neural networks and deep learning to create accurate voice imitations from thousands of collected voice samples.[5][6]
Research
University of Delaware and Nemours Alfred I. duPont Hospital for Children's jointly operated Applied Science and Engineering Laboratories (also know as ASEL), has researched and developed the Model Talker.[7][8] A software which is used with AAC devices to replicate human speech to assist those with hearing or speech impairments.
radnom
radnom
radnom
Examples, Lyrebird AI
- References