Difference between revisions of "Voice imitation algorithms"

Revision as of 18:37, 13 March 2020

Voice imitation algorithms (also known as Speech synthesis^[1]) are a form of Synthetic Media, used to imitate human speech. They achieve this by using machine learning and artificial intelligence techniques^[2].

History

Commercial implementation

The Speak and Spell was originally introduced in 1978 by Texas Instruments. It featured a keyboard and a speech synthesizer, which was used to convert words that were typed onto the keyboard into synthesized audio that it played from speakers.

Lyrebird (also known as Lyrebird AI) was a Montreal based company founded in 2017 focused on speech synthesis and voice imitation.^[3] In 2019 it was acquired by Descript, an American company focused on audio editing software, specifically tailored towards podcast creators.^[4] Lyrebird AI uses artificial intelligence and voice samples to accurately replicate human speech.

China-based technology company Baidu has used neural networks and deep learning to create accurate voice imitations from thousands of collected voice samples.^[5]^[6]

Research

University of Delaware and Nemours Alfred I. duPont Hospital for Children's jointly operated Applied Science and Engineering Laboratories (also know as ASEL), has researched and developed the Model Talker.^[7]^[8] A software which is used with AAC devices to replicate human speech to assist those with hearing or speech impairments.

radnom

Examples, Lyrebird AI

References

↑ https://thehill.com/opinion/cybersecurity/470826-perception-wont-be-reality-once-ai-can-manipulate-what-we-see

↑ https://www.sciencedirect.com/science/article/pii/S0007681319301600?via%3Dihub

↑ https://www.wired.com/brandlab/2018/10/lyrebird-uses-ai-find-artificial-voice/

↑ https://www.businessinsider.com/groupon-founder-andrew-mason-new-startup-descript-detour-2017-12

↑ https://www.technologyreview.com/f/610386/a-new-algorithm-can-mimic-your-voice-with-just-snippets-of-audio/

↑ http://research.baidu.com/Blog/index-view?id=91

↑ https://www.asel.udel.edu/

↑ https://www.asel.udel.edu/speech/ModelTalker.html

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

@@ Line 8: / Line 8: @@
 China-based [https://en.wikipedia.org/wiki/Technology_company technology company] [https://en.wikipedia.org/wiki/Baidu Baidu] has used [https://en.wikipedia.org/wiki/Artificial_neural_network neural networks] and [https://en.wikipedia.org/wiki/Deep_learning deep learning] to create accurate voice imitations from thousands of collected voice samples.<ref>https://www.technologyreview.com/f/610386/a-new-algorithm-can-mimic-your-voice-with-just-snippets-of-audio/</ref><ref>http://research.baidu.com/Blog/index-view?id=91</ref>
+===Research===
+[https://en.wikipedia.org/wiki/University_of_Delaware University of Delaware] and [https://en.wikipedia.org/wiki/Nemours_Alfred_I._duPont_Hospital_for_Children Nemours Alfred I. duPont Hospital for Children's] jointly operated Applied Science and Engineering Laboratories (also know as ASEL), has researched and developed the [https://www.asel.udel.edu/speech/ModelTalker.html Model Talker].<ref>https://www.asel.udel.edu/</ref><ref>https://www.asel.udel.edu/speech/ModelTalker.html</ref> A software which is used with [https://en.wikipedia.org/wiki/Augmentative_and_alternative_communication AAC devices] to replicate human speech to assist those with hearing or speech impairments.
 == radnom ==

Difference between revisions of "Voice imitation algorithms"

Revision as of 18:37, 13 March 2020

Contents

History

Commercial implementation

Research

radnom

radnom

radnom

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools