Voice imitation algorithms

From SI410
Revision as of 18:37, 13 March 2020 by Igesinsk (Talk | contribs)

Jump to: navigation, search

Voice imitation algorithms (also known as Speech synthesis[1]) are a form of Synthetic Media, used to imitate human speech. They achieve this by using machine learning and artificial intelligence techniques[2].

History

Commercial implementation

The Speak and Spell was originally introduced in 1978 by Texas Instruments. It featured a keyboard and a speech synthesizer, which was used to convert words that were typed onto the keyboard into synthesized audio that it played from speakers.

Lyrebird (also known as Lyrebird AI) was a Montreal based company founded in 2017 focused on speech synthesis and voice imitation.[3] In 2019 it was acquired by Descript, an American company focused on audio editing software, specifically tailored towards podcast creators.[4] Lyrebird AI uses artificial intelligence and voice samples to accurately replicate human speech.

China-based technology company Baidu has used neural networks and deep learning to create accurate voice imitations from thousands of collected voice samples.[5][6]

Research

University of Delaware and Nemours Alfred I. duPont Hospital for Children's jointly operated Applied Science and Engineering Laboratories (also know as ASEL), has researched and developed the Model Talker.[7][8] A software which is used with AAC devices to replicate human speech to assist those with hearing or speech impairments.

radnom

radnom

radnom

Examples, Lyrebird AI

  • References
  • https://thehill.com/opinion/cybersecurity/470826-perception-wont-be-reality-once-ai-can-manipulate-what-we-see
  • https://www.sciencedirect.com/science/article/pii/S0007681319301600?via%3Dihub
  • https://www.wired.com/brandlab/2018/10/lyrebird-uses-ai-find-artificial-voice/
  • https://www.businessinsider.com/groupon-founder-andrew-mason-new-startup-descript-detour-2017-12
  • https://www.technologyreview.com/f/610386/a-new-algorithm-can-mimic-your-voice-with-just-snippets-of-audio/
  • http://research.baidu.com/Blog/index-view?id=91
  • https://www.asel.udel.edu/
  • https://www.asel.udel.edu/speech/ModelTalker.html