OpenAI Unveils Voice Engine for Crafting Human-like Speech with AI

Voice Engine is currently powering preset voices in the text-to-speech API along with other applications like ChatGPT Voice and Read Aloud.
OpenAI Unveils Voice Engine for Crafting Human-like Speech with AI
Representative image by freepik.

OpenAI has introduced Voice Engine, a new model designed to generate lifelike speech from text input and a short audio snippet, showcasing its ability to produce emotional and realistic voices.

Voice Engine was initially developed in late 2022 and is currently powering preset voices in the text-to-speech API along with other applications like ChatGPT Voice and Read Aloud. However, OpenAI wants to proceed cautiously towards broader deployment due to concerns about potential misuse. Its goal is to initiate discussions on responsible use and societal adaptation to synthetic voices, drawing insights from small-scale tests to inform future decisions.

The early applications of Voice Engine are diverse. They include assisting non-readers with natural-sounding voices, facilitating content translation for global audiences, improving service delivery in remote areas, aiding non-verbal individuals, and restoring speech for patients with speech conditions. These applications demonstrate the versatility and potential impact of the technology across various domains.

Also Read
Why CDOs Need AI-Powered Data Management to Accelerate AI Readiness in 2024
OpenAI Unveils Voice Engine for Crafting Human-like Speech with AI

Focus on Ethical AI

To ensure the safe usage of Voice Engine, OpenAI collaborates closely with partners who agree to adhere to strict usage policies. These policies prohibit impersonation without consent, mandate explicit consent from original speakers, and require disclosure of AI-generated voices to audiences. OpenAI also implements safety measures such as watermarking and proactive monitoring to mitigate potential risks.

In advocating for societal resilience against challenges posed by advanced generative models, OpenAI proposes several measures. These include phasing out voice-based authentication, exploring policies to protect individuals' voices, educating the public on AI capabilities and limitations, and developing techniques to track the origin of audiovisual content.

Whether or not Voice Engine sees widespread deployment, OpenAI emphasizes the importance of global awareness regarding AI advancements. They are committed to engaging in ongoing discussions with policymakers, researchers, developers, and creatives to address the ethical and societal implications of synthetic voices. 

Through these collaborative efforts, OpenAI aims to ensure that the development and deployment of AI technologies are guided by ethical considerations and promote positive societal outcomes.

Related Stories

No stories found.
CDO Magazine
www.cdomagazine.tech