ARTIFICIAL INTELLIGENCE

OpenAI says it’s working on AI that mimics human voices

Apr 1, 2024, 5:00 PM


OpenAI is releasing Voice Engine, which uses a sample audio clip of someone speaking to create an AI-generated version of their voice. (Costfoto/NurPhoto/Getty Images via CNN Newsource)


New York (CNN) — OpenAI has unveiled a new artificial intelligence tool that can mimic human voices with startling accuracy. The AI voice generator has a range of potential applications, including for accessibility services, but could also prompt concerns about misinformation and other forms of abuse.

OpenAI on Friday shared samples from early tests of the tool, called Voice Engine, which uses a 15-second sample of someone speaking to generate a convincing replica of their voice. Users can then provide a paragraph of text and the tool will read it in the AI-generated voice.


There are several AI-generated voice services already available to the public but, as it did with the breakout chatbot ChatGPT, OpenAI has proven particularly adept at garnering widespread adoption of AI tools.

An AI-enabled text-to-voice tool could help with translation, reading assistance for children or aiding people who have lost the ability to speak, the company says. But some skeptics worry it could also fuel the creation of disinformation or make it easier to perpetrate scams.

OpenAI says Voice Engine is currently being used by only a “small group of trusted partners,” including education and health technology companies, and it will use their tests to determine whether and how to allow more widespread use. Those testers have agreed not to recreate people’s voices without their explicit consent and to clearly identify to listeners that what they’re hearing is AI-generated, according to the company.

“We recognize that generating speech that resembles people’s voices has serious risks, which are especially top of mind in an election year,” OpenAI said in a blog post. Although it doesn’t plan to release Voice Engine to the public immediately, the company acknowledged the need for major changes as AI-generated audio becomes more widely available. For example, it suggested phasing out voice-based authentication for bank accounts.


“Any broad deployment of synthetic voice technology should be accompanied by voice authentication experiences that verify that the original speaker is knowingly adding their voice to the service and a no-go voice list that detects and prevents the creation of voices that are too similar to prominent figures,” OpenAI said.

Voice Engine can use a voice sample in one language to create a replica voice that can speak in multiple other languages.

OpenAI’s blog post includes an example of an audio clip of a human reading a passage about friendship, alongside AI-generated audio that sounds like the same person reading the same passage in Spanish, Mandarin, German, French and Japanese. In each of the AI-generated samples, the tone and accent of the original speaker are maintained.

The preview of Voice Engine comes as users await the public release of Sora, the AI-generated video tool that OpenAI teased last month. Sora can create realistic-looking 60-second videos from text instructions, with the ability to serve up scenes with multiple characters, specific types of motion and elaborate background details. OpenAI’s ChatGPT can also generate images from a text prompt.

Separately, OpenAI also announced on Monday it is making ChatGPT available to anyone without the need to sign up to use the service.

The company noted it may use any text that’s loaded into ChatGPT to improve its models but said this can be turned off through settings even without an account. Without an account, however, users will not be able to save or review chat history or access various features, including voice conversations and custom instructions.

–CNN’s Samantha Kelly contributed to this report.

The-CNN-Wire
™ & © 2024 Cable News Network, Inc., a Warner Bros. Discovery Company. All rights reserved.
