In an effort to increase user interaction, OpenAI has added a voice capability to ChatGPT, the large language model-based chatbot that debuted last year. Both the paid and free versions of the app can now use this feature.
How does it work?
It functions much like a standard voice input, say, in Google Search. The primary distinction is that it understands speech more accurately, particularly with regard to different forms of pronunciation.
To use the ChatGPT voice feature, users only need to download the app, find the headphones icon, and tap it to start speaking a prompt.
According to former OpenAI President Greg Brockman, the update is revolutionary because it will make interacting with artificial intelligence more convenient and entertaining. ChatGPT has been redesigned to include a voice capability.
According to the official summary, ChatGPT’s voice function offers five distinct voices that were created in partnership with experienced voice actors.
Because it uses OpenAI’s Whisper speech recognition technology, the function transcribes spoken words into prompts fairly smoothly.
How does the voice feature of ChatGPT differ?
An advanced text-to-speech model developed by OpenAI uses speech samples and text inputs to produce audio that sounds human.
The result improves accessibility for a wide range of user needs while also opening up creative uses.
Thanks to the integration of speech capabilities, ChatGPT can understand spoken commands and respond in a conversational manner, resulting in interactions that are dynamic and natural. This advancement broadens the AI’s ability to help, enhances the user experience in applications, and makes communication easier for people with disabilities.
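For readers curious how the pieces described above fit together, the flow can be sketched as three stages: speech recognition turns audio into a text prompt, the language model produces a text reply, and a text-to-speech model turns that reply into audio. The Python sketch below is only an illustration under stated assumptions. The names `run_voice_turn`, `VoiceTurn`, and the three `fake_*` stand-ins are hypothetical, and nothing here reflects how OpenAI actually implements the feature.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoiceTurn:
    """One spoken exchange: what was heard, what was answered, and the audio reply."""
    transcript: str
    reply_text: str
    reply_audio: bytes


def run_voice_turn(
    audio_in: bytes,
    transcribe: Callable[[bytes], str],       # speech recognition stage (hypothetical hook)
    generate_reply: Callable[[str], str],     # language model stage (hypothetical hook)
    synthesize: Callable[[str, str], bytes],  # text-to-speech stage (hypothetical hook)
    voice: str = "voice-1",                   # placeholder voice name, not a real one
) -> VoiceTurn:
    """Chain the three stages for a single turn of conversation."""
    transcript = transcribe(audio_in).strip()
    if not transcript:
        # Nothing intelligible was heard, so there is no prompt to send on.
        raise ValueError("no speech detected")
    reply_text = generate_reply(transcript)
    reply_audio = synthesize(reply_text, voice)
    return VoiceTurn(transcript, reply_text, reply_audio)


# Stand-ins so the sketch runs without any network access or real models.
def fake_transcribe(audio: bytes) -> str:
    # Pretend the audio bytes are already the spoken words.
    return audio.decode("utf-8")


def fake_reply(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_synthesize(text: str, voice: str) -> bytes:
    # Pretend the tagged text is the synthesized audio.
    return f"[{voice}] {text}".encode("utf-8")


if __name__ == "__main__":
    turn = run_voice_turn(b"what is the weather?", fake_transcribe, fake_reply, fake_synthesize)
    print(turn.transcript)
    print(turn.reply_text)
```

In a real app, each stand-in would be replaced by a call to an actual speech recognition, chat, or text-to-speech service. Keeping the stages as separate functions means any one of them can be swapped without touching the others.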
In an interesting twist, OpenAI’s introduction video for ChatGPT’s new feature was released amid tensions around CEO Sam Altman’s reinstatement and the secret disclosure of board decisions.
What is the final result?
The ChatGPT voice capability marks a notable step toward human-like AI interactions and ushers in a new era of approachable, conversational AI experiences for a wider audience.