ChatGPT Can Now See, Hear and Speak: OpenAI, the brain behind ChatGPT, recently shared some exciting news. They have unveiled a shiny new update, giving ChatGPT the ability to “listen” through voice inputs and “see” through image inputs.
🎙️ What’s New with Voice?
ChatGPT now works just like the voice assistant on your mobile phone! Want to chat with it? Simply press a button, ask your question out loud, and wait for ChatGPT to read the answer back to you.
How to Start Chatting with Your Voice?
- Open the mobile app.
- Click on Settings.
- Choose New Features.
- Select “Join a voice conversation.”
- See that top-right corner of your home screen? Tap there and pick from five amazing sounds. That’s how ChatGPT will sound when it talks to you!
What Makes ChatGPT Sound So Real?
OpenAI used a brand-new text-to-speech model. This model can turn text into audio that sounds just like a real person! How did they do it? They teamed up with real voice actors to create these sounds. And when you talk to ChatGPT, it uses the open-source system called Whisper to understand what you’re saying.
Everyone at OpenAI believes this voice chat feels super natural and friendly. Thanks to the powerful tech behind ChatGPT, called LLM, it gives even better answers. They’ve even made a special model that can sound like a human after hearing just a short sample!
What Else Can This Model Do?
There are some cool plans for this model. One is teaming up with Spotify. Imagine listening to a podcast in another language but still hearing the host’s original voice. Neat, right? But, there’s something to remember. This tech is powerful, so it can be misused, like pretending to be someone famous. Because of that, OpenAI is keeping a close watch and will have rules on how to use it.
📸 What About the Image Input Feature?
Do you know how Google Lens lets you snap photos and fetches information about them? ChatGPT’s image function is somewhat like that. Just capture what sparks your curiosity, upload it, and wait for ChatGPT to dive deep into its vast knowledge and serve you an answer.
Don’t feel like snapping a photo? You can even draw your questions! Plus, if you crave more details or aren’t quite pleased with the first answer, just keep the conversation going. ChatGPT loves to chat!
However, a word of caution. For privacy reasons, if you upload a person’s photo, ChatGPT can’t exactly tell you who that person is.
So, What’s the Big Picture: ChatGPT Can Now See, Hear and Speak
Since its debut in 2022, OpenAI has been on a quest to make ChatGPT even better, while being cautious about potential challenges. They’re trying to strike the right balance between innovation and safety.
The voice feature will be available for both iOS and Android users, while the image feature will be accessible across all platforms.
In a nutshell, as ChatGPT keeps evolving and becomes an even more versatile assistant, striking the right balance between capabilities and safety will be a journey to watch!