OpenAI Enhances ChatGPT with Voice Conversations and Image Recognition
Users can now engage in voice conversations and utilize image-based queries with ChatGPT, available to Plus and Enterprise users initially
OpenAI is unveiling substantial upgrades to ChatGPT, enabling the chatbot to interact through voice commands and handle image-based queries. These enhancements are being rolled out now, with access initially granted to Plus and Enterprise users, while image-based features will become accessible to others in the future.
To engage in voice conversations with ChatGPT on Android and iOS, users will need to opt in within the ChatGPT app by navigating to Settings and then New Features. Once activated, users can select from five distinct voices by tapping the microphone icon.
OpenAI’s voice conversations are powered by a novel text-to-speech model capable of generating “human-like audio from just text and a few seconds of sample speech.” OpenAI collaborated with professional actors to create the five available voices. Conversely, the Whisper speech recognition system translates spoken words into text.