These features will allow users to engage in voice conversations with ChatGPT and show it images to provide context and information.
Voice capabilities will be available on iOS and Android, while image capabilities will be accessible on all platforms.
Users can use voice prompts to activate ChatGPT and hold voice-based conversations with it.
The voice feature utilizes a text-to-speech model and incorporates professional voice actors to generate human-like audio.
Image understanding is powered by multimodal GPT models, enabling ChatGPT to analyze a wide range of images, including photographs, screenshots, and documents containing text and images.
These enhancements aim to provide a more intuitive and interactive experience with ChatGPT.