Summary of OpenAI rolls out Advanced Voice Mode with more voices and a new look | TechCrunch

    ChatGPT's Advanced Voice Mode: A New Era of Conversational AI

    OpenAI has announced the rollout of Advanced Voice Mode (AVM) to a wider audience of ChatGPT users. This feature makes interacting with ChatGPT feel more natural, allowing you to speak with the AI instead of just typing. AVM is initially available for ChatGPT Plus and Teams subscribers, with Enterprise and Edu users gaining access next week.

    • AVM is a key feature that enhances the conversational experience with ChatGPT, making it feel more human-like.
    • It leverages advanced AI technology to enable users to speak to ChatGPT and receive responses via audio.
    • This feature is designed to improve accessibility and ease of use for ChatGPT.

    Enhanced Features: Custom Instructions and Memory

    With AVM, users can now personalize their interactions with ChatGPT through Custom Instructions, which allow them to specify how the AI should respond.

    • Users can provide specific guidelines for ChatGPT's responses, ensuring that it aligns with their preferences.
    • This feature promotes tailored and personalized interactions with ChatGPT.

    Memory functionality enables ChatGPT to retain information from previous conversations, creating a more contextual and engaging experience.

    • ChatGPT can now recall past interactions, improving the flow and coherence of conversations.
    • This feature makes ChatGPT feel more like a conversational partner, remembering details and building upon prior interactions.

    Expanding the Voice Options: 5 New Voices

    The rollout of AVM brings five new voices to ChatGPT: Arbor, Maple, Sol, Spruce, and Vale. These join the four existing voices (Breeze, Juniper, Cove, and Ember), for a total of nine.

    • The inclusion of diverse voices enhances the overall experience, offering users more choice and personalization options.
    • The new voices cater to a wider range of preferences, making ChatGPT more adaptable to individual needs.
    • These new voices are inspired by nature, aligning with the overall goal of making ChatGPT feel more natural and engaging.

    The Absence of Sky: A Legal Dispute

    Notably absent from the new voice lineup is Sky, a voice showcased by OpenAI in its spring update. The voice was removed following a legal threat from Scarlett Johansson, who claimed it sounded too similar to her own voice, drawing parallels to her role in the movie "Her".

    • This incident highlights the ethical considerations and legal complexities surrounding AI-generated voices and their resemblance to real individuals.
    • OpenAI's response demonstrates the company's commitment to addressing legal concerns and ensuring ethical practices.

    A Glimpse into the Future: Video and Screen Sharing

    While AVM is a significant step towards more natural interaction with ChatGPT, OpenAI's spring update also unveiled plans for video and screen sharing capabilities.

    • This feature, powered by GPT-4o, is intended to enable ChatGPT to process visual and audio information simultaneously, opening up new possibilities for multimodal interaction.
    • Users could potentially ask ChatGPT questions about real-time content, such as images or code, making it even more versatile and useful for various tasks.
    • OpenAI has yet to provide a specific timeline for the release of these multimodal capabilities.

    ChatGPT's Evolution: A Comparison with Gemini Live

    ChatGPT's expanding voice options bring it closer to other AI-powered conversational platforms like Google's Gemini Live, which features a similar voice-based interface.

    • Both platforms aim to revolutionize the way we interact with AI, making it feel more natural and intuitive.
    • However, Gemini Live currently offers more voices than ChatGPT.
    • The comparison highlights the ongoing competition and innovation within the field of conversational AI.

    ChatGPT's Journey: From Text to Voice

    ChatGPT's evolution from a text-based AI to a voice-powered conversational agent demonstrates the rapid progress in the field of AI. The advancements in AVM allow for a more immersive and engaging experience, offering users a more human-like interaction with AI.

    • This shift towards voice-based interaction reflects the increasing demand for seamless and natural communication with AI.
    • ChatGPT's voice-powered capabilities position it as a potential game-changer in various industries, including customer service, education, and entertainment.
