Summary of X uses your posts to train its AI chatbot: How to disable this default setting

  • indianexpress.com
  • Article
  • Summarized Content

    Grok AI: Training on X User Data

    Elon Musk's X, formerly Twitter, has been using user data, including posts, interactions, and conversations, to train Grok AI, a chatbot developed by his xAI company. Grok AI aims to be a rival to popular chatbots like ChatGPT.

    • X users are automatically opted into this data sharing program by default, with a checkbox already marked allowing the use of their data for training Grok.
    • The practice has raised concerns about data privacy and consent, as users were not explicitly informed or given a clear choice.

    Concerns and Pushback

    This data collection practice has been criticized by several X users and is facing scrutiny from data regulators in the UK and EU.

    • The UK's data regulation prohibits companies from using pre-ticked boxes or default consent for such practices. The UK Information Commissioner's Office (ICO) is making inquiries with X.
    • Ireland's Data Protection Commission (DPC) is also investigating the matter and expresses surprise at X's attempt to harvest user data for AI training.

    Data Usage for AI Training

    AI chatbots like Grok or ChatGPT require vast amounts of data to train their models and generate accurate responses to user queries. This data is typically scraped from the internet, including social media platforms.

    • However, this practice has faced pushback from news publishers, artists, and intellectual property holders who allege copyright infringement.
    • X, Meta, and other platforms are navigating the complex landscape of data collection and AI training while attempting to balance user privacy and model improvement.

    Opting Out of Data Sharing

    X users can opt out of having their data used to train Grok AI by following these steps:

    • Go to Settings and Privacy > Privacy and Safety > Grok.
    • Uncheck the Data Sharing box.
    • Delete conversation history.

    Impact on Other Social Media Platforms

    Meta, another tech giant, faced similar criticism for its data usage policies to train its AI virtual assistant, which was recently launched across WhatsApp, Instagram, and Facebook. Initially, Meta planned to use public posts by users without explicit consent, but it later decided to stop this policy in the EU and UK due to regulatory pressure. While Meta continues to use public user data in other markets, users can still opt out.

    • Reddit and Stack Overflow have signed content licensing deals with major AI players, allowing access to user posts for training and fine-tuning large language models (LLMs).
    • These agreements reflect the growing demand for data to train AI models, but also raise concerns about user privacy and consent.

    The Future of AI and Data Privacy

    The debate over data usage for AI training is a complex issue with significant implications for data privacy, intellectual property, and the future of AI development. As AI models become increasingly sophisticated and ubiquitous, finding a balance between innovation and protecting user data remains a crucial challenge for social media platforms and regulators alike.

    • The ongoing scrutiny of X and other platforms highlights the importance of transparency and user control over data usage.
    • Regulators are expected to play a crucial role in shaping data privacy regulations and ensuring ethical AI development.

    Ask anything...

    Sign Up Free to ask questions about anything you want to learn.