Leaked Meta AI Guidelines Show How the Chatbot Is Trained to Spot Sensitive Content and Avoid Controversy

0
459

Legal Nightmare: Inside the Explosive Feud Between Blake Lively and Justin Baldoni

Chatbot Training Techniques

Meta uses several well-established methods to train and refine its chatbot models. These include:

  • Reinforcement Learning from Human Feedback (RLHF)
    Contractors rate and provide feedback on anonymized user interactions.
  • Persona Simulation
    Workers prompt the AI to adopt fictional personas like:

    • “Wise and mystical wizard”
    • “Hyper-excited music theory student”
  • Emotion-Based Voice Training
    Through projects like Vocal Riff – Speech RLHF, contractors:

    • Record emotionally expressive prompts
    • Use romantic or flirty tones (non-sexual)
    • Employ light profanity to assess voice responsiveness
  • Prompt Sensitivity Filters
    Prompts involving hate, sex, violence, religion, gender, politics, or race are flagged or rejected.
  • Prohibited Impersonations
    Contractors were barred from simulating characters such as Homer Simpson, Tina Fey, or Achilles.

Joe Osborne, a Scale AI spokesperson, defended the practice:

Signup for the USA Herald exclusive Newsletter