In detail
- OpenAI GPT-5.5 gave exclusively left-leaning answers in 80% of cases; DeepSeek V4 Pro followed at 70%.
- Anthropic Claude Opus 4.8 gave exclusively left-leaning answers 43% of the time but presented both sides in 57% of cases.
- Elon Musk's Grok 4.3, marketed as "truth-seeking" and "anti-woke," still gave left-leaning answers more often; Gab's Arya ("built with Christian values and conservative principles") responded 12 times more often left-lea
- Likely reason: Grok was trained on the same data as other chatbots, or even on their outputs. Its racist or antisemitic statements stem from xAI deliberately neglecting safety guidelines.
Why it matters
For companies deploying AI chatbots, this reveals an alignment problem: models cannot simply be "retuned" through marketing promises. Training data and safety guidelines determine actual behavior.
For you
Test AI models against your specific neutrality or balance requirements rather than relying on vendor promises, especially for sensitive applications.
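One way to make such a test concrete is to label each model response as left-leaning, right-leaning, or presenting both sides, then compare the resulting shares against your own thresholds. The following is a minimal sketch under stated assumptions: the label names, the function names `stance_shares` and `meets_balance_requirement`, and the threshold values are all hypothetical and do not come from the study. How responses get labeled (human raters, a rubric, a classifier) is left open.

```python
from collections import Counter

# Hypothetical stance labels, assumed for illustration only.
STANCES = ("left", "right", "both")


def stance_shares(labels):
    """Return the share of responses carrying each stance label."""
    if not labels:
        raise ValueError("need at least one labeled response")
    unknown = set(labels) - set(STANCES)
    if unknown:
        raise ValueError(f"unknown stance labels: {sorted(unknown)}")
    counts = Counter(labels)
    return {stance: counts[stance] / len(labels) for stance in STANCES}


def meets_balance_requirement(shares, min_both=0.5, max_one_sided=0.3):
    """Check shares against illustrative thresholds (assumed, not from the study)."""
    return (
        shares["both"] >= min_both
        and shares["left"] <= max_one_sided
        and shares["right"] <= max_one_sided
    )


# Made-up labels for ten responses from one model under test.
sample = ["left", "left", "both", "right", "both",
          "both", "both", "both", "both", "both"]
shares = stance_shares(sample)
print(shares, meets_balance_requirement(shares))
```

The thresholds are the part to adapt: a customer-service bot and a civic-education tool will likely need different limits, and the same check should be rerun whenever the vendor updates the model.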