In detail
- Test used ten Guild articles from 2020–2022, written before generative AI went mainstream.
- Pangram and Grammarly: 100% accuracy on human texts.
- Sidekicker: flagged all articles as mostly AI-generated, two scoring 100% AI.
- Paradox: professionally written human texts share statistical patterns with AI output because LLMs were trained on exactly that kind of writing.
Why it matters
AI detectors are unreliable and can cost authors contracts and reputations. For publishers and content platforms, this is a critical problem because false positives are expensive.
For you
Never rely on a single detector; demand transparency from vendors about their methods and always give authors a chance to defend themselves.