Do AI detectors actually work?

Updated June 10, 2026

Both answers you've heard are wrong. "Detectors are snake oil" — no, the good ones catch most unedited model output. "Detectors are reliable" — also no, they false-flag real writers at rates no court would accept. Here's the actual picture.

What the testing shows

Independent evaluations — academic studies and journalist-run tests alike — consistently find the same shape: top commercial detectors catch a large majority of unedited AI text (often 80–95%+ depending on model and genre), miss more as text gets edited or as new models ship, and false-flag some percentage of human writing, with formal genres and non-native English writers hit hardest. OpenAI shut down its own public AI-text classifier in 2023 over low accuracy — a vendor admission worth remembering.

Accuracy also swings with length (short text is noise), genre (creative prose scores human; technical prose scores robotic) and time (every model release resets the arms race for a while).

The asymmetry that matters

A detector that's "95% accurate" sounds great until it processes a thousand honest essays: dozens of false accusations, each landing on a student who can't prove a negative. That's why Turnitin tells institutions scores should start conversations rather than verdicts, and why Grammarly built process-tracking — assertion is weak evidence in both directions.

If you're on the wrong end of a false flag, our false-positive guide covers what to do.

So how should you treat a score?

  • As a strong signal, never a verdict — in both directions.
  • Trust longer samples more than short ones; trust agreement between detectors more than any single one.
  • If you're being evaluated: verify your own text first, and keep process evidence (drafts, history).
  • If you're evaluating others: published error rates mean a score alone can't carry an accusation.

Where Humanize Studio sits in this

We build on the honest version of this picture: detectors work well enough that you should check your text, and unreliably enough that you should check it yourself rather than trust anyone's promise — including ours. Humanize, verify, iterate; no stored text, no forever-guarantees.

Frequently asked questions

Which AI detector is the most accurate?

Rankings shift with every model release; Copyleaks, Originality.ai, GPTZero and Turnitin all score well in some independent tests and badly in others. The stable finding is that all of them have real error rates.

Can detectors prove I used AI?

No — they output probabilities from statistical patterns, and their false-positive rates are documented. That's an argument you can make if falsely accused, with citations.

Will detectors get good enough to end the arms race?

The math is against certainty: as models write more like people, the distributions overlap more. Watermarking could change the game if universally adopted — see [our watermark explainer](/blog/does-chatgpt-watermark-text).

Humanize it — then verify it

Paste your text, get a rewrite that reads like a person wrote it, and check the AI-probability score yourself before anyone else does. 3-day free trial.