âšī¸ About the Tests
Excess Word Ratio: Flags overused, formalized vocabulary common in LLMs.
Type-Token Ratio: Measures diversity of word use. Lower = more repetition.
Lexical Richness: Ratio of unique words adjusted by length (entropy-like).
Average Sentence Length: AI often favors longer, clause-heavy sentences.
Verb / Adjective Ratio: AI tends to overuse action and descriptive words.
Trigram Repetition: Repeating the same 3-word patterns is typical in AI writing.
Pronoun Ratio: Human text often includes "I", "we", etc. AI text less so.
Inspired by: https://www.science.org/doi/10.1126/sciadv.adt3813