A Stylometric Application of Large Language Models
By: Harrison F. Stropkay, Jiayi Chen, Mohammad J. Latifi and more
Potential Business Impact:
Computers can tell who wrote a story.
We show that large language models (LLMs) can be used to distinguish the writings of different authors. Specifically, an individual GPT-2 model, trained from scratch on the works of one author, predicts held-out text from that author more accurately than held-out text from other authors. We suggest that a model trained in this way on one author's works embodies that author's unique writing style. We first demonstrate the approach on books by eight known authors. We then use it to confirm R. P. Thompson's authorship of the well-studied 15th book of the Oz series, originally attributed to L. Frank Baum.
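The decision rule described in the abstract (score held-out text under each author's model and pick the author whose model predicts it best) can be illustrated with a minimal sketch. This is not the paper's implementation: to stay self-contained it swaps GPT-2 for a character-bigram model with add-one smoothing, and the corpora, author labels, and function names (`train_bigram_model`, `cross_entropy`, `attribute`) are all hypothetical placeholders.

```python
import math
from collections import Counter

def train_bigram_model(text):
    """Count character bigrams in one author's text (a stand-in for GPT-2 training)."""
    pair_counts = Counter(zip(text, text[1:]))
    context_counts = Counter(text[:-1])
    return pair_counts, context_counts, set(text)

def cross_entropy(model, text, vocab_size):
    """Average negative log-likelihood (nats per character), add-one smoothed."""
    if len(text) < 2:
        raise ValueError("need at least two characters to score")
    pair_counts, context_counts, _ = model
    total = 0.0
    for prev, nxt in zip(text, text[1:]):
        prob = (pair_counts[(prev, nxt)] + 1) / (context_counts[prev] + vocab_size)
        total -= math.log(prob)
    return total / (len(text) - 1)

def attribute(models, held_out):
    """Return the author whose model gives the lowest loss, plus all losses."""
    # Use one shared vocabulary so every model's smoothing is comparable.
    vocab = set(held_out)
    for _, _, author_vocab in models.values():
        vocab |= author_vocab
    losses = {
        author: cross_entropy(model, held_out, len(vocab))
        for author, model in models.items()
    }
    return min(losses, key=losses.get), losses

# Toy placeholder corpora (assumptions, not data from the paper).
corpora = {
    "author_a": "the cat sat on the mat. the dog sat on the log. " * 10,
    "author_b": "zzz qqq xxz qzx zxq. " * 20,
}
models = {author: train_bigram_model(text) for author, text in corpora.items()}

predicted, losses = attribute(models, "the dog sat on the mat.")
predicted_b, losses_b = attribute(models, "qzx zzz xxz.")
```

The same comparison would apply with one GPT-2 model per author: replace `cross_entropy` with the model's average token-level loss on the held-out passage and keep the arg-min over authors.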