Pooling Attention: Evaluating Pretrained Transformer Embeddings for Deception Classification
By: Sumit Mamtani, Abhijeet Bhure
Potential Business Impact:
Finds fake news using smart computer language models.
This paper investigates fake news detection as a downstream evaluation of Transformer representations, benchmarking encoder-only and decoder-only pre-trained models (BERT, GPT-2, Transformer-XL) as frozen embedders paired with lightweight classifiers. Through controlled preprocessing experiments comparing pooling versus padding and neural versus linear classification heads, the results demonstrate that contextual self-attention encodings transfer effectively. BERT embeddings combined with logistic regression outperform neural baselines on the LIAR dataset splits, while analyses of sequence length and aggregation reveal robustness to truncation and advantages from simple max or average pooling. This work positions attention-based token encoders as robust, architecture-centric foundations for veracity tasks, isolating the Transformer's contribution from classifier complexity.
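To make the pipeline concrete, the sketch below shows the general pattern the abstract describes: a frozen pre-trained encoder produces token embeddings, a simple pooling step aggregates them into one fixed-size vector per statement, and a logistic regression head performs the classification. This is a minimal illustration, not the authors' released code; the model name, pooling choices, maximum sequence length, and the toy statements are assumptions standing in for the paper's LIAR setup, and it relies on the Hugging Face transformers and scikit-learn libraries.

```python
# Minimal sketch of a frozen-embedder + linear-classifier pipeline
# (illustrative; hyperparameters and data are hypothetical).
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.linear_model import LogisticRegression

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoder = AutoModel.from_pretrained("bert-base-uncased")
encoder.eval()  # frozen embedder: no fine-tuning, no gradient updates

@torch.no_grad()
def embed(texts, pooling="mean", max_length=128):
    """Encode texts and aggregate token vectors into one vector per text."""
    batch = tokenizer(texts, padding=True, truncation=True,
                      max_length=max_length, return_tensors="pt")
    hidden = encoder(**batch).last_hidden_state            # (B, T, H)
    mask = batch["attention_mask"].unsqueeze(-1).float()   # ignore padding
    if pooling == "mean":
        pooled = (hidden * mask).sum(1) / mask.sum(1)
    else:  # max pooling over non-padding positions
        pooled = hidden.masked_fill(mask == 0, float("-inf")).max(1).values
    return pooled.numpy()

# Toy statements standing in for LIAR-style claims (hypothetical labels).
train_texts = ["The senator voted against the bill twice.",
               "Vaccines contain mind-control microchips."]
train_labels = [0, 1]  # 0 = true-leaning, 1 = false-leaning

clf = LogisticRegression(max_iter=1000)
clf.fit(embed(train_texts), train_labels)
print(clf.predict(embed(["The governor raised taxes last year."])))
```

Because the encoder stays frozen, any difference in accuracy between mean and max pooling, or between this linear head and a small neural head, can be attributed to the aggregation and classifier choices rather than to fine-tuning of the Transformer itself.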
Similar Papers
What Signals Really Matter for Misinformation Tasks? Evaluating Fake-News Detection and Virality Prediction under Real-World Constraints
Computation and Language
Spots fake news and predicts how fast it spreads.
The Credibility Transformer
Machine Learning (CS)
Makes computer predictions more accurate and stable.
A Theoretically Grounded Hybrid Ensemble for Reliable Detection of LLM-Generated Text
Computation and Language
Finds fake writing in schoolwork better.