Anthropic
Corporate β’ πΊπΈ United States
Big TechPapers (L12M)
4
Researchers (β)
12
Papers w/ Code
3
Papers w/ Dataset
0
Topic Overview
Bubble chart placeholder
Recent Papers (see all )
Abstractive Red-Teaming of Language Model Character
Machine Learning (CS)
Cross-Architecture Model Diffing with Crosscoders: Unsupervised Discovery of Differences Between LLMs
Artificial Intelligence
Chunky Post-Training: Data Driven Failures of Generalization
Code
Machine Learning (CS)
The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?
Code
Artificial Intelligence
Shaping capabilities with token-level data filtering
Machine Learning (CS)