Scale AI
Corporate β’ πΊπΈ United States
Big TechPapers (L12M)
9
Researchers (β)
15
Papers w/ Code
2
Papers w/ Dataset
2
Topic Overview
Bubble chart placeholder
Recent Papers (see all )
LLM Novice Uplift on Dual-Use, In Silico Biology Tasks
Code
Artificial Intelligence
VeRO: An Evaluation Harness for Agents to Optimize Agents
Artificial Intelligence
LHAW: Controllable Underspecification for Long-Horizon Tasks
Computation and Language
Agentic Rubrics as Contextual Verifiers for SWE Agents
Code
Machine Learning (CS)
Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction
Code
Sound