Epoch AI is looking for a Researcher to develop and publish critiques and reviews of AI benchmarks.
About the role
We are looking for a Researcher to produce a steady stream of benchmark reviews. You will closely analyze a wide variety of new benchmarks, evaluate their methodologies, and write up your findings in public-facing research. You should be comfortable using coding agents to help you, without delegating your judgment.
Examples of the kind of reports you would produce include our reviews of SWE-bench Verified, OSWorld, and economic value benchmarks.
This role is fully remote; we are able to hire in many countries. We invite anyone who is interested to apply, regardless of background, experience, or credentials. Please do not include a cover letter, photograph, or headshot of yourself, or any personal information that is not relevant to the role for which you're applying (including marital status, age, identity traits, etc.).
If this role sounds interesting, we are also looking for researchers on multiple other teams.
Applications are rolling.
Additional Information
While we welcome applicants from all time zones, we prefer candidates who can overlap with US and UK time zones.
Please submit all of your application materials in English and note that we require professional level English proficiency.
Epoch is committed to building an inclusive, equitable, and supportive community for you to thrive and do your best work. We're committed to finding the best people for our team, so please don't hesitate to apply for a role regardless of your age, gender identity/expression, political identity, personal preferences, physical abilities, veteran status, neurodiversity or any other background. Please email [email protected] if you have any questions about this role, accessibility requests, or if you want to request an extension to the application deadline. However, we will not review applications submitted to this email address; please submit your application through the link on this page.
About Epoch AI
Epoch AI is a research institute that investigates trends in machine learning and the economic consequences of AI. Our mission is to develop a comprehensive, publicly accessible knowledge base on AI that informs policymakers, industry leaders, and society at large.
We strive to achieve both rigor and accessibility to our work, as exemplified by some of our most successful projects, including our database of AI models and our AI trends dashboard. Our body of research includes our work on compute trends (IJCN 2022), data scarcity (ICML 2024), and algorithmic progress (NeurIPS 2024). You can read more about our work and mission on our website and in this Time profile.
Epoch AI is a research institute investigating key trends and questions that will shape the trajectory and governance of Artificial Intelligence.
Key team members

Simon Jarvis

Fosco G. Loregian

Yann Rivière

Benjamin Todd
Jobr aggregates jobs directly from company career portals — no middlemen. Our team applies on your behalf with AI-tailored resumes, reviewed by a human before submission.