Research Intern
atla
Office
London
Internship
About Atla
Atla is committed to engineering safe, beneficial AI systems that will have a massive positive impact on the future of humanity. We are a London-based start-up building the most capable AI evaluation models. Become part of our growing world-class team, backed by Y Combinator, Creandum, and the founders of Reddit, Cruise, Rappi, Instacart and more.Role
As Atla’s research intern, you will collaborate with our researchers and obtain deep experience in a growing AI startup. As part of your role, you will:- Conduct cutting-edge machine learning research, contributing to research initiatives that have practical applications in our product development.
- Disseminate your research results through the production of publications, datasets, and code.
Iterative Self ImprovementThis project applies iterative self-improvement to enhance our general-purpose evaluator. This involves using the model’s outputs to refine its training data iteratively, rather than relying on fixed datasets. Prior work [1, 2, 3, 4] demonstrates the effectiveness of this approach, and we aim to extend it to evaluation systems.We will leverage our internal training data, infrastructure, and benchmarks to iteratively refine the evaluator. You will collaborate with engineers to build infrastructure for iteratively generating better and more informative data. Techniques from our research on techmulti-stage synthetic data generation will be incorporated to improve data quality.Key challenges include addressing bias amplification, semantic drift, and maintaining diversity of data to ensure model stability and alignment. This project aims to advance safe iterative training methodologies and deliver a more capable evaluator, with findings targeted for a top-tier conference. The scope can be tailored to your skills and interests.[1] Wang, Y., et al. (2023). SELF-INSTRUCT: Aligning Language Models with Self-Generated Instructions.[2] Yuan, W., et al. (2024). Self-Rewarding Language Models.[3] Wang, T., et al. (2024). Self-Taught Evaluators.[4] Li, X., et al. (2024). MONTESSORI-INSTRUCT: Generate Influential Training Data Tailored for Student Learning.
Agentic EvaluationThis project investigates how to evaluate agentic systems using an LLM-as-a-Judge framework. Agents introduce new challenges due to their ability to reason, plan, and interact with external tools [1,2]. Evaluating their capabilities and safety requires new approaches, with potential directions including:
- Agent-as-a-Judge: Using agentic systems to evaluate other agentic systems, reducing reliance on human judgment and enabling automated, scalable evaluation frameworks [3].
- Task-driven and multi-step evaluation: Moving beyond single-action accuracy to assess long-horizon reasoning, adaptability, and decision-making in dynamic environments [4].
Qualifications
Evidence of exceptional research engineering ability:- Are currently pursuing, or in the process of obtaining, a PhD in Machine Learning, NLP, Artificial Intelligence, or a related discipline. We will also consider exceptional non-PhD candidates.
- Proven track record in empirical research, including designing and executing experiments, and effectively writing up and communicating findings.
- Publications in top AI conferences.
- Aptitude for distilling and applying ideas from complex research papers.
Nice to have
- Previous internship experience at elite AI research labs (OpenAI, DeepMind, Meta, Anthropic, etc.).
- Experience using large-scale distributed training strategies, data annotation and evaluation pipelines, or implementing state of the art ML models.
- Interested in and thoughtful about the impacts of AI technology.
About you
You'll work by and thrive through our core principles:Own the Outcome- Create real value: Every action should deliver tangible, meaningful value for the people who use what we build.
- Drive to completion: Do the second 90%.
- Do fewer things, better: Prioritize focus over breadth.
- Collaborate for excellence: The whole is greater than the sum of its parts.
- Seek truth: Let the best ideas win, no matter where they come from, and let go of ego.
- Argue passionately, then commit fully: Debate fiercely, but once a decision is made, own it like it’s yours.
- Advance AI safety: Every action should contribute towards the safe development of AI.
- Go big or go home: “The people who are crazy enough to think they can change the world are the ones who do.”
Compensation
- Highly competitive
Research Intern
Office
London
Internship
August 8, 2025