Robotics Intern - Large Behavior Models, Learning From Videos (LFV)
Toyota Research Institute
90k - 130k USD/year
Office
Los Altos, CA
Internship
At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we’ve built a world-class team in Automated Driving, Energy & Materials, Human-Centered AI, Human Interactive Driving, Large Behavior Models, and Robotics.
This is a summer 2026 paid 12-week internship opportunity. Please note that this internship will be a hybrid in-office role.
The Internship
As a Research Intern, you will work with a multidisciplinary team proposing, conducting, and transferring pioneering research at the intersection of Computer Vision and Robotics. You will use large amounts of data from different sources and modalities, train large-scale foundation models aimed at solving open problems, work towards publications at top academic venues, and test your ideas in simulators as well as in the real world.
The Team
The Learning From Videos (LFV) team in the Robotics division is looking for research interns for the summer of 2026 in a variety of areas such as Video Generation, World Modeling, 4D Reconstruction, Multi-Modal Foundation Models, Multi-View Geometry, Data Augmentation, and Large Vision Models, with a primary focus on embodied applications. We are aiming to make progress on some of the hardest scientific challenges around the deployment of robots in real-world unstructured environments, by leveraging data from different sources and modalities, and learning transferable priors grounded in the physical properties of the world. Our mission is to develop foundational models capable of understanding how the world works, and in doing so predict possible future states and adapt to new environments and circumstances.
Responsibilities
- Conduct daring research in Computer Vision that solves open problems of high theoretical and practical value, and evaluate solutions on real-world benchmarks and systems, with a focus on robotics.
- Push the boundaries of knowledge and the state-of-the-art in Visual Systems for Robotics.
- Partner with a multidisciplinary team, including other research scientists and engineers across the LFV team, the Robotics division, TRI, Toyota, and our university partners.
- Stay up to date on the state-of-the-art in Machine Learning ideas and software.
- Present results in verbal and written communications at international conferences, internally, and via open-source contributions to the community.
Qualifications
- Currently pursuing a Ph.D. in Machine Learning, Robotics, or related fields.
- Publication or desire to publish at high-impact conferences/journals (e.g., CoRL, ICLR, NeurIPS, CVPR, ICCV, ECCV, ICML, UAI, AISTATS, AAAI, TMLR, RSS, ICRA, IROS, RA-L, etc.) on some of the aforementioned topics.
- Passionate about large-scale challenges in ML and CV grounded in physical systems, especially in the space of robotics.
- Proficiency with one or more coding languages and systems, preferably Python, Unix, and a Deep Learning framework (e.g., PyTorch).
- Ability to collaborate with other researchers and engineers of the LFV team, and, more broadly, the Robotics division to invent and develop interesting research ideas.
- A reliable teammate who loves to think big, go deeper, and deliver with integrity.
The pay range for this position at commencement of employment is expected to be between $45 and $65/hour for California-based roles; however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. Note that TRI offers a generous benefits package including vacation and sick time. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.
Please reference this Candidate Privacy Notice to inform you of the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information.
TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant’s race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws.
It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.
September 19, 2025