Applied Science Internship - Multimodal Foundation Models & Robotics
Microsoft.com
Office
Zürich, Zürich, Switzerland
Internship
Location: Zürich, Switzerland
Contract Type: Internship
Duration: 12-weeks (40hrs/week)
The Spatial AI Lab is part of the Applied Sciences Group, a Microsoft research and development organization dedicated to creating next-generation human-computer interaction technologies leveraging the most recent AI developments and exploring new hardware capabilities and device form-factors. Our team of scientists and engineers has strong expertise in computer vision and multi-modal AI, with a particular focus on spatial and embodied AI.
As part of our growing team, you will work alongside our researchers at the intersection of large-scale generative modeling and embodied AI, with a focus on robotics. You will be an integral part of our team’s mission of building the core intelligence for a new generation of agents, training the multimodal foundation models that empower them to perceive complex environments, reason about tasks, and act seamlessly across both the physical and digital worlds.
This opportunity will allow you to gain invaluable hands-on experience in training embodied foundation models, receive mentorship from leading experts, and contribute to our pioneering research through both advancement of internal capabilities and publications.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
- Contribute to the design and implement novel AI algorithms and models for general-purpose embodied agents;
- Gain hands-on experience optimizing and deploy AI models on robot hardware;
- Contribute developing high-performance machine-learning pipelines and optimize data and learning stacks for scalability, efficiency, and performance.
- Collaborate across Microsoft research and engineering teams to transition cutting-edge research into real-world impact.
- Contribute to research that leads to publications at leading AI and robotics conferences (e.g., CoRL, RSS, NeurIPS, ICML).
Qualifications
Required Qualifications:
- Currently enrolled in a Master's or PhD program in Computer Science, Robotics, Electrical Engineering, or a related technical field.
- Hands-on experience with modern deep learning frameworks (e.g. Pytorch/Tensorflow/Jax).
- Fluent in English
Preferred Qualifications:
Experience in one or more of the following areas:
- Foundation Models: hands-on training experience in at least one of the following topics: LLMs; Large vision-language models (VLMs); Video generative models and diffusion algorithms; or action-based transformers and Vision Language Action models (VLAs).
- Large-Scale ML Systems: Experience with large scale machine learning compute systems.
- Robotics:
- Self-motivated team-player, problem solver, and keen to learn.
- Hands-on training experience in robot learning techniques, such as reinforcement learning, imitation learning as well as classical control methods
- Solid understanding of robot kinematics, dynamics and sensors
- Familiarity with control algorithms such as PID, model predictive control (MPC), and whole-body control.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#Eip
Applied Science Internship - Multimodal Foundation Models & Robotics
Office
Zürich, Zürich, Switzerland
Internship
September 19, 2025