GPU Application Platform Engineer Graduate (Server Research and Development) - 2026 Start (PhD)
TikTok
Office
San Jose, California, United States
Full Time
About the team
The Server Research and Development team is responsible for architecting, designing, and building the best server and storage systems to meet the requirements of high performance, low cost, and ease of operation. By joining this team, you will work with the best engineers and talents in the industry and have broad opportunities to engage with the latest AI application systems and emerging technologies in computing, storage, and silicon validation. You will gain remarkable hardware architecture, development, and validation experience on the most advanced hardware infrastructure at massive scale. We are looking for a self-motivated GPU/AI Application Platform Architect with a focus on giant model system optimization.
We are looking for talented individuals to join our team in 2026. As a graduate, you will get unparalleled opportunities to kickstart your career, pursue bold ideas, and explore limitless growth opportunities. Co-create a future driven by your inspiration with us.
Successful candidates must be able to commit to an onboarding date by the end of 2026.
Applications will be reviewed on a rolling basis. We encourage you to apply as early as possible.
Responsibilities:
- Develop application benchmarks, tools, and performance optimization methods for GPU/AI systems, including giant model training and inference systems such as LLMs.
- Identify system bottlenecks and opportunities through deep, data-driven system-level studies, explore innovative options through SW-HW co-design, and lead them toward implementation to improve training and inference system efficiency.
- Develop a GPU/AI system TCO (total cost of ownership) model based on application benchmarks and performance optimization.
- Work with industry consortiums and open standard committees to investigate emerging standards and technologies, and contribute our research results to the industry.
- Work with our technology partners and suppliers to set up POCs or prototypes to evaluate and test new technologies or architectural designs.
July 2, 2025