Site Reliability Engineer - Video Platform - USDS (MTV)
TikTok
Office
Mountain View, California, United States
Full Time
Team Intro
TikTok video system is a world-leading video platform that provides multimedia storage, delivery, transcoding services. As part of the USDS, the Video Platform team is responsible for building the next generation video processing platform which provides excellent experiences for billions of users around the world.
The USDS Video Platform team is seeking an experienced Site Reliability Engineer to help us continue improving TikTok's video system. If you are passionate about ensuring software reliability, love problem-solving, and are prepared for exciting challenges, we would like you on our team.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities
- Responsible for overall reliability of TikTok's video system, including video publishing and distribution.
- Perform lifecycle management of production systems including change management, service deployment, operations and emergency response.
- Monitor the system and respond to incidents to maintain system service level agreement (SLA), review and follow up all production incidents.
- Perform capacity management of compute, storage and network bandwidth resources to ensure system stability and save infrastructure costs.
- Provide strong support during big events to ensure the system is capable of consuming a large volume of Internet traffic.
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global infrastructure.
TikTok video system is a world-leading video platform that provides multimedia storage, delivery, transcoding services. As part of the USDS, the Video Platform team is responsible for building the next generation video processing platform which provides excellent experiences for billions of users around the world.
The USDS Video Platform team is seeking an experienced Site Reliability Engineer to help us continue improving TikTok's video system. If you are passionate about ensuring software reliability, love problem-solving, and are prepared for exciting challenges, we would like you on our team.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities
- Responsible for overall reliability of TikTok's video system, including video publishing and distribution.
- Perform lifecycle management of production systems including change management, service deployment, operations and emergency response.
- Monitor the system and respond to incidents to maintain system service level agreement (SLA), review and follow up all production incidents.
- Perform capacity management of compute, storage and network bandwidth resources to ensure system stability and save infrastructure costs.
- Provide strong support during big events to ensure the system is capable of consuming a large volume of Internet traffic.
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global infrastructure.
Site Reliability Engineer - Video Platform - USDS (MTV)
Office
Mountain View, California, United States
Full Time
August 12, 2025