Site Reliability Engineer - TikTok Shop

TikTok.com

Office

Singapore, Singapore

Full Time

About the team
TikTok Shop is a content e-commerce business utilising international short video products as carriers. Our aim is to become the preferred choice for users seeking to discover and purchase affordable, high-quality products. We provide users with tailored, vibrant, and efficient consumption experiences while enabling merchants to access robust and dependable platform services in various scenarios, such as live e-commerce and short video content e-commerce. Our vision is to make affordable and high-quality products easily accessible, enhancing the quality of life for all. We are looking for passionate and talented people to join our product and operations team, to build an e-commerce ecosystem that is innovative, secure and intuitive for our users and brands.

This role combines software and systems engineering disciplines to run high-performance, large-scale distributed infrastructure. You will be deeply involved in the development lifecycle of critical software services, collaborating closely with product engineers and applying both software and systems knowledge to ensure that TikTok Shop's services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will also use your software engineering expertise to develop platforms and tools that optimise the operational and engineering efficiency of complex systems at scale, with a particular focus on improving the systems' observability, performance and maintainability.

Responsibilities:
- Provide SRE solutions for the TikTok Shop business that fit actual business scenarios, working through cross-team, cross-timezone, and cross-region collaboration mechanisms.
- Participate in building disaster recovery capabilities for TikTok Shop, offering end-to-end disaster recovery solutions to ensure that services can switch over during extreme failure scenarios.
- Continuously enhance the core capabilities of TikTok Shop SRE in terms of stability, efficiency, cost, and security, and help manage key metrics (including incident recall rate, SLI, MTTD, MTTR, resource utilization, etc.).
- Promote the design and implementation of operation and maintenance tools and platform solutions to improve the infrastructure capabilities of the TikTok Shop platform.
- Participate in on-call duty, respond to performance and availability issues, resolve problems, and minimize downtime.

October 10, 2025