
About this role
About the Team
The Global E-commerce SRE team of US Tech and Product works with engineering and product teams to build and run large-scale, globally distributed, observable, fault-tolerant systems. As an SRE, you will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures.
- Support the service level of a critical, revenue generating E-commerce platform as well as related infrastructure and services. This role will focus on service reliability, highly-scalable design and release management in a cloud-native environment.
- Implement SRE practices around incident management, post-mortems
- Provide additional support as this role is part of a team that provides 24/7 support and requires working scheduled shifts, which may include holidays
- Define service level indicators and data-driven objectives to uphold and improve uptime, latency, and system health of a core TikTok production platform.
- Collaborate cross team with engineering and product to ensure that key requirements (such as capacity planning and launch reviews) are performed to enable transparent service delivery to customers.
- Automation geared towards efficiency, scalability and service resiliency
The Global E-commerce SRE team of US Tech and Product works with engineering and product teams to build and run large-scale, globally distributed, observable, fault-tolerant systems. As an SRE, you will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures.
- Support the service level of a critical, revenue generating E-commerce platform as well as related infrastructure and services. This role will focus on service reliability, highly-scalable design and release management in a cloud-native environment.
- Implement SRE practices around incident management, post-mortems
- Provide additional support as this role is part of a team that provides 24/7 support and requires working scheduled shifts, which may include holidays
- Define service level indicators and data-driven objectives to uphold and improve uptime, latency, and system health of a core TikTok production platform.
- Collaborate cross team with engineering and product to ensure that key requirements (such as capacity planning and launch reviews) are performed to enable transparent service delivery to customers.
- Automation geared towards efficiency, scalability and service resiliency