SRE Project Intern (RD) - 2026 Start (BS/MS)

TikTok.com

Office

Dublin, Dublin, Ireland

Internship

About the team:
The team is responsible for infrastructure systems, including Storage, Computing, DB, and Big Data. We aim to be the leading SRE team across the industry. In the SRE team, you will have the opportunity to manage the complex challenges of scale while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of diversity, intellectual curiosity, openness, and problem-solving. We also encourage ownership, self-governance, and independence in working on various projects, and we offer an environment that provides the support and mentorship needed to learn and grow as an engineer.

Our Mission:
Design and build a highly available and reliable online distributed infrastructure system for the company's business, and achieve a balance between stability and cost through efficiency methods and SRE services.

Our Vision:
A team with fully comprehensive capabilities that meets the SRE standard capability model.

As a project intern, you will have the opportunity to engage in impactful short-term projects that provide you with a glimpse of professional real-world experience. You will gain practical skills through on-the-job learning in a fast-paced work environment and develop a deeper understanding of your career interests.

Onboarding Date, Project Duration & Application Deadline:
Applications will be reviewed on a rolling basis - we encourage you to apply early.
Successful candidates must be able to commit to an internship period of at least 3 months.


Responsibilities:
- Be responsible for the foundational engineering of the company's infrastructure products and components, focusing on optimizing the infrastructure O&M (Operations & Maintenance) architecture, researching and developing automated O&M platforms, and data-driven, intelligent O&M. Applying software engineering and data-intelligence methods to the O&M requirements of infrastructure products and components, build a layered, systematic O&M platform that solves the problem of managing ultra-large-scale clusters. The goal is to provide stable, efficient, and low-cost serverless infrastructure for Mid-Platform and business teams. We aim to be the leading SRE team across the industry.
- Reliability: Ensure the stability of the company's core infrastructure (high system availability and reliability), focus on system performance and capacity, and establish O&M standards and SOPs.
- Reliability: Troubleshoot and locate technical issues, and collaborate with the technical team to develop and implement strategies for system capacity planning, performance testing, anomaly analysis, and fault diagnosis and resolution.
- Efficiency: Research and evaluate large-scale system architectures and technologies, use new tools and technologies to improve existing systems and processes to support business development.
- Efficiency: Design and implement O&M platforms to achieve efficient, automated, and intelligent system maintenance.
- Cost: Develop delivery standards for systems at mass-production scale, covering budgeting, resource delivery, and online system capacity assessment, to help the company optimize IT costs.
- Compliance: Design and establish new IDCs, and design and implement data protection plans to meet standard requirements.

October 2, 2025
