Manager - Site Reliability Engineering

Fivetran.com

Office

Novi Sad, South Bačka, Serbia, EMEA

Full Time

From Fivetran’s founding until now, our mission has remained the same: to make access to data as simple and reliable as electricity. With Fivetran, customer data arrives in their warehouses, canonical and ready to query, with no engineering or maintenance required. We’re proud that more organizations continue to leverage our technology every day to become truly data-driven.

About The Role

Fivetran is building data pipelines to power the modern data stack for thousands of companies.

As a Manager of Site Reliability Engineering, you will take on the responsibility for the Serbia-based group of SRE Engineers. Together with other SRE managers and engineers in Ireland, India, and the US, you will take ownership of the reliability of Fivetran’s service, including building and monitoring repeatable infrastructure, reliability, and robustness of the continuously deployed release pipeline, as well as timely and effective incident response and resolution. You will co-own the responsibility for the scalability and reliability of Fivetran’s connector infrastructure on AWS, GCP, and Azure. You will bring together and grow a Serbia-based team that reliably delivers excellent results while maintaining a culture of strong collaboration, engagement, and continuous improvement.

This is a full-time position based out of our Novi Sad office. Our hybrid work model offers a blend of remote flexibility and in-person collaboration, including two days in the office each week to connect and build as a team.

Technologies You’Ll Use

Cloud Providers: AWS, Azure, Google Cloud Platform (GCP)
Kubernetes: EKS, AKS, GKE (managed services)
Cloud Providers: AWS, Azure, Google Cloud Platform (GCP)
Kubernetes: EKS, AKS, GKE (managed services)
CI/CD: Buildkite, ArgoCD
Databases: PostgreSQL, Cloud Datastore
Programming Languages: Go, Java
Scripting: Python, Shell
Infrastructure as Code (IaC): Terraform, Pulumi
API Frameworks: FastAPI (RESTful APIs)

CI/CD: Buildkite, ArgoCD

Databases: PostgreSQL, Cloud Datastore
Programming Languages: Go, Java

Scripting: Python, Shell

Infrastructure as Code (IaC): Terraform, Pulumi
API Frameworks: FastAPI (RESTful APIs)
Cloud Networking: PrivateLinks (AWS, Azure), Private Service Connect (GCP), site-to-site VPNs across major cloud providers
Monitoring & Observability: Grafana

What You’Ll Do

Leadership and Talent Management

Build, hire, and plan the growth of the Serbia-based SRE organization
Help engineers advance in their careers; Actively guide and coach them
Set clear expectations and create a positive work environment based on accountability
Establish strong global and cross-team relationships with product, field, software teams, and the other SRE teams around the world

SRE Subject Matter Expertise

Drive initiatives that improve service reliability, scalability, and performance through automation, observability, and proactive problem-solving
Advocate for simple, elegant, and easily scalable system design
Support new services before they go live through activities such as system design consulting/review, capacity planning, and launch reviews
Ability to be hands-on and willing to act as player-coach in SRE areas such as IaC, Observability & Alerting, and Release Management
Demonstrate strong accountability for infrastructure cost management
Optimize our continuous integration and deployment process, striving for safe, frequent, and automated releases
Oversee incident management practices, ensuring timely response, effective/blameless postmortems, and systemic improvements
Stay current with emerging technologies, tools, and industry best practices relevant to reliability engineering
Skills We’re Looking For
Demonstrate strong accountability for infrastructure cost management
Optimize our continuous integration and deployment process, striving for safe, frequent, and automated releases
Oversee incident management practices, ensuring timely response, effective/blameless postmortems, and systemic improvements
Stay current with emerging technologies, tools, and industry best practices relevant to reliability engineering
Skills We’re Looking For
Experience in managing or leading a Site Reliability Engineering (SRE), DevOps, or Infrastructure Engineering team operating in a public cloud at scale
Demonstrate significant working knowledge of Continuous Integration and Deployment processes and tooling
Proven experience in cloud-based infrastructure design and IaC
Strong understanding and experience in security control design, implementation, and operations
Solid technical working experience on AWS, GCP, or Azure, distributed systems, networking, and container orchestration(Kubernetes)
Deep understanding of reliability concepts, including monitoring/observability, capacity planning, and disaster recovery
Experience leading incident response, root cause analysis, and reliability-focused postmortems.
Familiarity with cost optimization strategies in large-scale cloud environments
Excellent leadership, communication, and stakeholder management skills
Ability to iterate in the context of an evolving service environment
Experience in managing changes and getting buy-in from the organization
A passion for SRE/DevOps and running highly resilient/automated systems

(Optional) Bonus Skills

Knowledge of compliance and security practices in production environments (SOC2, ISO27001, etc.).
Experience with multi-cloud support
Hands-on coding/scripting experience in languages such as Python, Go, or Java.

#Li-Hyrbid #Li-Im1

Perks And Benefits

100% employer-paid medical insurance*
Generous paid time-off policy (PTO), plus paid sick time, inclusive parental leave policy, holidays, and volunteer days off
RSU stock grants*
Professional development and training opportunities
Company virtual happy hours, free food, and fun team-building activities
Monthly cell phone stipend
Access to an innovative mental health support platform that offers personalized care and resources in areas such as: therapy, coaching, and self-guided mindfulness exercises for all covered employees and their covered dependents.

*May vary by country and worker type - please reach out to your recruiter for more information

Click here to learn more about Fivetran's Benefits by Region.

We’re honored to be valued at over $5.6 billion, but more importantly, we’re proud of our core values of Get Stuck In, Do the Right Thing, and One Team, One Dream. Read about us in Forbes.

Fivetran brings together high-quality talent across the globe to make data access as easy and reliable as electricity for our customers. We value and recognize that our customers benefit from having innovative teams made of people from many backgrounds, experiences, and identities. Fivetran promotes diversity, equity, inclusion & belonging through attracting, recruiting, developing, and retaining a diverse workforce, not only because it is the right thing to do, but because it helps us build a world-class company to better serve our customers, our people and our communities.

To learn more about Fivetran’s culture and what it’s like to be part of the team, click here and enjoy our video.

To learn more about our candidate privacy policy, you can read our statement here.

We are committed to ensuring that all candidates have an equal opportunity to participate in our interview process. If you require accommodations at any stage of the process due to a disability, medical condition, or any other circumstance, please don't hesitate to submit your request by filling out this form. We will work with you to provide reasonable accommodations to facilitate your participation and ensure a fair and accessible interview experience. Your request and any information provided will be kept confidential and will not impact your candidacy. We look forward to hearing from you and accommodating your needs to the best of our ability.