Voodoo logo

Lead SRE - BeReal

Posted 4 days ago

RemoteParisSE

About BeReal

At BeReal, we are dedicated to authenticity in social media. By encouraging users to share unfiltered moments, we foster genuine connections and celebrate real life. We are now an international team of 100+ and have 40M+ monthly active users. Backed by Voodoo, our team is fully focused on scaling BeReal into an iconic social network used by hundreds of millions.

The Infrastructure team provides the backbone that powers the company’s growth, ensuring the scalability, efficiency, and reliability of our platform. We design and operate our infrastructure on GCP. Working hand in hand with developers, we enable teams to ship fast and efficiently while maintaining a strong focus on costs and performance. Our mission is to create a developer-friendly, cost-effective, and highly automated infrastructure that supports innovation at scale.

Role

  • Define and drive SRE practices across the organization, including SLIs, SLOs, error budgets, incident management, postmortem processes, and long-term reliability improvements across the platform

  • Design, implement, and optimize infrastructure for availability, scalability, reliability, and cost efficiency

  • Own and evolve our observability stack, improving monitoring, alerting, logging, and distributed tracing

  • Drive automation of infrastructure and operational workflows (e.g., Terraform, Terragrunt, Kubernetes)

  • Lead FinOps initiatives, developing tools and insights to optimize cloud costs

  • Partner closely with development squads to improve service reliability, performance, and operational excellence

  • Influence architectural decisions and establish best practices for building resilient distributed systems

  • Mentor and support Infrastructure engineers, helping raise the bar on reliability, operational excellence, and technical execution

  • Analyze performance bottlenecks and work on solutions such as scaling strategies, service optimizations, and system debugging

Profile

  • Strong knowledge of Kubernetes

  • Experience with high traffic, distributed systems architectures, and related tools (service discovery, config/secret management, etc.)

  • Strong knowledge of one Cloud provider (AWS or GCP preferred)

  • Proven experience defining and operating SRE practices (SLOs, incident management, observability, reliability engineering)

  • Strong operational mindset with experience managing production incidents and driving reliability improvements

  • Leadership and mentoring experience, with the ability to influence technical decisions across teams

  • Ownership-driven – If something isn’t working, you don’t wait for instructions; you improve it

  • Pragmatic and impact-oriented – You balance reliability, delivery speed, and business priorities

  • Performance vs cost-conscious – You make decisions that align with both technical excellence and financial sustainability

Our Stack

  • Operator: Kubernetes

  • CI/CD: Argocd, Github actions

  • Cloud provider: GCP

  • Monitoring: Datadog

  • Infra as code: Terraform / Terragrunt

  • Languages: golang / node

  • Datastores: Spanner / PostgreSQL / Redis

Benefits

  • Competitive salary based on experience

  • Swile Lunch voucher

  • Gymlib (100% covered by Voodoo)

  • Premium healthcare coverage with SideCare, 100% covered for you and your family

  • Wellness activities in our Paris office

Job details
Workplace
Remote
Location
Paris
Experience
SE

We entertain the world with iconic apps and games.

Key team members

Maxime Montasheri

Maxime Montasheri

Guillaume Portes

Guillaume Portes

Michel Savariradjalou

Michel Savariradjalou

Luca Bilotta

Luca Bilotta

Apply smarter with Jobr

Jobr aggregates jobs directly from company career portals — no middlemen. Our team applies on your behalf with AI-tailored resumes, reviewed by a human before submission.

Direct from company career pages
AI-personalised cover letters
Human review before every submit
Application tracking & follow-ups