
Senior Site Reliability Engineer — Observability Engineer | NordVPN
Nord Security
Posted about 7 hours ago
The world’s most advanced VPN, and a whole lot more.
If you’re a curious problem-solver who carves their own path, join the team behind Threat Protection Pro, the NordLynx protocol, and the fastest VPN on the planet—tools that put privacy, security, and control back in people’s hands.
Your impact? Helping millions take back control of their online security, privacy, and data.
NordVPN runs a global edge infrastructure serving millions of users. Knowing what's happening across that infrastructure, in real time, at scale, and without drowning in noise, is the problem this role exists to solve.
We're looking for a Senior Site Reliability Engineer (SRE) focused on observability: designing monitoring systems, improving signal quality, reducing alert fatigue, and collaborating with data teams on anomaly detection. You'll own how we understand the health and behavior of our distributed systems.
Main responsibilities
Design, build, and improve monitoring pipelines and observability tooling across globally distributed infrastructure
Define and implement service-level monitoring based on golden signals (latency, traffic, errors, saturation)
Reduce alert fatigue - build meaningful, actionable alerts that engineers trust
Develop and maintain custom exporters, scripts, and integrations for metrics and log collection
Collaborate with the data team on anomaly detection and data-driven operational insights
Understand service signals - know what to measure, why, and what the numbers actually mean
Core requirements
Distributed systems observability - monitoring architecture, signal design, dashboarding
Golden signal thinking - you design monitoring around what matters, not what's easy to measure
Alert design - reducing noise, building actionable alerts, managing on-call sanity
Python - scripting, custom exporters, automation, data processing
Linux administration and debugging
Networking fundamentals
Bonus Points For
SaltStack
Advanced networking - traffic analysis, protocol-level debugging
Advanced data knowledge - aggregation strategies, downsampling, cardinality management, retention trade-offs
Proven track record of onboarding new systems/services into monitoring from scratch
Familiarity with agentic engineering - Claude Code, LLM integrations, MCP workflows
Tools You Will Use
Naemon (Nagios) and Gearmand
Prometheus-based exporters
Telegraf
Fluent Bit
VictoriaMetrics ecosystem
OpenSearch
Grafana
What We Offer
Innovate with industry leaders
Work alongside global experts to build world-leading cybersecurity tools, impacting millions of users around the world.
Learn & grow
Boost your skills via our extensive training programs (online and offline) & other resources. Benefit from mentorship and career-switch opportunities to grow within the company.
Work in a next-gen Cyber City office
Thrive in our bustling office, featuring ergonomic workspaces, modern meeting rooms, engaging events, and specialty coffee to fuel your day.
Hybrid work
Enjoy the flexibility of 3 office days a week, working from home for the remaining 2.
Work from anywhere
Recharge with a change of scenery – choose to work from any location when you feel the need to power your creativity and drive.
Physical well-being
Boost your health with free-of-charge 24/7 gym access, onsite and online workouts, and consultations led by in-house Physical Well-Being experts.
Mental & emotional health
Nurture your mind with free psychologist consultations, dedicated mental health events, and premium access to top-rated wellness apps like Calm, Headspace, and Mindletic.
Premium healthcare
Receive private health insurance giving you peace of mind for your health needs.
Extra days off
Enjoy additional vacation days as you grow with us. Plus, get extra days off for sick leave, special occasions, or parenting needs.
Joyful moments – special treats
Celebrate life’s big moments with special gifts from us on your birthday, anniversary, and other major events, such as weddings or the arrival of a new family member.
Company events & team-building
Experience iconic Nord Security celebrations, team-buildings, and knowledge-sharing events, nurturing bonds that fuel our success.
Workation
Embark on a legendary company getaway abroad, filled with exciting activities, live concerts, engaging workshops, and epic time together.
Kindly refer to our Privacy Notice for Recruitment Candidates for comprehensive information regarding our data handling procedures throughout recruitment processes.
We expect all candidates to provide accurate and complete information during the recruitment process. While limited use of AI tools to refine application materials is acceptable, candidates remain fully responsible for ensuring that their submissions reflect their own qualifications, skills, and experience. Any failure to do so may negatively affect participation in the recruitment process. If broader AI assistance is allowed for a particular role or stage, we’ll let you know in advance.
By submitting your application, you acknowledge that it may be processed using automated tools for evaluation purposes. As part of our recruitment process, we may use an AI-based application review tool to help assess applications based on skills and experience relevant to the role. This technology is used to support - not replace - human decision-making, and every application is ultimately reviewed by a recruiter.