Senior Site Reliability Engineer
DataSnipper.com
Remote (ET)
Full Time
We are looking for a skilled and passionate Senior Site Reliability Engineer (SRE) based on the East Coast of the United States to join the Cloud Platform team, which empowers DataSnipper's growth through a secure and scalable enterprise cloud platform.
As a Senior SRE at DataSnipper, you will set the strategic direction for our cloud infrastructure on Microsoft Azure. You will define target-state architectures and roadmaps, lead enterprise-scale landing zone design and governance, and partner with product, SRE, security, and data teams to deliver multi-tenant, multi-region, secure-by-default solutions. You'll standardize patterns, automate with Infrastructure as Code, and guide migrations and modernizations, turning best practices into measurable reliability, security, and cost outcomes.
About DataSnipper:
DataSnipper is the driving force behind an intelligent automation platform that's transforming the world of audit and finance.
Founded in 2017, DataSnipper has skyrocketed to unicorn status, achieving a valuation of $1 billion following a successful funding round led by Index Ventures. With over 500,000 users across 160+ countries and offices in Amsterdam, New York, Kuala Lumpur, Tokyo, and Mexico City, DataSnipper is shaking things up - and we're not stopping there!
What You Will Do:
- Define and own the cloud infrastructure strategy, reference architectures, and platform roadmaps for Azure across compute, networking, identity, data, security, and observability
- Design and implement an enterprise-scale Azure Landing Zone (management groups, subscriptions, RBAC, Azure Policy) and governance for multi-tenant SaaS and regulated customers
- Architect highly available, multi-region solutions leveraging services such as AKS/Container Apps, App Service, Azure DB for PostgreSQL, Redis, Service Bus/Event Grid, Front Door/Traffic Manager, and CDN
- Enable secure private connectivity patterns (Private Link, VNet integration, Azure Firewall/WAF, ExpressRoute/VPN) and champion zero-trust principles with Entra ID and Managed Identity
- Establish platform engineering "golden paths" and reusable accelerators: Terraform modules, environment bootstrapping, and CI/CD templates in GitHub Actions
- Drive well-architected reviews for mission-critical workloads; translate findings into actionable improvements for reliability, security, performance, and cost optimization with measurable SLOs/SLIs
- Implement end-to-end observability using Azure Monitor, Log Analytics, Application Insights, and (where applicable) Prometheus/Grafana; automate proactive detection and post-incident improvement plans
- Partner with Security to implement least-privilege access, PIM, Defender for Cloud, Key Vault, secret rotation, and compliance controls (e.g., SOC 2, ISO 27001)
- Define and validate DR/BCP strategies (RTO/RPO), including zone-redundancy, geo-replication, backups, and failover testing
- Mentor and coach engineering teams; lead architecture reviews, threat modeling, technical workshops, and author clear documentation and reference architectures
- Evaluate and guide adoption of new Azure capabilities; collaborate with partners and vendors to enhance our platform
What You Will Bring:
- 7+ years in cloud architecture or platform engineering, with deep hands-on expertise in Microsoft Azure and experience setting cloud strategy and roadmaps
- Proven track record designing multi-tenant, multi-region SaaS architectures and enterprise-scale Azure Landing Zones with strong governance and policy
- Expertise across Azure services: AKS/Container Apps, App Service, VMSS; VNet/vWAN, Private Link, Azure Firewall, App Gateway/WAF, Front Door; Entra ID (Azure AD), RBAC, Managed Identity, PIM; Storage, Azure SQL DB; Service Bus/Event Grid; Key Vault; Defender for Cloud; Azure Monitor/Log Analytics/App Insights
- Strong DevOps/SRE practices: CI/CD (GitHub Actions), GitOps, blue/green and canary deployments, infrastructure testing, and progressive delivery
- Hands-on with Infrastructure as Code (Terraform and/or Bicep; ARM), policy-as-code, and environment bootstrapping at scale
- Solid grasp of networking and hybrid connectivity (ExpressRoute, VPN), security-by-design, and zero trust
- FinOps mindset with demonstrable cost optimization, tagging/chargeback, budgets/alerts, and rightsizing
- Strong communication and stakeholder management skills; ability to influence across product, SRE, security, and leadership
- Proficiency in scripting/coding (PowerShell and one of Python/C#/Go)
- Nice to have: Azure Solutions Architect Expert (AZ-305), Azure DevOps Engineer Expert (AZ-400), CKA/CKAD; experience in regulated environments (SOC 2, ISO 27001, HIPAA, GDPR); contributions to public docs/reference architectures
What We Offer:
- Excellent salary.
- Flexible paid time off.
- Remote work.
- Comprehensive medical and dental coverage.
- 401k match.
- Paid parental leave.
- Stock participation plan.
- Being part of one of the fastest-growing scale-ups in the world.
- Make an impact by disrupting the finance industry with us.
- A flexible and growing organization with lots of opportunities to learn and develop.
- International working environment, with a team of friendly and driven colleagues.
- Access to OpenUp and Talkspace, mental health and wellness platforms.
Next Steps:
- 30-minute intro call with the Recruiter.
- 45-minute call with the Hiring Manager.
- 1-hour live coding session.
- 1-hour system design session.
- Final interview.
Apply and let's disrupt the auditing world together! 🚀
September 29, 2025