
Senior Production Engineer
Anduril Industries
Posted about 2 hours ago
Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril’s family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and control center. As the world enters an era of strategic competition, Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years.
ABOUT THE TEAM
The SRE team owns reliability and infrastructure for Anduril's cloud deployments. We operate Kubernetes clusters, Terraform infrastructure, and observability platforms across 10+ production environments supporting active defense contracts. When platform services break under real operational load, we're the team that fixes them — often at the code level, not just the config level.
ABOUT THE JOB
We are looking for a Senior Production Engineer to join our team in Costa Mesa, CA (or DC). In this role, you will be responsible for diagnosing and fixing stability vulnerabilities in core platform services that cause cascading failures in multi-tenant cloud deployments. You will write production Go to implement resilience patterns — leader election, circuit breakers, failure domain isolation — directly in service code. This will require deep experience with distributed systems, debugging complex failure modes across service boundaries, and writing production-quality Go. If you are someone who thrives on fixing hard reliability problems in live systems rather than building greenfield, this role is for you.
WHAT YOU'LL DO
- Diagnose and fix stability vulnerabilities in core platform services that cause cascading failures under multi-replica, multi-tenant operation
- Implement resilience patterns (leader election, circuit breakers, failure domain isolation) directly in service code
- Design multi-replica support for services that currently assume single-instance operation
- Collaborate with service owners on contract testing and upgrade validation
- Trace cascading failures across service boundaries and drive them to root-cause fixes
- Contribute to observability platform improvements to support service stability
- Light infrastructure work: Terraform/Kubernetes changes to support service fixes (~20% of time)
REQUIRED QUALIFICATIONS
- Production-quality Go — you'll be modifying core platform services, not writing scripts
- Practical experience with distributed systems: leader election, consensus, replication, failure modes
- Kubernetes — enough to understand how services run (not necessarily cluster administration)
- Debugging complex systems — tracing cascading failures across service boundaries
- 4+ years in SRE, platform engineering, or backend development roles
- Must be a U.S. Person due to required access to U.S. export controlled information or facilities
NICE-TO-HAVE QUALIFICATIONS
- Rust (some platform services use it)
- Experience fixing reliability problems in production services (not just building greenfield)
- Familiarity with gRPC service architectures
- HashiCorp Consul or similar service discovery/mesh
- FedRAMP/IL5 compliance environment experience
- ArgoCD / GitOps workflows
The salary range for this role is an estimate based on a wide range of compensation factors, inclusive of base salary only. Actual salary offer may vary based on (but not limited to) work experience, education and/or training, critical skills, and/or business considerations. Highly competitive equity grants are included in the majority of full time offers; and are considered part of Anduril's total compensation package. Additionally, Anduril offers top-tier benefits for full-time employees, including:
Benefits
At Anduril, we invest in our people. Our comprehensive, competitive benefits package (available at little to no cost to employees) ensures you’re supported in health, recovery, and whatever comes next. For more information, Explore Our Benefits.
Protecting Yourself from Recruitment Scams
Anduril is committed to maintaining the integrity of our Talent acquisition process and the security of our candidates. We've observed a rise in sophisticated phishing and fraudulent schemes where individuals impersonate Anduril representatives, luring job seekers with false interviews or job offers. These scammers often attempt to extract payment or sensitive personal information.
To ensure your safety and help you navigate your job search with confidence, please keep the following critical points in mind:
-
No Financial Requests: Anduril will never solicit payment or demand personal financial details (such as banking information, credit card numbers, or social security numbers) at any stage of our hiring process. Our legitimate recruitment is entirely free for candidates.
- Please always verify communications:
- Direct from Anduril: If you receive an email from one of our recruiters, it will only come from an
@anduril.comaddress. - Via Agency Partner: If contacted by a recruiting agency for an Anduril role, their email will clearly identify their agency. If you suspect any suspicious activity, please verify the agency's authenticity by reaching out to [email protected].
- Direct from Anduril: If you receive an email from one of our recruiters, it will only come from an
-
Exercise Caution with Unsolicited Outreach: If you receive any communication that appears suspicious, contains grammatical errors, or makes unusual requests, do not engage. Always confirm the sender's email domain is @anduril.com before providing any personal information or clicking on links.
-
What to Do If You Suspect Fraud: Should you encounter any questionable or fraudulent outreach claiming to be from Anduril, please report it immediately to [email protected]. Your proactive caution is invaluable in protecting your personal information and upholding the security and trustworthiness of our recruitment efforts.
Data Privacy
To view Anduril's candidate data privacy policy, please visit https://anduril.com/applicant-privacy-notice/.
By submitting your application, you consent to Anduril Industries using a third-party service provider to conduct pre-employment risk, integrity, and due diligence screening and assessing potential risks as part of your application process.
Job details
Workplace
Office
Location
Costa Mesa, California, United States; Washington, District of Columbia, United States
Experience
SE
Salary
191k - 287k USD
per year
Jobr Assistant extension
Get the extension →