
Senior Site Reliability Engineer
SecurityScorecard
Posted about 12 hours ago
About SecurityScorecard:
SecurityScorecard is the global leader in cybersecurity ratings, with over 12 million companies continuously rated, operating in 64 countries. Founded in 2013 by security and risk experts Dr. Alex Yampolskiy and Sam Kassoumeh and funded by world-class investors, SecurityScorecard’s patented rating technology is used by over 25,000 organizations for self-monitoring, third-party risk management, board reporting, and cyber insurance underwriting; making all organizations more resilient by allowing them to easily find and fix cybersecurity risks across their digital footprint.
Headquartered in New York City, our culture has been recognized by Inc Magazine as a "Best Workplace,” by Crain’s NY as a "Best Places to Work in NYC," and as one of the 10 hottest SaaS startups in New York for two years in a row. Most recently, SecurityScorecard was named to Fast Company’s annual list of the World’s Most Innovative Companies for 2023 and to the Achievers 50 Most Engaged Workplaces in 2023 award recognizing “forward-thinking employers for their unwavering commitment to employee engagement.” SecurityScorecard is proud to be funded by world-class investors including Silver Lake Waterman, Moody’s, Sequoia Capital, GV and Riverwood Capital.
About the Team:
As a Senior Site Reliability Engineer, you will be a key technical leader driving the design and optimization of our Kubernetes-based infrastructure and CI/CD systems. You will also own the infrastructure behind our AI tooling — building MCP servers and defining safe, auditable AI access patterns for production systems. You'll work hands-on with engineering teams to accelerate delivery, ensure production reliability, and embed best practices for automation, observability, and resilience.
About the Role:
- Design, build, and scale Kubernetes infrastructure for secure, multi-tenant, high-availability applications.
- Build and operate AI tooling infrastructure — stand up MCP servers and establish secure, governed AI access and guardrails for production systems.
- Optimize and maintain CI/CD pipelines, improving reliability, speed, and rollback safety.
- Implement progressive delivery strategies such as blue/green and canary deployments.
- Advance Infrastructure as Code with Terraform, Helm, and Argo CD, defining reusable patterns for the org.
- Operate and optimize streaming and analytics infrastructure: Kafka, Flink, and ClickHouse.
- Build automated testing into the CI/CD lifecycle.
- Improve system observability — define SLOs, alerts, and dashboards.
- Lead incident response and postmortems, focusing on root cause and durable fixes.
- Mentor engineers across teams on Kubernetes, CI/CD, and cloud infrastructure.
Required Qualifications:
- 6+ years in SRE, DevOps, or Infrastructure roles, with significant production Kubernetes experience.
- Hands-on experience integrating AI/LLM tooling into engineering or operational workflows (e.g., MCP servers, AI agents acting on infrastructure), and a clear grasp of the security and governance considerations of giving AI access to production.
- Proven success building CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI, or similar).
- Strong with Kubernetes internals and managed services like EKS, GKE, or AKS.
- Expertise with Infrastructure as Code (Terraform, Helm, Pulumi) and GitOps.
- Proficient in Python, Bash, or Go.
- Knowledge of observability tooling (Prometheus, Grafana, Datadog, OpenTelemetry).
- Production experience with Kafka, Flink, and ClickHouse.
- Strong communication and cross-team collaboration skills.
Preferred Qualifications:
- Multi-region or multi-cluster Kubernetes experience.
- Chaos engineering or resilience testing.
- Security scanning, compliance automation, or policy-as-code.
- LLM observability/tracing tooling (Langsmith, Langfuse) or MLOps workflows.
- Contributions to open-source Kubernetes or CI/CD projects.
Benefits:
Specific to each country, we offer a competitive salary, stock options, Health benefits, and unlimited PTO, parental leave, tuition reimbursements, and much more!
The estimated total compensation range for this position is $152,000 - X195,000 (base plus bonus). Actual compensation for the position is based on a variety of factors, including, but not limited to affordability, skills, qualifications and experience, and may vary from the range. In addition to base salary, employees may also be eligible for annual performance-based incentive compensation awards and equity, among other company benefits.
SecurityScorecard is committed to Equal Employment Opportunity and embraces diversity. We believe that our team is strengthened through hiring and retaining employees with diverse backgrounds, skill sets, ideas, and perspectives. We make hiring decisions based on merit and do not discriminate based on race, color, religion, national origin, sex or gender (including pregnancy) gender identity or expression (including transgender status), sexual orientation, age, marital, veteran, disability status or any other protected category in accordance with applicable law.
We also consider qualified applicants regardless of criminal histories, in accordance with applicable law. We are committed to providing reasonable accommodations for qualified individuals with disabilities in our job application procedures. If you need assistance or accommodation due to a disability, please contact [email protected].
Any information you submit to SecurityScorecard as part of your application will be processed in accordance with the Company’s privacy policy and applicable law.
SecurityScorecard does not accept unsolicited resumes from employment agencies. Please note that we do not provide immigration sponsorship for this position. #LI-DNI



