
Senior Site Reliability Engineer (Hybrid)
Rewards Network
Posted about 6 hours ago
About Rewards Network
For 41 years, Rewards Network has been helping restaurants grow revenue, increase traffic, and boost customer engagement through innovative financial, marketing services, and premier dining rewards programs. By offering unique card-linked offers, we introduce diners to fantastic restaurant experiences, leveraging advanced technology and data analytics to deliver value to restaurants, diners, and our strategic partners' loyalty programs.
Our Culture
At Rewards Network, you'll be part of a driven and diverse team that excels in collaboration, issue resolution, and taking ownership of both personal growth and the company's success. We take pride in partnering with the world's most powerful loyalty programs to drive full-price paying customers to local restaurants through marketing services and flexible funding options. Our engaging and rewarding environment is designed to help you gain your full potential.
Job Overview
The Site Reliability Engineer supports deployments, cloud infrastructure, and monitoring systems that power Rewards Network's applications and services. This role exists to ensure reliable, secure, and scalable operations across our Kubernetes clusters, AWS environments, and observability platforms.
We are hiring an experienced Site Reliability Engineer to be focused on supporting our engineering teams with deployments, troubleshooting, and infrastructure improvements. You'll be joining a small, senior SRE team with broad ownership of the platforms and infrastructure that power everything Rewards Network runs on. This position is well-suited for someone with strong hands-on experience who can quickly get up to speed and begin making meaningful contributions.
We’re open to hiring at the mid to senior level based on experience.
This is a hybrid position that requires in office presence 3 days a week (Tuesday-Thursday) in Chicago.
What you’ll bring to the table: (Responsibilities)
• Support and improve deployment pipelines to production and staging environments, with a focus on reliability, consistency, and reducing toil.
• Troubleshoot and resolve Kubernetes cluster and application-level issues, Docker containers, and Linux-based environments to support applications and services.
• Leverage Grafana, Prometheus, and Elasticsearch to monitor, diagnose, and improve system health.
• Build and improve internal tooling and automation to improve developer experience.
• Partner with development teams to address infrastructure and deployment needs, both planned and ad hoc.
• Maintain and improve AWS infrastructure using Terraform and Atlantis.
• Manage secrets and security operations with HashiCorp Vault.
• Participate in an on-call rotation to support production systems and incident response.
• Collaborate across teams to improve system observability, resilience, and automation.
• Document processes and contribute to knowledge sharing to improve engineering efficiency.
Do you have the right mix of ingredients:
Core Skills
- Kubernetes administration and troubleshooting.
- Infrastructure as code using Terraform or similar tools (we use Terraform with Atlantis).
- AWS services (EC2, S3, IAM, RDS, etc.).
- Monitoring and observability tools (Grafana, Prometheus, Elasticsearch).
- Secrets management with HashiCorp Vault or similar tools.
- Linux system administration and Docker containerization.
- Proficiency in at least one non-shell programming language for building tooling and automation (we use Go).
- CI/CD pipeline management and deployment automation (we use GitLab CI and TeamCity).
Additional Skills
- Familiarity with Kafka and Logstash.
- Experience with incident response and operational support best practices.
- Ability to balance ad hoc support requests with project priorities.
- Strong communication skills to work effectively across technical and non-technical teams.
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
- 5+ years of experience in Site Reliability Engineering, DevOps, or related infrastructure roles.
What you’ll love about us:
Comprehensive benefits package, which includes:
- This is a full-time, exempt position. The base salary range for this role in Chicago is $145,000–$175,000 annually, depending on level (mid-level or senior), as well as candidate experience, skills, and other factors. This role is also eligible for an annual bonus target of 10%, bringing total target compensation to $159,500-$192,500.
- Competitive Time Off Benefits: including flexible PTO, 11 company holidays, and parental leave.
- Generous dining reimbursement when you dine with our restaurant clients
- 401(k) plan with a company match
- Two medical plan options- Standard PPO or High Deductible Health Plan (HSA with company match for HDHP participants)
- Partnership with Rx n Go, offering certain prescriptions for free
- Two dental plan options and a vision plan
- Flexible Spending Accounts and a pre-tax commuter benefit program
- Accident, Critical Illness, and Hospital Indemnity Insurance Plans
- Short Term and Long Term disability
- Company-paid life insurance and AD&D insurance, supplemental employee, spouse, and child life insurance
- Employee Life Assistance Program
- Hybrid working environment in a new office space downtown near the Metra Train stations and catered lunches on Tuesdays.
Rewards Network is an Equal Opportunity Employer (EOE). We encourage and strongly support workplace diversity.



