Principal Site Reliability Engineer Ι
Kaizen Gaming.com
Office
Athens, Greece
Full Time
We are Kaizen Gaming
Kaizen Gaming, the team powering Betano, is one of the biggest GameTech companies in the world, operating in 19 markets. We always aim to leverage cutting-edge technology, providing the best experience to our millions of customers who trust us for their entertainment.
We are a diverse team of more than 2.700 Kaizeners, from 40+ nationalities spreading across 3 continents.
Our #oneteam is proud to be among the Best Workplaces in Europe and certified Great Place to Work across our offices. Here, there’ll be no average day for you. Ready to Press Play on Potential?
Let's Start With The Role
We are looking for a Principal Site Reliability Engineer I who will be a subject matter expert, mastering specific infrastructure domains to drive technical excellence across the organization. In this capacity, you will act as the primary escalation point for resolving high-complexity problems while proactively defining backlog items to maintain and modernize your areas of responsibility.
As a Principal Site Reliability Engineer, you will:Work closely with our SRE teams, providing technical mentorship to help them optimally refine tasks and improve implementations. Furthermore, you will collaborate with the Principal Engineer 2 to effectively disseminate architectural best practices and engineering standards, ensuring the team remains aligned with the organization's strategic vision.
What You'Ll Bring
- 3-5 years of experience building and maintaining scalable production environments in a Senior SRE, Tech Lead, or Architect capacity.
- Expert-level knowledge in at least 50% of the following tools and domains, with a senior-level understanding of the rest:
- Observability/Monitoring/Logging: Prometheus, Grafana, Graylog, Zabbix, Instana
Brokers: RabbitMQ, Kafka
- Database Infra: Redis, Mongo, CockroachDB
- Networking & Traffic Management: Cloudflare, HAProxy, Varnish, Nginx
- Platform Infrastructure:Kubernetes (k8s), OpenShift, ESXi
GitOps: ArgoCD
- CI/CD: GitLab, RedHat Tower (AWX/Ansible)
- Cloud & IaC:Ansible, Terraform, Azure
- Strong scripting skills in languages such as Bash, PowerShell, Python, or Go.
- Solid programming skills in either Java or .NET.
- Demonstrated ability to utilize AI tools (e.g., AI code assistants, AIOps platforms) to increase productivity, quality, and reliability.
- Ability to resolve complex technical challenges optimally, taking into account time, capacity, and budget constraints.
- Proven ability to work effectively as part of a distributed, international team.
- An easy-going, flexible personality with a genuine eagerness to learn new technologies and work outside of your comfort zone.
- A "people-first" and continuous improvement mentality, always looking for ways to make things better for our teams and our customers.
- A strong understanding of Scrum/Agile methodologies and principles.
Recruitment Privacy Notice
Regarding the data you share with us, you may find and read our recruitment privacy notice here.
We are an equal opportunity employer committed to fostering a diverse and inclusive workplace. We welcome applications from individuals of all backgrounds, regardless of race, gender, religion, sexual orientation,or age.
Principal Site Reliability Engineer Ι
Office
Athens, Greece
Full Time
October 8, 2025