
About this role
Company Description
KMS Technology is a strategic engineering company helping businesses turn bold ideas into high-impact solutions—faster. Founded in 2009 as a U.S.-based services company, we’ve grown into a global organization with locations in the US, Vietnam, Mexico and Poland. KMS is trusted globally for the quality of our engineering and consulting services. We bring deep expertise in product development and quality assurance, Data & AI-native engineering, and delivery excellence to every engagement. Our mission is to help customers build what’s next—accelerating innovation, crafting brilliant solutions, and creating real-world impact. At KMS, we believe sustainable growth is built on the success of our clients and employees, and in making a lasting contribution to our communities.
More about KMS Technology:
Website: https://kms-technology.com
Linkedin: https://www.linkedin.com/company/kms-technology
Job Description
About the Role
Build production cloud infrastructure for Fortune 500 clients in healthcare, finance, and manufacturing. Architect Kubernetes platforms running enterprise applications and AI workloads—analytics systems, GPU inference servers, and autonomous deployment pipelines.
The difference: You'll create infrastructure where AI agents deploy code, run automated reviews, and handle complex operations—delivering the 60-90% efficiency gains companies like Spotify already achieve.
Experience required: 5-10+ years DevOps/SRE
What You'll Do
Platform Operations (40%)
- Design and operate Azure Kubernetes clusters for production workloads
- Implement Infrastructure-as-Code with Terraform and Crossplane
- Deploy platforms using GitOps (ArgoCD/Flux)
- Build CI/CD pipelines with AI-powered code reviews and testing
- Manage multi-cluster environments with Azure Arc
Site Reliability Engineering (30%)
- Build observability stack: Prometheus, OpenTelemetry, Jaeger, Loki, Grafana
- Define SLIs, SLOs, error budgets (99.9% uptime target)
- Deploy automated incident response and root cause analysis
- Implement DevSecOps: SBOM generation, policy-as-code, container scanning
- Lead post-incident reviews and preventive measures
FinOps & Cost Engineering (15%)
- Deploy OpenCost/Kubecost for cost attribution
- Build cost dashboards with team showback/chargeback
- Optimize cloud spending (target: 20-30% reduction)
Platform Engineering (15%)
- Build internal developer platforms (Backstage)
- Create golden path templates and self-service tools
- Track DORA metrics and developer productivity
- Develop automation for infrastructure tasks
Qualifications
Must Have:
Kubernetes & Containers
5+ years production Kubernetes experience
Cluster design, RBAC, networking, troubleshooting
Performance tuning and optimization
Infrastructure & Automation
Terraform for infrastructure provisioning
GitOps workflows (ArgoCD or Flux)
CI/CD pipelines (GitHub Actions, Azure DevOps, Jenkins, GitLab CI)
Python OR Go for automation
Bash scripting
Cloud Platforms
Azure (AKS, VNets, Identity) - preferred
OR AWS/GCP experience (we'll train on Azure)
Observability
Prometheus and Grafana (required)
Log aggregation (Loki, ELK, Splunk, or similar)
Distributed tracing concepts
Alerting and on-call experience
Foundation
Strong Linux/Unix administration
Git and code review workflows
English proficiency
Technical documentation skills
Nice-to-Have
Helm, Kustomize, Crossplane
Container security and policy-as-code
Service mesh (Istio, Linkerd)
Chaos engineering
FinOps tools and practices
Certifications: Azure (AZ-104, AZ-305), Kubernetes (CKA, CKS), Terraform
AI Skills – We'll Train You
No AI experience required. We provide comprehensive training:
Claude Code and Cursor for development
AI agent integration in CI/CD
Multi-agent workflow orchestration
Additional Information
Perks You'll Enjoy
- Working in one of the Best Places to Work in Vietnam
- Building large-scale & global software products
- Working & growing with Passionate & Talented Team
- Diverse careers opportunities with Software Outsourcing, Software Product Development, IT Solutions & Consulting
- Attractive Salary and Benefits
- Performance appraisals every year and performance bonus
- Onsite opportunities: short-term and long-term assignments in North American (U.S, Canada), Europe, Asia.
- Flexible working time
- Various training on hot-trend technologies, best practices and soft skills
- Premium healthcare insurance for you and your loved ones
- Company trip, big annual year-end party every year, team building, etc.
- Fitness & sport activities: football, tennis, table-tennis, badminton, yoga, swimming…
- Joining community development activities: 1% Pledge, charity every quarter, blood donation, public seminars, career orientation talks,…
- Free in-house entertainment facilities (foosball, ping pong, gym…), coffee, and snack (instant noodles, cookies, candies…)
And much more, join us and let yourself explore other fantastic things!