Site Reliability Engineer II
Microsoft.com
101k - 215k USD/year
Office
Redmond, Washington, United States
Full Time
As a Site Reliability Engineer II, you will help design, build, and run distributed services at global scale. You’ll use your software engineering skills to eliminate toil, improve system resiliency, and deliver meaningful telemetry. This opportunity will allow you to accelerate your career growth, learn how to operate complex cloud services at scale, and develop deep expertise in modern reliability engineering practices.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
- Participate in design and code reviews to ensure services are reliable, scalable, and secure.
- Operate services through on-call rotations, incident response, and post-mortems.
- Partner with product teams to drive improvements in resiliency, cost efficiency, and performance.
- Develop automation to reduce manual operations and improve recovery time.
- Build and maintain observability (metrics, logs, traces) that drives data-driven engineering decisions.
- Contribute to a blameless culture of learning through continuous improvement and knowledge sharing.
Qualifications
Required Qualifications:
- Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration
- 2+ years coding skills in languages such as C#, Python and PowerShell.
- OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
- OR equivalent experience.
Other Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- 3+ years coding skills in languages such as C#, Python and PowerShell.
- Experience with monitoring, logging, and distributed systems troubleshooting.
- Knowledge or hands-on experience in AI/ML systems.
- 2+ years technical experience working with large-scale cloud or distributed systems.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Site Reliability Engineer II
Office
Redmond, Washington, United States
Full Time
101k - 215k USD/year
September 18, 2025