company logo

Site Reliability Engineer II - CTJ - Poly

Microsoft.com

101k - 215k USD/year

Office

Reston, Virginia, United States

Full Time

The Resiliency Services team is seeking a Site Reliability Engineer II to help drive the reliability, scalability, and operational excellence of our Azure-based solutions. Our team owns and operates several critical services within the AGC, including Azure Automation (streamlining and automating cloud operations), Azure Backup (secure, scalable data protection), Azure Site Recovery (disaster recovery and business continuity), Azure Migrate (cloud migration planning and execution), and the Learn Documents (comprehensive technical documentation and training resources). We are a geographically distributed, collaborative group with in-person coverage at Reston, Elkridge, and Annapolis Junction, and we pride ourselves on fostering a fun, supportive, and high-performing team environment.

We are looking for an individual who is quality-focused, proactive, and passionate about reliability. The ideal candidate is someone who can identify issues and drive solutions, communicates clearly, and thrives as a team player. You’ll have the opportunity to work across a diverse set of Azure services, ensuring they meet the highest standards for resiliency and customer experience. If you enjoy solving problems, collaborating with talented colleagues, and making a real impact, you’ll find our team both rewarding and enjoyable to work with.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

  • Participate in the team's on-call rotation to ensure rapid response and resolution of service incidents, minimizing downtime and impact to customers.
  • Monitor, maintain, and improve the reliability and availability of Azure Automation, Azure Backup, Azure Site Recovery, Azure Migrate, and Learn Documents, proactively identifying and addressing potential issues before they affect users.
  • Implement and optimize automation solutions using Azure Automation to streamline operational tasks, reduce manual intervention, and enhance service efficiency.
  • Drive continuous improvement in backup, recovery, and migration processes by maintaining Azure Backup and Azure Site Recovery, ensuring robust disaster recovery and business continuity strategies.
  • Support and enhance cloud migration initiatives with Azure Migrate, helping teams plan, execute, and validate migrations to the Azure platform.
  • Contribute to the creation and maintenance of technical documentation and training resources (Learn Documents), ensuring clarity, accuracy, and accessibility for both internal teams and external customers.
  • Collaborate effectively with team members and stakeholders, communicate clearly about issues and solutions, and foster a positive, fun, and supportive team environment—always striving for quality and taking initiative to fix what needs fixing.
  • Embody our culture and values.

Qualifications

Required/Minimum Qualifications:

  • Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience.

Other Requirements:

Security Clearance Requirements: Candidates must be able to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:  
 

  • The successful candidate must have an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph. Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. Failure to maintain or obtain the appropriate U.S. Government clearance and/or customer screening requirements may result in employment action up to and including termination.
  • Clearance Verification: This position requires successful verification of the stated security clearance to meet federal government customer requirements. You will be asked to provide clearance verification information prior to an offer of employment.
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.   
  • Citizenship & Citizenship Verification: This position requires verification of U.S. citizenship due to citizenship-based legal restrictions. Specifically, this position supports United States federal, state, and/or local United States government agency customer and is subject to certain citizenship-based restrictions where required or permitted by applicable law. To meet this legal requirement, citizenship will be verified via a valid passport, or other approved documents, or verified US government Clearance 

Preferred/Additional Qualifications:

  • Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 5+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience.
  • 2+ years technical experience working with large-scale cloud or distributed systems.

Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications for the role until September 26, 2025.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

#Silver

Site Reliability Engineer II - CTJ - Poly

Office

Reston, Virginia, United States

Full Time

101k - 215k USD/year

September 19, 2025

company logo

Microsoft

Microsoft