Microsoft Site Reliability Engineer II

New job, posted less than a week ago!

Job Details

Posted date: May 14, 2025

There have been 13 jobs posted with the title of Site Reliability Engineer II all time at Microsoft.

Category: Software Engineering

Location: Redmond, WA

Estimated salary: $153,550
Range: $98,300 - $208,800

Employment type: Full-Time

Travel amount: 25.0%

Work location type: Up to 50% work from home

Role: Individual Contributor


Description

Microsoft has been a leading company in computing for decades. We are a global service, relied on by governments, utilities, schools, and co-operatives to deliver the things they need to work, every day.

  

To make this work for our customers, we need continual effort to make that delivery reliable. To drive reliability, we are looking for individuals who already are, or is interested in becoming, a Site Reliability Engineer II (also known as SRE). 

SREs are people who take engineering-based approaches to solve operations problems: we like infrastructure, we like seeing how big complicated things work, and most importantly, we gain fulfillment from making it better. We have backgrounds in lots of things -- of course, Computer Science, System Administration, Networking, Mathematics, and Engineering generally, but you can also find individuals who've worked in Physics, Chemistry, Biology, Statistics, and even English. 

The OneDrive SharePoint (ODSP) team is seeking a Site Reliability Engineer II. SREs build, monitor, and maintain the systems and infrastructure that ensure our customers can quickly access their data and run workloads whenever and wherever they need to. We identify service problems and areas for improvement, and we follow up by fixing those problems. Our work is key to the success of many of the Microsoft services you'll have heard of, and a number you haven't. There are very few bits of Microsoft which aren't touched by SREs in some way or other.  SREs come in two kinds: SRE-SWE (people with a software engineering background), and SRE-SE (people with a systems engineering background). 

Since how software is written determines the behavior of a system generally, often in subtle or initially misunderstood ways, an SRE-SWE cares deeply about software quality and how software is constructed. Release processes, safe deployments, measuring performance, and similar concerns beyond pure functionality are important to SREs and are relevant to improving reliability. If this resonates with you, we'd like to connect!

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Develops technical expertise in the code, features, and operations of specific products as required to identify opportunities to improve product availability, reliability, efficiency, observability, and/or performance; actively participates in on-boarding, code/design reviews, and regular meetings with engineering teams that develop and/or manage those products.Develops, tests, and implements changes to optimize code and improve the observability, reliability and operability of components and features of one or more platforms, systems, or products operating at scale.Leverages technical expertise in large scale distributed systems and specific products, as well as objective insights drawn from analyses of production telemetry data to suggest changes or add-ons to product features or code to improve the availability, reliability, efficiency, observability, and performance of product components or features supported by their team.Engages with product engineering teams by participating code/design reviews, regular meetings, on-call rotations and incident responses throughout product development and operations cycles; leverages technical expertise on underlying systems/platforms and insights drawn from engagements with product engineering teams and telemetry analyses to propose potential improvements in code base and designs across components and features of one or more products.Responds to incidents during regular on-call rotations by identifying the level of impact, troubleshooting issues, and deploying appropriate fixes to resolve root cause(s); alerts product teams and owners to major customer impacting issues and escalates resolution of highly impactful issues affecting multiple components or features to other engineers or engineering teams as needed. Shares details related to incidents and their resolution through post-mortem reports and during regular review meetings.Independently uses existing tools and/or models to troubleshoot problems or flaws affecting the availability, reliability, performance, and/or efficiency of components and features; proposes solutions that will resolve and prevent recurring issues and brings them to the attention of their Site Reliability Engineering (SRE) and/or product engineering teams.Leverages technical expertise and telemetry analysis across a range of components and/or features to identify patterns and opportunities to implement configuration and data changes for one or more platforms, systems, or products in production using code, tooling, and automation.



Qualifications

Required Qualifications:

Master's Degree in Computer Science, Information Technology, or related field OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ years of technical experience in software engineering, network engineering, or systems administration OR 4+ years of technical experience in software engineering, network engineering, or systems administrationExperience in infrastructure, scale, performance, and/or the behavior of distributed systems.

Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: 

Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter. 

Preffered Qualifications:

Master's Degree in Computer Science, Information Technology, or related field AND 1+ years of technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years of technical experience in software engineering, network engineering, or systems administration OR 5+ years of technical experience in software engineering, network engineering, or systems administration

Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $127,200 - $208,800 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications for the role until May 18, 2025.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

#ODSPEng #MicrosoftSREJobs #SharePointOnline



Email/text job link for Site Reliability Engineer II at Microsoft

Provide your email or phone number to recieve a short message with the job link and details.

Check out other jobs at Microsoft.