
Job Information
Microsoft Corporation Site Reliability Engineer II in Redmond, Washington
Microsoft’s Cloud Operations & Innovation (CO&I) group is looking for a Site Reliability Engineer II to support the Commissioning (Cx) Automation and Global Cx teams to deploy, monitor, and troubleshoot a distributed test platform. The platform is globally deployed and consists of client and cloud-based applications, custom hardware, wired / wireless networks, and sensor networks that automate the measurement and validation of hardware and electrical components and interconnected systems within large datacenters. Our infrastructure supports more than 1 billion customers and 20 million businesses in over 90 countries worldwide.
In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond
Responsibilities
Configure, monitor, and support the test platform used by the Global Commissioning Team
Establish and maintain the Cx Automation lab as the environment for training and testing new applications
Perform technical evaluation of new devices and test instruments
Lead projects in the lab to add or update test automation or device simulation capabilities
Establish and oversee the Incident Management processes for the team
Develop an understanding of features and operation of all software products and test equipment
Participate in on-call rotations and alert product teams to major customer impacting issues
Analyze telemetry data to identify opportunities to improve the reliability and performance of the platform
Leverage and contribute to troubleshooting tools for commons problems
Evaluate and test new applications and test equipment prior to global deployments
Develop reporting for quality of service, and usage of the application / test instruments
Troubleshoot and repairing test devices or network equipment that is returned from field
Develop code or scripts that reduce the setup and overall testing time
Embody our Culture (https://www.microsoft.com/en-us/about/corporate-values) and Values (https://careers.microsoft.com/us/en/culture)
Qualifications
Required/Minimum Qualifications
7+ years relevant technical engineering experience
OR Bachelor's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 3+ years technical engineering experience
OR Master's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 2+ years technical engineering experience.
Other Requirements:
- Ability to meet Microsoft, customer and/or government security screening requirements arerequired for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Additional or Preferred Qualifications
Bachelor's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 5+ years technical engineering experience
OR Master's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 3+ years technical engineering experience
OR Doctorate Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field
Experience working on large scale distributed test systems or high-speed data acquisition systems.
Experience setting up and troubleshooting wired and wireless networks
Demonstrated proficiency in deploying and monitoring Azure based services
Experience writing code to automate day-to-day tasks with proficiency in C#, PowerShell, Linux, or Python
Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $127,200 - $208,800 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until September 3, 2024
#COICareers
#EPCcareers
#SiteReliability
Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html) .
Microsoft Corporation
-
- Microsoft Corporation Jobs