Principal Forensic Engineer

Redmond, Washington, United States

Microsoft

Every company has a mission. What's ours? To empower every person and every organization to achieve more. We believe technology can and should be a force for good and that meaningful innovation contributes to a brighter world in the future and today.

View company page

Microsoft Cloud Infrastructure and Operations (CO+I) is the engine that powers Microsoft's cloud services. The group is responsible for designing, building, and operating Microsoft’s global datacenters; managing the programmatic delivery of our critical infrastructure design, equipment procurement, construction delivery, infrastructure innovation, demand planning and capacity utilization of our unified infrastructure; and responsible for all operations needed to run the physical infrastructure.

 

We focus on smart growth with an emphasis on automation, data-driven engineering, cost‐effectiveness, and environmental sustainability. We deliver the core infrastructure and foundational technologies for Microsoft's 200+ online businesses including Azure, Office 365, Bing, Xbox Live, Skype, and OneDrive.  Our portfolio is built and managed by a team of subject matter experts working 24x7x365 to support services for more than 1 billion customers and 20 million businesses in over 90 countries worldwide. 

   

Within CO+I, the Forensic Engineering Team is responsible for performing Root Cause Analysis (RCA) on systemic issues and investigating when critical components fail.  Within Forensic Engineering, we are seeking a motivated and experienced Principal Forensic Engineer to join our team. If you are a strategic thinker with a passion for driving business success, we encourage you to apply for this exciting opportunity. 

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. 

 

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day and to empower billions!    

Responsibilities

  • Lead and track forensic analysis of events which have occurred within the data center infrastructure.
  • Serve as a functional specialist by being able to speak to all aspects of data center functions and failure modes in critical environments.
  • Develop methodologies to validate data center performance, system control parameters and operational efficiency against design intent and determine quantifiable deviations.
  • Perform troubleshooting and root cause analysis associated with equipment failure.
  • Review equipment and system performance data to identify issues through trend analysis.
  • Assists in the troubleshooting of issues in the field, remotely or in person.
  • Review compliance with existing corrective and preventative maintenance program to enhance operational readiness.
  • Analyze full time employee and vendor staffing to include training, procedures, and site requirements as part of root cause analysis.
  • Foster and promote our proactive implementation of lesson learned from analysis across multiple design, construction, and operational organizations.
  • Develop solutions for defects identified through trends and data analysis.
  • Drive global standardization and consistency of processes, procedures, and reports with Operations teams for Quarterly Business Reviews.
  • Work with Site Operations Engineers to establish visual standards, process improvement and error proofing systems to drive efficiency within the business.
  • Identify and monitor the need for use of new tools to improve the quality of data and analytics.
  • Embody our culture and values.

Qualifications

Required/Minimum Qualifications:

  • Bachelor's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 8+ years of mission critical experience in electrical, mechanical, or controls engineering
    • OR Master's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 7+ years of mission critical experience in electrical, mechanical, or controls engineering
    • OR Doctorate Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 5+ years of mission critical experience in electrical, mechanical, or controls engineering.
  • Ability to travel up to 50%

Other Requirements:

 

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: 

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred/Additional Qualifications:

  • Proficiency in Datacenter or Critical Environment’s Mechanical and Electrical systems
  • Experience leading and managing a business-critical function
  • Experience leading construction, design, and process reviews assessing availability and reliability threat vectors, while partnering with various teams to design out or eliminate the potential issues
  • Passion to drive resolution and  understand root causes and incident triggers
  • Understanding of datacenter and topologies or equivalent Mission Critical facility background
  • Analytics capabilities to extract and summarize large, complex data from multiple databases and systems
  • Ability to influence cross functional team and leadership team in driving process improvement, efficiencies, and best practices

 

Reliability Engineering IC5 - The typical base pay range for this role across the U.S. is USD $133,600 - $256,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $173,200 - $282,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

 

Microsoft will accept applications for this  role until April 30, 2024. 

 

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

 

#COICareers

Apply now Apply later
  • Share this job via
  • or

Tags: Analytics Automation Azure Cloud Compliance Travel

Perks/benefits: Career development Medical leave Team events

Region: North America
Country: United States
Job stats:  6  1  0

More jobs like this

Explore more InfoSec / Cybersecurity career opportunities

Find even more open roles in Ethical Hacking, Pen Testing, Security Engineering, Threat Research, Vulnerability Management, Cryptography, Digital Forensics and Cyber Security in general - ordered by popularity of job title or skills, toolset and products used - below.