Senior Engineer, Production Engineering & Incident Response
Plantation, FL
Applications have closed
Magic Leap
Explore Magic Leap AR for business. Improve your organization's training, 3D visualization, collaboration, and remote assistance workflows.
Magic Leap is looking for a senior engineer to focus on live site operations and incident response management.
Job Description
In this role, you will be responsible for the day-to-day operations of our production live site systems, coordinate the response to outages, and build incident management engineering systems based on industry standards and ITSM principles.
The ideal candidate is highly knowledgeable about ITSM and experienced in IT incident management engineering and process improvement, with a proven track record of resolving critical incidents affecting engineering services built on a microservice architecture.
Responsibilities
- Oversight of 24x7 Major Incident Response
- Continually improve the engineering, efficiency and effectiveness of the Incident Response program
- Develop, measure, and report process performance and functional metrics in order to identify opportunities, measure success, or validate expected outcomes
- Tightly integrate incident management tools & processes with monitoring & observability platforms, production engineering dashboards, and other ITSM tools
- Define SLO & SLA metrics with engineering service owners & work with monitoring team to
- Bring continuous improvement to support and operational practices.
- Handle escalations and communicate clearly and effectively to all stakeholders including senior company leaders
Qualifications
- 10+ years of incident management experience at a fast-paced technology company
- Track record of managing complex incidents
- 8+ years of experience managing production systems, including build & release tools and large-scale, public cloud-based microservices with 100K+ concurrent users
- Prior experience working in production engineering with regional NOC & SOC
- Prior experience with instrumenting mission critical services on a globally distributed level, using cloud hosting providers like AWS, GCP and more
- Prior experience integrating event management systems such as PagerDuty and other production engineering systems
- Prior experience with CloudWatch, Stackdriver, Prometheus, Datadog, Sumo Logic
Education
- BA/BS in Computer Science or related field and equivalent experience
Additional Information
All your information will be kept confidential according to Equal Employment Opportunities guidelines.