System Development Engineer - Incident Management at Amazon
Jan. 4, 2019, 5:37 a.m.
Systems, Quality, & Security Engineering
with service team Amazon Web Services
Amazon TechOps is at the heart of the high availability of Amazon Web Services. We make customer-impacting events shorter and less frequent by providing large-scale event and incident management. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact, and much of our engineering time is spent on projects to improve the tooling and automation. We also provide manual incident management for AWS and other Amazon groups, directing the resolution of an issue with service teams and diving deep into those events to drive improvements to the tooling. It's an exciting time to join our team as we are rapidly growing and expanding our offerings. As a System Development Engineer on the team, you will build tooling to automate the detection and resolution of issues within AWS and Amazon infrastructure. You will also spend a portion of your time directing the resolution of high-visibility incidents by leading conference calls and virtual teams. Using data
• Bachelor's Degree in Computer Science or at least 4 years of relevant experience in a large-scale technical environment
• 3-5 years of experience building software for internal or external use
• 3-5 years of experience using and troubleshooting Linux or Unix based systems
• 3-5 years of experience troubleshooting and resolving technical issues in a distributed environment
• 2+ years of experience driving collaborative projects from conception to delivery using Agile/Scrum methodology
• Solid grasp of networking fundamentals
• Effective organizational skills and the ability to maintain a consistently high