Software Development Manager, AWS Incident Tooling & Response

Amazon

Not Interested
Bookmark
Report This Job

profile Job Location:

Dublin - Ireland

profile Monthly Salary: Not Disclosed
Posted on: Yesterday
Vacancies: 1 Vacancy

Department:

Software Development

Job Summary

The Team
AWS Resilience owns service that prevent and respond to availability and security issues for all AWS other words were the people who keep the cloud running. We work on the most challenging problems with constant new services and possible failure modes to prevent and were looking for talented people who want to help.

AWS Incident Tooling is at the heart of the high availability of Amazon Web Services. We make customer impacting events shorter and less frequent by detecting early large-scale events and providing the tooling to enable fast mitigation. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact. Our engineer time is spent on projects to improve the tooling and automation. We also provide our solutions for other AWS groups to manage their own events. Its an exciting time to join our team as we are growing and expanding our offerings.

The Role
As a Software Development Manager on the team you will manage automated tooling roadmaps and delivery for the detection and resolution of issues within AWS infrastructure. You will work closely with the team managing the incident response and with leadership to gather new requirements. Based on learning from past incidents you will drive further improvements into our automation tooling and processes so that the next event is shorter or avoided entirely. You will coordinate across project teams to expand use of our tooling to additional areas across Amazon. If youre looking for a team with great growth potential and an opportunity to make a huge impact this is the team to join.

AIS
AWS Infrastructure Services (AIS) owns the design planning delivery and operation of all AWS global other words were the people who keep the cloud running. We support all AWS data centers and all of the servers storage networking power and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems with thousands of variables impacting the supply chain and were looking for talented people who want to help.

Youll join a diverse team of software hardware and network engineers supply chain specialists security experts operations managers and other vital roles. Youll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And youll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

Key job responsibilities
Define and Deliver Business Priorities
- Serve as a key contributor and owner of the direction of the AWS Incident Management team
- Define plan track and deliver on strategic goals for the team while ensuring the team remains unblocked and focused

Cross-Site Cross-Team Coordination
- Coordinate with counterparts and sister teams to ensure clear communication channels exist between AWS Incident tooling and Response teams
- Work closely with alarming systems to create and maintain a proper end-to-end experience from detecting and alarming to mitigating incidents

Performance Management/Team Health
- Own all facets of performance and career management for the team
- Ensure the operational load of the team remains manageable and as minimal as possible


A day in the life
Your day will be a blend of strategic planning collaborative problem-solving and innovative tooling development. Youll dive into complex infrastructure challenges design automated solutions and lead a team dedicated to minimizing service disruptions. Expect to engage in cross-functional discussions analyze incident patterns and develop cutting-edge prevention strategies that transform potential risks into opportunities for improvement.

About the team
We are a passionate collective of technology experts committed to maintaining the reliability and security of AWS services. Our team represents diverse backgrounds and skills united by a shared mission to deliver exceptional cloud infrastructure. We believe in collaborative innovation where every team members insight contributes to solving global technological challenges.

ABOUT AWS:
Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description we encourage candidates to apply. If your career is just starting hasnt followed a traditional path or includes alternative experiences dont let it stop you from applying.

Why AWS
Amazon Web Services (AWS) is the worlds most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating thats why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home theres nothing we cant achieve.

Inclusive Team Culture
Here at AWS its in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences inspire us to never stop embracing our uniqueness.

Mentorship and Career Growth
Were continuously raising our performance bar as we strive to become Earths Best Employer. Thats why youll find endless knowledge-sharing mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

- Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle including coding standards code reviews source control management build processes testing certification and livesite operations
- Experience in engineering team management
- Experience in engineering
- Experience in leading the definition and development of multi tier web services
- Experience partnering with product and program management teams

- Experience in communicating with users other technical teams and senior leadership to collect requirements describe software product features technical designs and product strategy
- Experience in recruiting hiring mentoring/coaching and managing teams of Software Engineers to improve their skills and make them more effective product software engineers
- Experience managing a team of high calibre Software Engineers developing complex world class scalable software systems that have been successfully delivered to customers

Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover invent simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice ( to know more about how we collect use and transfer the personal data of our candidates.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status disability or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process including support for the interview or onboarding process please visit
for more information. If the country/region youre applying in isnt listed please contact your Recruiting Partner.


Required Experience:

Manager

The TeamAWS Resilience owns service that prevent and respond to availability and security issues for all AWS other words were the people who keep the cloud running. We work on the most challenging problems with constant new services and possible failure modes to prevent and were looking for talent...
View more view more

Key Skills

  • Feed
  • Jsf
  • Advocacy
  • Java
  • Automobile

About Company

Company Logo

Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa Devices, sporting goods, toys, automotive ... View more

View Profile View Profile