AWS Infrastructure Services owns the design planning delivery and operation of all AWS global other words were the people who keep the cloud running. We support all AWS data centers and all of the servers storage networking power and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems with thousands of variables impacting the supply chain and were looking for talented people who want to help.
Youll join a diverse team of software hardware and network engineers supply chain specialists security experts operations managers and other vital roles. Youll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And youll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
- Our organization -
The Infrastructure Operations (Data Center) Team is the backbone of AWS supporting the rapidly growing AWS business and customers 24/7. We are committed to maintain the physical infrastructure of AWS ensuring the standards for operational performance in the areas of safety security availability productivity capacity efficiency and cost.
As a member of the Infrastructure Operations (Data Center) Team you will have the chance to work on the most advanced technologies in a DYNAMIC environment with expanding opportunities.
If you enjoy working in a strong and close-knit diverse team Infrastructure Operations (Data Center) Team is the place to be!
- Our team -
The Amazon Data Center Engineering Operations (DECO) team is seeking a strong subject matter expert (SME) who can deploy operate and maintain the facilities (electrical/mechanical systems control/fire-fighting systems etc.) of our large-scale high-density data centers. We support our internal and external customers 24x7 all year so work is by shift on-call or a combination of the two.
There are on-call duties and this role will cover shift in case as needed.
Key job responsibilities
MAIN RESPONSIBILITIES
Own as the site SME and POC plan review evaluate operate maintain improve and manage mission-critical facilities including vendor management day to day hands-on work and supervision relating to decrease/increase of rack capacity onsite on-going or future construction works planned maintenance works and urgent or emergency changes along with the AWS Infrastructure Priorities.
Participate in and be responsible for future Capacity Availability and other projects of assigned sites review evaluate and give feedback on designs from Operations viewpoint to mitigate Safety Security and Availability risks beforehand.
Prepare and implement countermeasures for natural disasters emergency response to high priority/critical incidents including creating EOPs training staff and preparing appropriate tools. Respond to high severity events and large scale events as the owner of the operations. Understand SOO and EOPs troubleshoot mitigate and resolve issues write and update senior leaders through regular and timely reports conclude issue with complete root cause analysis.
Review evaluate and proactively identify SPOF risks or vulnerability in data center (electrical mechanical control) designs test and commissioning program construction and operations processes and consider plan coordinate propose negotiate persuade grant approval from stakeholders for the issue remediation and/or mitigation plan and deliver results
Build sustainable and scalable mechanisms to collect review and report regular metrics and KPI of the team plan propose and drive kaizen based on the metrics and KPI results
Understand and develop team structure create and document headcount requirements help drive interview and hire bar raising candidates build strong team through delegation development training directing coaching empowering motivating promoting and managing 6 to 10 engineers including regular performance review and discussion
Plan review update and manage budget and procurement.
About the team
About AWS
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description we encourage candidates to apply. If your career is just starting hasnt followed a traditional path or includes alternative experiences dont let it stop you from applying.
Why AWS
Amazon Web Services (AWS) is the worlds most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating thats why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger more collaborative teams. Our continual innovation is fueled by the bold ideas fresh perspectives and passionate voices our teams bring to everything we do.
Mentorship & Career Growth
Were continuously raising our performance bar as we strive to become Earths Best Employer. Thats why youll find endless knowledge-sharing mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home theres nothing we cant achieve in the cloud.
Bachelors degree or equivalent in engineering computer science or a related field.
7 years relevant experience operating and managing mission critical facilities
5 years people management experience including hiring and developing the best team promotion experience
English proficiency (verbal and written) at a level that enables to communicate with global team - TOEIC 800 or equivalent and Japanese proficiency (verbal and written) JLPT N2 or equivalent
In depth knowledge of Data Center Facilities such as generators chillers cooling towers air handling units UPS electrical sub distribution systems fire detection and suppression systems cable reticulation systems
Experience with operations and maintenance of data centers.
CDCP CDCS CDCE CDFOM CDFOS AHRAE ATD or similar certifications
Certifications necessary for managing data centers (Type II Chief Electrical Engineer Qualified Energy Manager Type 4 Hazardous Materials Engineer Refrigeration Machinery Manager 2nd Class Qualified Certified Electrician)
Project Management certification (PMP Prince2 ITILv2 BICSI).
Strong ability to understand electrical systems (supply system of power substations transformers switchgears VFI-class UPS DRUPS PDU ATS STS SLA or VRLA battery and related systems fuel systems related to diesel/gas turbine generators surge control circuits active harmonic filters battery monitoring systems branch circuit monitoring systems SCADA systems)
Strong ability to understand mechanical systems (CRAC/CRAH AHU chillers cooling towers storage tanks heat exchangers plumbing systems pumps valves duct systems fans dampers fire detection and extinguishing systems drainage systems building monitoring systems automatic control systems)
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process including support for the interview or onboarding process please visit
for more information. If the country/region youre applying in isnt listed please contact your Recruiting Partner.