Network Development Engineer, Office Network Reliability Engineering
Job Summary
Key job responsibilities
- Provide Tier 3 escalation support on a rotating on-call schedule for your regional hub diagnosing and resolving complex office network incidents including multi-site outages routing protocol failures wireless infrastructure degradation and circuit performance problems while maintaining clear communication with operations teams
- Build capability and reduce escalations by conducting structured learning sessions after high-severity incidents identifying gaps in training permissions tooling or technical barriers and developing automation and self-service tools that enable operations teams to independently handle incidents
- Deliver knowledge transfer and training to operations engineers across your regional hub covering complex failure patterns diagnostic techniques and resolution approaches based on real escalation data and monthly operational reviews
- Execute proactive reliability engineering by conducting Network Availability Risk assessments driving Operating System Compliance programs implementing Configuration Compliance initiatives and participating in Network Infrastructure Validation reviews to identify and remediate technical debt vulnerabilities and architectural risks before they cause incidents
- Contribute to platform and tooling development by developing and integrating alarming systems automation scripts and monitoring improvements that enhance observability and operational efficiency across the office network infrastructure
A day in the life
As a Network Development Engineer on the Office Network Reliability Engineering team youll operate at the intersection of immediate problem-solving and long-term system improvement. Your day might begin by reviewing overnight escalations during handoff identifying a pattern in wireless controller failures that points to a configuration gap. Youll document the root cause and draft an automated remediation script that enables our Operations Management Center team to self-heal this failure type going forward.
Later you might receive an escalation about a multi-site network issue affecting three offices in your region with Amazonians unable to access internal systems. Youll take ownership of the escalation engage with carriers isolate the fault to a circuit configuration issue and restore service. Youll then document the resolution and schedule a lessons learned session to identify why our operations team didnt have the tooling or permissions to address this independently.
In the afternoon you might join a Network Infrastructure Validation review for a new campus design making recommendations on alerting coverage and pre-built runbooks before the design moves to production. Youll close your shift by updating documentation handing off to the next regional team and reviewing action items from recent lessons learned sessions. No two days are the sameyoull work in an environment where Amazons scale means developing durable scalable solutions that have direct and visible impact on hundreds of thousands of people.
About the team
We are a globally distributed team of network engineers operating on a 24/7/365 follow-the-sun model across three regional hubs: EMEA APAC and AMER. Our mission is to make the office network invisible to the 540000 Amazonians who depend on it every day. We partner closely with the Operations Management Center Office Infrastructure Excellence AWS Enterprise Networking and onsite IT support teams to ensure highly available reliable and performant networks across all corporate offices.
Our vision centers on building systems and processes that scale Amazons ability to prevent and resolve network incidents. Were investing in automation platforms monitoring improvements and lifecycle automation to reduce the burden on our operations teams and enable them to handle increasingly complex scenarios independently. When you join us youll be part of a team that values engineering excellence intellectual curiosity and partnershipwhere youll have significant autonomy to develop innovative solutions that go beyond standard industry patterns.
- Associates degree or above
- Network automation experience (Python)
- Major internet routing protocols experience
- Experience with enterprise routing protocols including BGP OSPF MPLS and their operational behavior in large corporate or cloud provider network environments
- Experience operating and troubleshooting major network platforms and operating systems including Cisco IOS IOS-XE NX-OS and/or Aruba AOS
- Experience working independently and as part of large distributed engineering teams across time zones
- Industry experience in large-scale network environments including cloud provider ISP corporate enterprise or large carrier networks
- Demonstrated experience in 24/7 on-call operations for high severity incident response
- Experience with Cisco ISE Aruba ClearPass or equivalent Network Access Control (NAC) platforms
- Familiarity with IT Service Management platforms specifically ServiceNow including incident management workflows TSG development and CMDB
- Experience building automation tooling self-service platforms or operational runbooks for use by operations teams with varying technical backgrounds
- Track record of conducting post-incident reviews root cause analysis and lessons learned sessions with a focus on permanent defect elimination
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover invent simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice ( to know more about how we collect use and transfer the personal data of our candidates.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status disability or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process including support for the interview or onboarding process please visit for more information. If the country/region youre applying in isnt listed please contact your Recruiting Partner.
Required Experience:
IC
About Company
Free shipping on millions of items. Get the best of Shopping and Entertainment with Prime. Enjoy low prices and great deals on the largest selection of everyday essentials and other products, including fashion, home, beauty, electronics, Alexa Devices, sporting goods, toys, automotive ... View more