Responsibilities
Network Design, Build & Lifecycle Management
Participate in the design and architecture of network solutions, contributing to the development of standalone features and enhancements
Lead and support network build, upgrade, and migration projects as part of the IES Cloud Services transformation to OCI infrastructure
Implement, maintain, and support cloud-based Network SaaS components in alignment with SLAs/OLAs
Collaborate with program and project managers to define milestones, deliverables, and execution plans
Create and maintain runbooks, methods of procedure (MOPs), and documentation for network changes and operations
Operations & Reliability
Ensure site reliability, performance, and security, maintaining Site UP as the highest operational priority
Actively monitor the production environment with the Global Operations Center to proactively detect and resolve issues
Provide break-fix support during network events, act as an escalation point, and lead root cause analysis and postmortem documentation
Participate in on-call rotations, including weekend or global coverage as required
Develop and capture network service health metrics to monitor and report on service performance
Automation & Tool Development
Develop and maintain automation scripts and tools (Python or other object-oriented languages) to streamline network operations
Automate manual processes using APIs and automation frameworks to improve operational efficiency
Coordinate with network automation services teams to design and integrate support tooling
Serve as a Subject Matter Expert (SME) on network automation and software development initiatives
Collaboration & Continuous Improvement
Work closely with network vendor technical teams and internal QA to drive bug resolution and firmware/software validation
Partner with cross-functional teams to drive innovation and process improvement
Lead and participate in automation and innovation workshops to enhance organizational capabilities
Mentor and guide junior engineers, promoting skill development in automation, troubleshooting, and network design
Required Qualifications
Bachelor's degree in a technology-related field, or equivalent hands-on experience in large-scale network environments
7 years of professional experience supporting carrier-grade, IP-based ISP, web-scale, on-premises datacenter, and cloud-provider network infrastructures
Proven record of progressive responsibility and technical growth in network engineering roles
Advanced-level expertise with multiple network operating systems, including Cisco IOS, NX-OS, and TMOS
Strong operational knowledge of internet routing protocols and core networking concepts, including TCP/IP, BGP, iBGP, EGP, MPLS, IS-IS, OSPF, Anycast, RHI, and route reflection
Proficiency in network automation and DevOps practices, with experience using tools and languages such as Python, Ansible, Chef, Docker, Terraform, Perl, JavaScript, JSON, REST, iControl, Bash, YAML, XML, and iControl REST
Advanced knowledge of firewall technologies such as Cisco ASA and Palo Alto
Deep understanding of Layer 4-7 protocols, including TCP, UDP, AH, ESP, SMB, RCP, TLS, SSL, HTTP/HTTPS, DNS, SNMP, SMTP, and SSH
Experience supporting cloud environments across IaaS, PaaS, SaaS, and LBaaS offerings
Advanced experience in capacity planning, traffic engineering, and performance optimization for large-scale networks
Preferred Skills and Experience
5 years of experience in network peering design, customer provisioning, migration, and decommissioning
1-2 years of experience designing and managing cloud-based network services
Working knowledge of databases such as Oracle Database, SQL Server, MySQL, or PL/SQL
Hands-on experience with traffic management and load balancing technologies
Experience supporting DNS infrastructure and services
Proficiency working in Linux/Unix environments for network operations and troubleshooting
3 years of experience managing Incident and Problem Management workflows, including automation, defect tracking, and analytics
Practical experience using Apex, Oracle BI, or Grafana for analytics, metrics, and reporting
Experience with network change management and release management processes
Proven ability to define and measure Objectives and Key Results (OKRs), Key Performance Indicators (KPIs), Operational Level Agreements (OLAs), and Service Level Agreements (SLAs) to drive continuous improvement