The Apple Customer Systems Operations team is looking for a highly skilled and motivated OPS Engineer (Operations Engineer) to join our operations. In this role, you will be responsible for maintaining the reliability, availability, and performance of business-critical, globally distributed systems. You will design and develop automation solutions to streamline system sustenance, monitoring, and operational workflows while collaborating closely with support and engineering teams.
- Minimum of 4 years of working experience.
- Demonstrated ability to lead SEV1/SEV2 incident bridges, conduct blameless postmortems, and drive problem management initiatives; incident management experience.
- Strong knowledge of production support practices for handling large-scale, mission-critical web and iOS applications in a 24x7 onshore/offshore model.
- Experience in troubleshooting, analyzing logs, and building metrics and operational dashboards.
- Fundamental understanding of distributed systems (e.g., microservices, messaging brokers) and Linux operating system internals.
- Experience leading global operations teams in large-scale enterprise environments, with strong collaboration and leadership skills.
- Expertise in observability and monitoring using tools such as Hubble, ExtraHop, Splunk, and similar platforms.
- Foundations in networking (HTTP, DNS, TCP/IP, ICMP, OSI model, subnetting, load balancing).
- Hands-on engineering background with Java/JEE, REST APIs, Swift/Objective-C, databases (schema design, data access), and modern frontend technologies (React, JavaScript).
- Proficiency in supporting scalable event-driven architectures (Kafka or equivalent), large distributed systems, and high-availability platforms.
- Strong automation skills with Python/Linux, including CI/CD pipelines, Infrastructure as Code, Kubernetes/EKS (deployment strategies, scaling, troubleshooting), and self-healing systems.
- Experience applying AI/ML for operational automation (anomaly detection, predictive alerting, automated incident response).
- Familiarity with ITSM frameworks and enterprise support practices.