AIOps Architect to lead the design and implementation of an AIbased observability and correlation engine using Selector AI. Architect solutions that enhance system reliability automate incident management and enable proactive IT operations through advanced machine learning (ML) and event correlation.
Key Responsibilities
- Design & Implementation: Architect and deploy AIdriven observability platforms using Selector AI to unify metrics logs traces and events across hybrid cloud environments.
- AI/ML Integration: Develop correlation engines to identify patterns reduce noise and automate root cause analysis (RCA) using ML models.
- Tooling & Automation: Integrate Selector AI with existing monitoring tools (e.g. Prometheus Grafana ELK New Relic Dynatrace) and orchestrate automated remediation workflows.
- Model Development: Build and train ML models for anomaly detection predictive alerting and incident prioritization.
- Collaboration: Partner with DevOps SRE and Selector Data science teams to align AIOps strategies with business goals.
- Performance Optimization: Continuously refining correlation rules reduce false positives and improve system accuracy.
- Innovation: Stay ahead of AIOps trends (e.g. causal inference topologyaware analytics) and evaluate new tools/techniques.
- Documentation: Create architecture blueprints runbooks and best practices for AIOps adoption.
Qualifications
- Education: Bachelors/masters in computer science Data Science or related field.
- Experience: 5 years in Observability Tools AIOps and cloud operations with 2 years focused on AI/MLdriven observability.
- Technical Skills:
- Proficiency in Selector AI or similar platforms (e.g. Moogsoft BigPanda).
- Expertise in AI/ML frameworks (TensorFlow PyTorch) and observability tools (ELK Stack OpenTelemetry).
- Handson experience with cloud platforms (AWS Azure GCP) and containerization (Kubernetes Docker).
- Strong programming skills in Python SQL.
- Soft Skills: Problemsolving crossfunctional collaboration and excellent communication.
- Certifications (Bonus): AWS/Azure Architect Kubernetes or ML certifications.
Preferred Qualifications
- Experience implementing Selector AI for largescale event correlation.
- Familiarity with big data technologies (Kafka Spark) and CI/CD pipelines.
- Knowledge of ITIL processes and Agile methodologies.
Additional Information :
All your information will be kept confidential according to EEO guidelines.
Remote Work :
No
Employment Type :
Fulltime