Role: AI/ML Engineer
Location: San Jose, CA (5 days WFO)
Notice period: 2 weeks
Job Description
- Design and implement AI agents to optimize cloud resource allocation, auto-scaling, and performance tuning.
- Develop predictive models for failure detection, incident management, and system health monitoring.
- Automate operational workflows using machine learning and intelligent scripting.
- Integrate AI-driven insights with existing cloud monitoring tools.
- Collaborate with DevOps and SRE teams to deploy, monitor, and improve ML models in production environments.
- Conduct anomaly detection for security, cost optimization, and performance analytics.
- Continuously evaluate emerging AI technologies and tools for operational improvements.
- Maintain documentation and best practices for AI/ML integration in cloud systems.
Our Minimum Requirements include:
- Bachelor's degree or equivalent experience, or Master's degree, in Computer Science, Data Science, or a related technical field.
- Proven ability to build and deploy ML models, with at least 2 years focused on infrastructure or cloud operations.
- Solid knowledge of hybrid cloud technologies (AWS, GCP, OpenStack, Kubernetes).
- Experience with Python, Jupyter, and ML libraries such as PyTorch, TensorFlow, or scikit-learn.
- Familiarity with cloud-native monitoring, logging, and automation tools (e.g., Terraform, Ansible, Prometheus, Splunk, AppDynamics).
- Comfortable working with streaming data, APIs, and telemetry systems.
- Strong communication and multi-functional collaboration skills.
- Experience with Agile and DevOps operating models, including project tracking tools (e.g., Jira), Git (or any version control system), and CI/CD systems (e.g., GitLab, GitHub Actions, Jenkins).
- Proficient in general-purpose programming languages (Python, Go, Bash, and/or C/C++) and development platforms and technologies.
Preferred Qualifications
- Deep understanding of operating systems and experience with Cisco technologies (UCS, Nexus, ThousandEyes).
- Established record of leading technical initiatives and delivering results, with a commitment to fostering a supportive work environment.
- Hard-working and dedicated to providing quality support for your customers.