Application Developer-AIMI
Job Summary
Primary Skills & Experience
AI / Machine Learning & Data Engineering (Primary)
Strong proficiency in Python for AI/ML and data engineering
Experience designing and deploying AI/ML applications in production
Hands-on experience with LLMs and APIs (OCI Generative AI OpenAI or similar)
Experience with prompt engineering evaluation frameworks and RAG pipelines
Understanding of anomaly detection pattern recognition and time-series analysis
Experience with vector databases / similarity search systems
Observability Backend & Distributed Systems (Core)
Strong understanding of observability principles (metrics logs traces events)
Experience with distributed systems debugging and reliability engineering
Hands-on experience with OpenTelemetry and monitoring tools (Prometheus Grafana OCI Monitoring)
Strong backend development experience with Python APIs and microservices
Familiarity with event-driven architectures and streaming platforms (Kafka OCI Streaming)
Understanding of scalable fault-tolerant system design
Experience with monitoring alerting dashboards and search platforms (Elasticsearch/OpenSearch)
Qualifications
Bachelors or Masters degree in computer science or related field
Experience with AI-powered observability or AIOps systems preferred
Knowledge of incident management root cause analysis and SLO/SLA frameworks
Experience with multi-tenant large-scale distributed systems
Strong communication and collaboration skills in an agile environment.
AI / Machine Learning & Data Engineering (Primary)
Strong proficiency in Python for AI/ML and data engineering
Experience designing and deploying AI/ML applications in production
Hands-on experience with LLMs and APIs (OCI Generative AI OpenAI or similar)
Experience with prompt engineering evaluation frameworks and RAG pipelines
Understanding of anomaly detection pattern recognition and time-series analysis
Experience with vector databases / similarity search systems
Observability Backend & Distributed Systems (Core)
Strong understanding of observability principles (metrics logs traces events)
Experience with distributed systems debugging and reliability engineering
Hands-on experience with OpenTelemetry and monitoring tools (Prometheus Grafana OCI Monitoring)
Strong backend development experience with Python APIs and microservices
Familiarity with event-driven architectures and streaming platforms (Kafka OCI Streaming)
Understanding of scalable fault-tolerant system design
Experience with monitoring alerting dashboards and search platforms (Elasticsearch/OpenSearch)
Qualifications
Bachelors or Masters degree in computer science or related field
Experience with AI-powered observability or AIOps systems preferred
Knowledge of incident management root cause analysis and SLO/SLA frameworks
Experience with multi-tenant large-scale distributed systems
Strong communication and collaboration skills in an agile environment.