At Oracle Cloud Infrastructure (OCI), we are redefining the future of computing for enterprises: building cloud-native systems from the ground up, powered by a global team of visionary engineers, scientists, and creators. We combine the agility of a startup with the scale, security, and reach of Oracle's enterprise-grade platforms.
Our Generative AI Service team is pioneering the development of infrastructure and services that harness the transformative power of Large Language Models (LLMs) and Agentic AI systems. Our mission is to build world-class, scalable platforms that enable customers to deploy intelligent agents and applications deeply integrated with OCI's robust cloud ecosystem.
Role Summary
As a Consulting Member of Technical Staff (IC5), you will play a pivotal role in designing, building, and optimizing LLM infrastructure, agent execution runtimes, and next-generation developer platforms. You'll collaborate closely with applied scientists and ML engineers to bring agentic workflows into real-world deployments at scale. This is a hands-on technical leadership role, ideal for someone deeply rooted in distributed systems and low-level computer science.
Minimum Qualifications
- BS in Computer Science or equivalent experience.
- 10 years of experience in production-grade distributed systems and cloud-native software engineering.
- Proficiency in Go, Java, Python, or C.
- Expertise in high-performance computing and ML model serving infrastructure.
- Deep understanding of container orchestration and CI/CD pipelines.
- Strong communication skills and experience mentoring across teams.
Preferred Qualifications
- MS or PhD in Computer Science, particularly in Systems, ML Infrastructure, or Compilers.
- Experience with LLM serving frameworks like vLLM, FasterTransformer, DeepSpeed, or Triton.
- Familiarity with agent-based systems.
- Contributions to LLM-native developer tools and compiler IRs.
- Experience with vector databases, tool APIs, and event-driven workflows.
- Foundation in OS internals, compiler pipelines, and systems programming.
- Proven ability to lead large-scale architecture efforts.
Why Join Us
- Be at the frontier of generative AI and agent-based software interaction.
- Work on mission-critical projects impacting Oracle's AI strategy.
- Collaborate with a globally distributed team of leading engineers and researchers.
- Enjoy the agility of a fast-moving team with enterprise-level resources.
Responsibilities
- Architect and build high-throughput, low-latency serving systems for LLM inference and agent orchestration.
- Design agent-native runtime environments that support dynamic planning, tool calling, memory, and long-running context.
- Integrate foundational AI components with OCI's compute and networking layers.
- Partner with ML research to optimize model training, fine-tuning, and inference performance on GPU clusters.
- Own critical paths of software delivery from architectural review through implementation and post-deployment resilience.
- Contribute to OCI's developer-facing agent framework.
- Tackle deep systems-level challenges, drawing on knowledge in operating systems, compiler design, and cloud primitives.
Career Level - IC5
Required Experience:
Staff IC
Full-Time