Senior Backend Engineer, Distributed Systems and Applied AI
Job Summary
Who are we
Were a team of engineers and architects building core engineering platforms for Oracle SaaS including Fusion Applications. Our mission is to help developers and operators across Oracle SaaS understand troubleshoot and optimize complex services with speed confidence and intelligence.
Weve built a Kubernetes-native cloud-first observability platform that goes far beyond monitoring. It supports global-scale telemetry real-time querying and rich signals across metrics logs traces and service behavior. Our systems process billions of telemetry signals daily from millions of compute nodes worldwide using lightweight agents and edge components that run close to the services they observe. We build on open source ecosystems such as OpenTelemetry Kubernetes and other cloud-native technologies while extending them through internal innovation for Oracle SaaSs scale and complexity.
Looking ahead were building toward an AI-native observability platform for Oracle SaaS. Our ambition is to evolve systems that have traditionally served human engineers into platforms designed for both AI-assisted workflows and autonomous AI agents. This means making our services APIs data models and tools natively usable by AI; building cost-efficient telemetry pipelines and distributed lakehouse architectures for large-scale AI analysis; and expanding into advanced signals such as continuous profiling where AI can make complex performance data easier to understand and act on.
What makes this role unique
The work
- Combine engineering exploration and operations to build systems that span high-volume data processing real-time querying developer workflows and edge data collection
- Own critical platform components and shape how engineers interact with telemetry service context and AI-assisted workflows at global scale
The impact
- Build for thousands of Oracle SaaS engineers who use the platform every day to develop debug and operate services with direct feedback on what helps what hurts and what needs to improve
- Help define the next chapter of AI-native observability for Oracle SaaS moving from dashboards and manual investigation toward intelligent workflows powered by telemetry context and AI assistants and agents
What were looking for
- Strong backend engineering fundamentals with experience designing and maintaining reliable scalable services or distributed systems
- Proficiency in one or more systems-oriented languages such as Go Java or Rust; our team primarily uses Go
- Hands-on experience with Kubernetes-native environments and cloud infrastructure such as OCI AWS GCP or Azure
- Ability to solve complex engineering problems in areas such as performance scalability reliability data processing cost efficiency or developer experience
- Strong product sense and empathy for engineers who depend on internal platforms to develop debug and operate services
- Ability to write clear technical designs communicate tradeoffs and collaborate across teams
- For senior candidates experience leading design efforts mentoring engineers and owning production systems through their full lifecycle
Level Flexibility
This role is primarily scoped for senior engineers typically IC4IC6. We are also open to exceptional earlier-career engineers who demonstrate strong fundamentals high learning velocity and clear potential to grow into broad technical ownership. If youre excited about the mission but unsure whether you meet every listed criterion we encourage you to apply.
Bonus skills
- Exposure to AI/ML in production systems especially for infrastructure or engineering platform use cases
- Experience with large-scale high-throughput or globally distributed backend systems
- Experience contributing to open source projects or working with open source ecosystems in production
- Familiarity with observability or SRE domains either as a user of telemetry platforms or as a builder of engineering tools
Responsibilities
What youll do
- Design and build backend systems that power telemetry collection analysis and developer-facing tools
- Drive architectural efforts to improve scale performance cost efficiency reliability and operational simplicity
- Collaborate across our global engineering team and with thousands of Oracle SaaS engineers who depend on our services
- Take ownership of services and componentsfrom architecture and implementation to production operation and long-term evolution
- Grow into a subject matter expert in key areas of the platform
- Participate in a global on-call rotation as part of shared responsibility for the systems we run
How we work
We are a globally distributed remote-friendly team with many members working from home across multiple time zones. Most of our team is located in the U.S. and India and we collaborate with SaaS engineers around the world. We value clear communication async-friendly workflows and a culture of trust and ownership.
Qualifications
Career Level - IC5
Required Experience:
Senior IC
About Company
As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity. We know that true innovation starts when eve ... View more