Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailAt eBay were more than a global ecommerce leader were changing the way the world shops and sells. Our platform empowers millions of buyers and sellers in more than 190 markets around the world. Were committed to pushing boundaries and leaving our mark as we reinvent the future of ecommerce for enthusiasts.
Our customers are our compass authenticity thrives bold ideas are welcome and everyone can bring their unique selves to work every day. Were in this together sustaining the future of our customers our company and our planet.
Join a team of passionate thinkers innovators and dreamers and help us connect people and build communities to create economic opportunity for all.
Join the Marketing Technologies Platform Team that powers billions of communications per day sent to customers across the world. This team plays a pivotal role in delivering personalized and timely customer engagement experiences across eBays global user base.
We are seeking a highly motivated and experienced Senior Platform Reliability Engineer (PRE) to join our growing team. In this critical role you will be responsible for ensuring the reliability scalability and performance of our core platform and services. You will apply Site Reliability Engineering (SRE) principles to automate operations improve system resilience and drive a culture of continuous improvement across our engineering organization.
Reliability & Performance: Design implement and maintain systems and processes to ensure the high availability performance and scalability of our production platform.
Automation: Develop and implement automation for infrastructure provisioning deployment monitoring and incident response reducing manual toil and improving operational efficiency.
Observability: Implement and enhance comprehensive monitoring logging and alerting solutions to provide deep insights into system health and performance.
Incident Management: Lead incident response efforts conduct root cause analyses and implement preventative measures to minimize future occurrences.
Capacity Planning: Collaborate with development teams to forecast resource needs and ensure the platform can handle anticipated growth and traffic spikes.
System Design & Architecture: Provide input on system architecture and design advocating for reliability scalability and operational best practices from the outset.
Tooling & Infrastructure: Evaluate select and implement new tools and technologies to improve our platforms reliability security and operational capabilities.
Collaboration & Mentorship: Work closely with development QA and security teams to embed reliability practices throughout the software development lifecycle. Mentor junior engineers on SRE principles and best practices.
Documentation: Create and maintain clear concise documentation for systems processes and troubleshooting guides.
Experience: 5 years of experience in a DevOps SRE or similar role focused on platform reliability and operations.
Cloud Platforms: Strong hands-on experience with at least one major cloud provider (e.g. AWS Azure GCP).
Containerization & Orchestration: Expertise with Docker and Kubernetes for deploying and managing microservices.
Infrastructure as Code: Proficiency with IaC tools such as Terraform CloudFormation or Ansible.
Scripting & Programming: Strong scripting skills (e.g. Python Bash) and experience with at least one compiled language (e.g. Go Java ) for automation and tool development.
Monitoring & Alerting: Experience with monitoring tools (e.g. Prometheus Grafana Datadog New Relic) and logging systems (e.g. ELK Stack Splunk).
CI/CD: Solid understanding and experience with CI/CD pipelines (e.g. Jenkins GitLab CI GitHub Actions).
AI Code Generation: Familiarity with foundational AI concepts and practical experience applying AI-powered coding generation (e.g. OpenAI Codex GitHub Copilot Anthropic Claude Cursor Windsurf or understanding of transformer-based code generation) will be a significant asset.
Networking: Fundamental understanding of networking concepts (TCP/IP DNS Load Balancing Firewalls).
Databases: Familiarity with database operations performance tuning and backup/recovery strategies (SQL and NoSQL).
Problem-Solving: Exceptional analytical and troubleshooting skills with a methodical approach to identifying and resolving complex system issues.
Communication: Excellent verbal and written communication skills capable of effectively communicating technical concepts to diverse audiences.
Education: Bachelors degree in Computer Science Engineering or a related field or equivalent practical experience.
Please see the Talent Privacy Noticefor information regarding how eBay handles your personal data collected when you use the eBay Careers website or apply for a job with eBay.
eBay is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race color religion national origin sex sexual orientation gender identity veteran status and disability or other legally protected you have a need that requires accommodation please contact us at. We will make every effort to respond to your request for accommodation as soon as possible. View our accessibility statement to learn more about eBays commitment to ensuring digital accessibility for people with disabilities.
The eBay Jobs website uses cookies to enhance your experience. By continuing to browse the site you agree to our use of cookies. Visit our Privacy Center for more information.
Required Experience:
Senior IC
Full-Time