drjobs Principal Engineer SRE

Principal Engineer SRE

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Mountain View, CA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Site Reliability Engineers (SREs at Coupang is a missioncritical role that combines software and system engineering to build run and scale our complex largescale ecommerce systems. As part of the Site Reliability Engineering team you will be responsible for ensuring all our customerfacing services are healthy monitored automated and designed to scale. As an SRE organization we take pride in handling operations as an engineering problem with automation automationfirst approach. You will use your background to build bestinclass infrastructure automation for areas such as Observability Incident Management Disaster Recovery Load testing Capacity engineering and many more. In this role you will work very closely with our product development teams from an early stage of design to all the way helping resolve any production incidents maintaining SLI/SLA bar for production services and influencing them with SRE principles and best practices. If you take pride in complete ownership have a passion for solving complex technical challenges for largescale distributed systems and demeanor to work and communicate effectively across team boundaries this is the role for you!

Key Responsibilities:

  • Serve as a primary point responsible for the reliability health and performance of all Coupang customerfacing services.

  • Gain deep knowledge of Coupang application workflow and dependencies.

  • Spearheading and conceptualizing revolutionary designs in critical service architecture.

  • Conducting comprehensive architecture reviews leading rearchitecting initiatives to set industryleading benchmarks in performance reliability and availability.

  • Lead and drive largescale technical initiatives across multiple engineering teams.

  • Be able to drive collaboration effectively across organizational boundaries be able to build strong stakeholder relationships to achieve broad organizational objectives.

  • Identify and implementscalable solutions for complex technical problems. Be the change driver.

  • Selfmotivated to be able to navigate the ambiguity with large initiatives and find solutions to accomplish the goal.

  • Be the SRE champion/lead working with the rest of the technical leaders across Coupang to define and drive the engineering roadmap.

  • Contribute towards hiring and building a worldclass team. Mentor and coachjunior engineers on the team.

  • Communicate effectively with people at all levels of the organization.

Essential Qualifications:

  • 10 years of industry experience building and operating largescale distributed systems.

  • Deep UNIX/Linux systems knowledge and administration background.

  • Strong programming skills in one or more of Python Java Golang or C.

  • Strong problemsolving and analytical skills spanning systems networks (TCP/IP) and code with a focus on datadriven decisionmaking.

  • Proficient with cloudbased infrastructure including AWS Azure or Google Cloud Platform.

  • Strong understanding of DevOps and SRE practices including continuous integration continuous delivery and infrastructure as code IaC.

  • Proficient with containerization and orchestration technologies such as Docker and Kubernetes.

  • Knowledge of observability ecosystem including metrics logging tracing and tools such as Prometheus Grafana Elastic Stack Datadog or New Relic.

  • Excellent communication and collaboration skills with the ability to work with teams across distinct functions and technical domains.

Preferred Qualifications:

  • Masters degree in computer science Engineering or a related technical field.

  • Prior experience working with largescale webbased Java architectures and JVM configuration.

  • Professional certifications in cloud platforms monitoring tools or related technologies.

  • Previous experience working on a largescale ecommerce platform.


Required Experience:

Staff IC

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.