Group Product Manager, Compute Platform
Mountain View, CA - USA
Job Summary
Company Introduction
We exist to wow our customers. We know were doing the right thing when we hear our customers say How did we ever live without Coupang Born out of an obsession to make shopping eating and living easier than ever were collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We are one of the fastest-growing e-commerce companies that established an unparalleled reputation for being a dominant and reliable force in South Korean commerce.
We are proud to have the best of both worlds a startup culture with the resources of a large global public company. This fuels us to continue our growth and launch new services at the speed we have been since our inception. We are all entrepreneurs surrounded by opportunities to drive new initiatives and innovations. At our core we are bold and ambitious people that like to get our hands dirty and make a hands-on impact. At Coupang you will see yourself your colleagues your team and the company grow every day.
Our mission to build the future of commerce is real. We push the boundaries of whats possible to solve problems and break traditional tradeoffs. Join Coupang now to create an epic experience in this always-on high-tech and hyper-connected world.
About Us
Coupang is at the forefront of the AI and high-performance computing (HPC) revolution. We are building a next-generation cloud platform designed to provide developers researchers and enterprises with seamless scalable and powerful access to accelerated computing. As the demand for AI machine learning and data-intensive workloads skyrockets we are looking for a visionary product leader to define the future of our core compute offerings.
Job Overview
CIC is looking for a Group Product Manager to own the foundationalcomputeplatform that powers enterprise AI workloads. This role spans fleet management capacity management bare metal virtualizedcompute node lifecycle placement reservations and infrastructure-level customer experience.
The Product Manager willbe responsible forturning physical GPU and CPU capacity into reliable customer-ready observable and billablecomputeproducts. This includes defining how capacity is reserved provisionedvalidated monitoredmaintained packaged and exposed to enterprise customers.
This role sits at the compute foundation layer. It enables higher-level orchestration and workload services such as Kubernetes Slurm Ray jobs notebooks and inference candidate should understand how those orchestration and workload systems depend on foundationalcomputeinfrastructure.
Key Responsibilities
- Define the product strategy and roadmap for CICs compute platform across fleet capacity bare metal and virtualizedcompute working backwards from customer AI workload requirements.
- Own the product lifecycle from infrastructure capacity to customer-readycompute including reservation provisioning lifecycle actions observability billing integration maintenance and deprecation with clear linkage to customer workload readiness.
- Define customer-facing abstractions for capacity node pools bare metal instances VM instances placement policies OS/runtime images and reserved compute based on how customers run AI training inference and cluster operations.
- Partner with engineering and infrastructure operations to define fleet readiness lifecycle states failure handling maintenance workflows and operational requirements.
- Partner with sales solutions support and finance to understand enterprise AI workload requirements and translate them into compute offerings including reserved capacity dedicated infrastructure and VM/bare metal packaging.
- Define product requirements for supported compute configurations including GPU type node shape OS/runtime image network/storage attachment and compatibility expectations for customer AI workloads.
- Improve the reliability usability and supportability of CIC compute products across customer onboarding provisioning and day-2 operations.
- Drive roadmap decisions that improveutilization reduce stranded capacity increase enterprise readiness and lower operational support burden.
- Support enterprise customer discovery and roadmap prioritization for bare metal VMs reserved capacity and dedicated infrastructure with specific attention to customer workload patterns performance needs operational workflows and production readiness.
- Create clear product narratives customer-facing materials sales enablement launch plans and executive updates for enterprise compute offerings.
- Track product and business metrics such as usable capacityallocatedcapacity stranded capacity provisioning time replacement timeutilization revenue per deployed GPU and support burden.
Basic Qualifications
- 8 years of product management technical product management or equivalent product leadership experience in cloud infrastructurecompute virtualization GPU cloud HPC private cloud or enterprise infrastructure platforms.
- Experience working backwards from customer workload requirements to define infrastructure products ideally for AI training fine-tuning inference HPC data-intensive workloads or enterprise production systems.
- Experience with foundationalcomputeproducts such as virtual machines bare metal cloud instances node pools fleet management capacity management or infrastructure control planes.
- Strong understanding of infrastructure concepts including provisioning lifecycle management placement quota reservations OS images networking storage attachment observability and billing integration.
- Familiarity with GPU-based infrastructure and the operational considerations that make compute capacity customer-ready including drivers firmware OS images high-performance networking and workload compatibility.
- Ability to translate customer AI workload needs such as distributed training inference serving data movement checkpointing and cluster operations into product requirements forcomputecapacity lifecycle observability and enterprise readiness.
- Experience partnering with engineering and infrastructure teams on technically complex systems while driving product outcomes roadmap decisions prioritization and business impact.
- Experience supporting enterprise customers including workload discovery requirements definition launch readiness customer-facing documentation GTM enablement and post-launch adoption measurement.
- Strong analytical judgment around workload requirementsutilization capacity planning product readiness revenue impact support cost and customer adoption.
- Strong written and verbal communication skills with engineering infrastructure operations finance sales support executive stakeholders and enterprise customers.
Preferred Qualifications
- Experience at ahyperscaler neo-cloud GPU cloud provider HPC cloud provider private cloud platform or infrastructure SaaS company.
- Experience with NVIDIA GPU infrastructure CUDA NCCL OFED driver compatibility firmware lifecycle GPU health/telemetry or supported GPU software stack management.
- Experience with distributed AI training or inference infrastructure includingSlurm Kubernetes Ray model serving platforms high-performance networking shared storage or checkpointing workflows.
- Experience with reserved capacity committed-use contracts dedicated clusters savings plans or enterprise infrastructure commitments.
- Experience with bare metal-as-a-service GPU passthrough virtualization VM image lifecycle custom images or cloud control plane products.
- Application Review - Phone Interview - Onsite (or Virtual Onsite) Interview Offer
- The exact nature of the recruitment process may vary according to the specific job and may be changed due to scheduling or other circumstances.
- Interview schedules and the results will be informed to the applicant via the e-mail address submitted at the application stage
Details to Consider
- This job posting may be closed prior to the stated end date for application if all openings are filled.
- Coupang has the right to rescind an offer of employment if a candidate is found to have submitted false information as part of the application process.
- Those eligible for employment protection (recipients of veterans benefits the disabled etc.) may receive preferential treatment for employment in accordance with applicable laws.
Privacy Notice
- Your personal information will be collected and managed by Coupang as stated in the Application Privacy Notice located below: is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to actual or perceived race (including traits historically associated with race including but not limited to hair texture and protective hair styles) color religion religious creed (including religious dress and grooming practices) sex or gender (including pregnancy childbirth breastfeeding and medical conditions related to pregnancy childbirth or breastfeeding) gender identity gender expression sexual orientation ancestry national origin (including language use restrictions) age (40 and over) physical or mental disability medical condition genetic information HIV/AIDS or Hepatitis C status family status (including but not limited to marital or domestic partnership status) military or veteran status use of a trained dog guide or service animal political activities or affiliations ancestry citizenship family and medical leave status status as a victim of any violent crime or any other characteristic or class protected by the laws or regulations in the locations where we operate. Coupang is also committed to providing a safe work environment for its employees and its consumers.If you need assistance and/or a reasonable accommodation in the application of recruiting process due to a disability please contact us at.
Job Requisition ID: R0059428
Job Requisition ID: R0059428
Required Experience:
IC
About Company
Join us to innovate. Rocket your career. Collaborate with teams across the globe. Find your role and learn more about our culture.