Manager, Software Engineering, Compute Infrastructure

LinkedIn

Job Location:

Mountain View, CA - USA

Monthly Salary: Not Disclosed
Posted on: 2 days ago
Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

We are the Host Health and Remediation team within Compute Infrastructure, focused on advancing the reliability and operability of LinkedIn's compute infrastructure. Our mission is to provide a unified, reliable, and transparent host health signal and to remediate unhealthy hosts across LinkedIn's entire server fleet.

The team offers the opportunity to tackle large-scale, technically challenging problems, work on cutting-edge infrastructure systems, and directly contribute to LinkedIn's fleet reliability through automation, observability, and data-driven insights. The impact of this work is felt across the entire company.

This role will be based in Mountain View, Bellevue, or San Francisco.

At LinkedIn, our approach to flexible work is centered on trust and optimized for culture, connection, clarity, and the evolving needs of our business. The work location of this role is hybrid, meaning it will be performed both from home and from a LinkedIn office on select days, as determined by the business needs of the team.

Join LinkedIn's Host Health and Remediation team to shape the next generation of our compute infrastructure! In this role, you'll lead efforts on large-scale systems, automating host health monitoring and remediation to boost reliability and minimize downtime across the entire server fleet. This high-visibility, company-wide initiative offers the opportunity to design scalable solutions, influence key infrastructure decisions, and collaborate across multiple teams.

This manager will play a pivotal role in ensuring the reliability, observability, and operability of LinkedIn's entire server fleet. By leading the Host Health and Remediation team, this role directly impacts the availability and performance of LinkedIn's compute infrastructure, enabling all services and products across the company to operate reliably at scale. The position combines technical leadership, strategic planning, and team development, making it critical for advancing LinkedIn's long-term infrastructure goals.

If you have experience managing teams that build large-scale compute infrastructure systems and want to make a lasting impact on LinkedIn's future, we'd love to connect!

Responsibilities

  • Talent Management: Recruit, coach, mentor, and grow high-performing engineers; foster a culture of accountability, innovation, and continuous learning.
  • Technical Leadership: Guide architectural decisions, review designs, and ensure scalable, reliable infrastructure solutions.
  • Strategic Planning: Define and drive the technical roadmap for host health monitoring and remediation; prioritize initiatives with maximum business impact.
  • Cross-Team Collaboration: Partner with other infrastructure, operations, and platform teams to define best practices, share insights, and influence company-wide reliability improvements.
  • Operational Excellence: Ensure systems are observable, automated, and proactively maintained to minimize downtime and maximize fleet health.

Qualifications

Basic Qualifications

  • BA/BS degree in Computer Science or a related technical discipline, or equivalent practical experience.
  • 1 year(s) of management experience or 1 year(s) of staff-level engineering experience with management training.
  • 5 years of industry experience in software design, development, and large-scale software engineering.
  • Strong understanding of large-scale systems, reliability engineering, monitoring, and automation.
  • Proven experience managing engineering teams in distributed systems or compute infrastructure.
  • Experience programming in object-oriented languages such as Java, C, Python, Go, Rust, C#, and/or functional languages such as Scala, or other relevant coding languages.

Preferred Qualifications

  • MS or PhD in Computer Science or a related technical discipline.
  • 2 years of hands-on software engineering/technical management and people management experience.
  • 7 years of industry experience in software design, development, and algorithm-related solutions.
  • 5 years of programming experience in languages such as Java, C, Python, Go, Rust, C#, and/or functional languages such as Scala, or other relevant coding languages.
  • Experience in architecting, building, and running large-scale distributed systems.
  • Experience with industry, open-source, and/or academic research in technologies such as Hadoop, Spark, Kubernetes, Feather, GraphQL, gRPC, Apache Kafka, Pinot, Samza, or Venice.
  • Experience with open-source project management and governance.

Suggested Skills

  • Distributed systems
  • Backend Systems Infrastructure
  • Java/Golang/Rust/Python

You will Benefit from our Culture

We strongly believe in the well-being of our employees and their families. That is why we offer generous health and wellness programs and time away for employees of all levels. LinkedIn is committed to fair and equitable compensation practices.

The pay range for this role is $200,000 - $268,000. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to skill set, depth of experience, certifications, and specific work location. This may be different in other locations due to differences in the cost of labor.

The total compensation package for this position may also include annual performance bonus, stock, benefits, and/or other applicable incentive compensation plans. For more information visit Information :

Equal Opportunity Statement 

We seek candidates with a wide range of perspectives and backgrounds, and we are proud to be an equal opportunity employer. LinkedIn considers qualified applicants without regard to race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other legally protected class.

LinkedIn is committed to offering an inclusive and accessible experience for all job seekers, including individuals with disabilities. Our goal is to foster an inclusive and accessible workplace where everyone has the opportunity to be successful.

If you need a reasonable accommodation to search for a job opening, apply for a position, or participate in the interview process, connect with us at and describe the specific accommodation requested for a disability-related limitation.

Reasonable accommodations are modifications or adjustments to the application or hiring process that would enable you to fully participate in that process. Examples of reasonable accommodations include, but are not limited to:

  • Documents in alternate formats or read aloud to you
  • Having interviews in an accessible location
  • Being accompanied by a service dog
  • Having a sign language interpreter present for the interview

A request for an accommodation will be responded to within three business days. However, non-disability related requests, such as following up on an application, will not receive a response.

LinkedIn will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by LinkedIn, or (c) consistent with LinkedIn's legal duty to furnish information.

San Francisco Fair Chance Ordinance

Pursuant to the San Francisco Fair Chance Ordinance, LinkedIn will consider for employment qualified applicants with arrest and conviction records.

Pay Transparency Policy Statement

As a federal contractor, LinkedIn follows the Pay Transparency and non-discrimination provisions described at this link:

Data Privacy Notice for Job Candidates

Please follow this link to access the document that provides transparency around the way in which LinkedIn handles personal data of employees and job applicants: Work :

No


Employment Type:

Full-time

Key Skills

  • IT Experience
  • Project Management Methodology
  • Data Center Experience
  • LAN
  • Cloud Infrastructure
  • Computer Networking
  • Visio
  • ITIL
  • Project Management
  • SAN
  • Microsoft Project
  • Project Management Lifecycle

About Company

LinkedIn is the world’s largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exciting opportunities, build necessary skills, and gain valuable insights every day. We’re ...