Nexthink is looking for a Lead Site Reliability Engineer who is passionate about building and running a high-performance cloud platform and enabling best-in-class site reliability and operations practices. This role will support US-based operations generally but will in addition focus on enabling Nexthink to deliver to the US Public Sector market in particular a FedRAMP Moderate offering. The candidate will drive the development of modern cloud-native SRE processes and the management and operations for Nexthinks multi-tenant microservices-based cloud platform. The platform has multiple instances deployed across the globe.
This role involves working closely with cross-functional teams to integrate reliability and security into our systems ensuring they meet federal security standards. The ideal candidate will have extensive experience in both software engineering and systems administration with a strong understanding of FedRAMP concepts requirements and security practices.
Leadership and Team Management:
- Lead mentor and develop a team of US-based Site Reliability Engineers.
- Foster a culture of continuous improvement collaboration and innovation.
Infrastructure Management:
- Oversee the design deployment and management of scalable and secure cloud infrastructure.
- Drive automation of infrastructure provisioning configuration and management using Infrastructure as Code (IaC) tools.
Monitoring and Performance:
- Develop and maintain comprehensive monitoring logging and alerting systems to ensure high availability and performance.
- Lead efforts in performance tuning and optimization for applications and infrastructure.
Security and Compliance:
- Ensure implementation and maintenance of security controls and best practices to achieve FedRAMP compliance.
- Conduct and oversee regular security assessments vulnerability scans and penetration testing.
- Collaborate with the compliance team to prepare for and respond to FedRAMP audits.
Incident Management:
- Lead incident management efforts ensuring rapid resolution and thorough root cause analysis.
- Develop and implement strategies for improving incident response and minimizing downtime.
Collaboration and Communication:
- Work closely with development operations and security teams to integrate reliability and security into the software development lifecycle.
- Communicate effectively with stakeholders providing regular updates on system performance reliability and compliance status.
Qualifications :
- Bachelors degree in Computer Science Engineering or a related field (or equivalent experience).
- 5 years of experience in site reliability engineering DevOps or a related role with at least 2 years in a leadership or managerial position.
- Proficiency in cloud platforms (AWS Azure GCP) and cloud-native services.
- Strong scripting and programming skills (Python Bash Go or similar).
- Experience with Infrastructure as Code (IaC) tools such as Terraform CrossPlane CloudFormation or Ansible.
- Knowledge of containerization and orchestration (Docker Kubernetes).
- Familiarity with CI/CD pipelines and tools (Jenkins GitLab GitHub etc.).
- In-depth knowledge of FedRAMP requirements and best practices.
- Experience with security tools and practices (SIEM IDS/IPS firewalls).
- Understanding of network security encryption and secure software development practices.
- Ability to collaborate with and foster effective communication with global engineering teams in EU and India timezones.
Additional Information :
We are the pioneers and trailblazers of a global IT Market Category (DEX) that is shaping the future of how the world works giving our customers IT Teams total digital visibility across their enterprise. Our innovative solutions integrate real-time analytics automation and employee feedback across all endpoints. This enables our IT teams to solve complex technical challenges create ever more productive workplaces and deliver happy satisfied employees in the digital workplace.
With over 1000 employees across 5 continents Nexthink operates as One Team connecting collaborating and innovating to continuously grow. We call our employees Nexthinkers and our commitment to diversity inclusion and equity is second to none. We currently have over 75 nationalities working with us from all cultures and backgrounds speaking many different languages.
Please note that not all the benefits listed above are available for temporary contract and internship roles. To ensure you have the most up-to-date information we recommend checking with your Recruitment Partner.
Total Rewards @ Nexthink
At Nexthink we offer one of the most comprehensive and generous benefits plans. Your total rewards compensation package includes base salary and may also include a commission or performance bonus plan. We provide our US employees with 100% covered company benefits that consist of health dental vision as well as access to life insurance long-term disability and accidental death/personal loss coverage.
In addition we offer:
- Flexible Hours and unlimited vacation (employees have unlimited paid time off on top of the 15 days of holidays we offer) 11 company-paid holidays and 3 extra days for volunteering.
- Hybrid work model that balances office and remote work with structured onboarding to foster connections and team integration.
- Free access to professional training platforms to explore your interests and enhance your skills.
- Up to 16 weeks of paid leave for birthing parents/primary caregivers 6 weeks for secondary caregivers.
- Plan for the future with a 401(k) plan featuring up to 4% company matching contributions vesting immediately to grow your retirement savings.
- Bonuses for referring successful hires after three months of continuous employment.
Base salary ranges are determined by country role level experience and skills. The range displayed on each job posting reflects Nexthinks good faith determination of the minimum and maximum targets for new hire salaries across all US locations. Individual pay is determined by related factors including job skills experience and relevant education or training which may impact a final offer. Your Talent Acquisition Partner can share more about the specific salary range during the hiring process.
Remote Work :
No
Employment Type :
Full-time
Nexthink is looking for a Lead Site Reliability Engineer who is passionate about building and running a high-performance cloud platform and enabling best-in-class site reliability and operations practices. This role will support US-based operations generally but will in addition focus on enabling Ne...
Nexthink is looking for a Lead Site Reliability Engineer who is passionate about building and running a high-performance cloud platform and enabling best-in-class site reliability and operations practices. This role will support US-based operations generally but will in addition focus on enabling Nexthink to deliver to the US Public Sector market in particular a FedRAMP Moderate offering. The candidate will drive the development of modern cloud-native SRE processes and the management and operations for Nexthinks multi-tenant microservices-based cloud platform. The platform has multiple instances deployed across the globe.
This role involves working closely with cross-functional teams to integrate reliability and security into our systems ensuring they meet federal security standards. The ideal candidate will have extensive experience in both software engineering and systems administration with a strong understanding of FedRAMP concepts requirements and security practices.
Leadership and Team Management:
- Lead mentor and develop a team of US-based Site Reliability Engineers.
- Foster a culture of continuous improvement collaboration and innovation.
Infrastructure Management:
- Oversee the design deployment and management of scalable and secure cloud infrastructure.
- Drive automation of infrastructure provisioning configuration and management using Infrastructure as Code (IaC) tools.
Monitoring and Performance:
- Develop and maintain comprehensive monitoring logging and alerting systems to ensure high availability and performance.
- Lead efforts in performance tuning and optimization for applications and infrastructure.
Security and Compliance:
- Ensure implementation and maintenance of security controls and best practices to achieve FedRAMP compliance.
- Conduct and oversee regular security assessments vulnerability scans and penetration testing.
- Collaborate with the compliance team to prepare for and respond to FedRAMP audits.
Incident Management:
- Lead incident management efforts ensuring rapid resolution and thorough root cause analysis.
- Develop and implement strategies for improving incident response and minimizing downtime.
Collaboration and Communication:
- Work closely with development operations and security teams to integrate reliability and security into the software development lifecycle.
- Communicate effectively with stakeholders providing regular updates on system performance reliability and compliance status.
Qualifications :
- Bachelors degree in Computer Science Engineering or a related field (or equivalent experience).
- 5 years of experience in site reliability engineering DevOps or a related role with at least 2 years in a leadership or managerial position.
- Proficiency in cloud platforms (AWS Azure GCP) and cloud-native services.
- Strong scripting and programming skills (Python Bash Go or similar).
- Experience with Infrastructure as Code (IaC) tools such as Terraform CrossPlane CloudFormation or Ansible.
- Knowledge of containerization and orchestration (Docker Kubernetes).
- Familiarity with CI/CD pipelines and tools (Jenkins GitLab GitHub etc.).
- In-depth knowledge of FedRAMP requirements and best practices.
- Experience with security tools and practices (SIEM IDS/IPS firewalls).
- Understanding of network security encryption and secure software development practices.
- Ability to collaborate with and foster effective communication with global engineering teams in EU and India timezones.
Additional Information :
We are the pioneers and trailblazers of a global IT Market Category (DEX) that is shaping the future of how the world works giving our customers IT Teams total digital visibility across their enterprise. Our innovative solutions integrate real-time analytics automation and employee feedback across all endpoints. This enables our IT teams to solve complex technical challenges create ever more productive workplaces and deliver happy satisfied employees in the digital workplace.
With over 1000 employees across 5 continents Nexthink operates as One Team connecting collaborating and innovating to continuously grow. We call our employees Nexthinkers and our commitment to diversity inclusion and equity is second to none. We currently have over 75 nationalities working with us from all cultures and backgrounds speaking many different languages.
Please note that not all the benefits listed above are available for temporary contract and internship roles. To ensure you have the most up-to-date information we recommend checking with your Recruitment Partner.
Total Rewards @ Nexthink
At Nexthink we offer one of the most comprehensive and generous benefits plans. Your total rewards compensation package includes base salary and may also include a commission or performance bonus plan. We provide our US employees with 100% covered company benefits that consist of health dental vision as well as access to life insurance long-term disability and accidental death/personal loss coverage.
In addition we offer:
- Flexible Hours and unlimited vacation (employees have unlimited paid time off on top of the 15 days of holidays we offer) 11 company-paid holidays and 3 extra days for volunteering.
- Hybrid work model that balances office and remote work with structured onboarding to foster connections and team integration.
- Free access to professional training platforms to explore your interests and enhance your skills.
- Up to 16 weeks of paid leave for birthing parents/primary caregivers 6 weeks for secondary caregivers.
- Plan for the future with a 401(k) plan featuring up to 4% company matching contributions vesting immediately to grow your retirement savings.
- Bonuses for referring successful hires after three months of continuous employment.
Base salary ranges are determined by country role level experience and skills. The range displayed on each job posting reflects Nexthinks good faith determination of the minimum and maximum targets for new hire salaries across all US locations. Individual pay is determined by related factors including job skills experience and relevant education or training which may impact a final offer. Your Talent Acquisition Partner can share more about the specific salary range during the hiring process.
Remote Work :
No
Employment Type :
Full-time
View more
View less