What are we looking for
SRE organizations mission at SentinelOne (S1) is to keep our uptime promise to our customers by ensuring we meet our SLOs/SLAs help our engineering teams ship software to our customers fast and with quality and ensure our customers are successful.
As a Staff Site Reliability Engineer you will be a technical leader within the SRE organization responsible for setting the technical direction and driving the long-term reliability vision for SentinelOnes production service. You will be empowered to solve systemic cross-team challenges and improve the reliability scalability and performance of our entire service ecosystem. You will not just contribute to major initiatives like our Monitoring and Observability Uplift and Logging Pipeline modernization; you will be instrumental in leading the strategy and architecture for these large-scale projects ensuring they meet the long-term needs of the business.
What will you do
As a Staff SRE you will be a key technical leader strategist and mentor. You will operate across teams to solve the most challenging reliability and scalability problems at SentinelOne. Your responsibilities will include:
- Setting the technical direction for reliability across multiple services partnering with engineering leaders to create and execute long-term roadmaps.
- Identifying and eliminating entire classes of operational work by designing and building scalable automated platforms for use by all of SRE and Engineering.
- Leading post-mortems for major multi-system incidents and owning the strategic follow-up to address systemic root causes across the organization.
- Mentoring and developing senior engineers within the SRE organization acting as a force multiplier to level up the entire team.
- You will join a like minded team of SREs who help run our operations smoothly at scale by building a platform on which S1s services can run. If the thought of running a large scale cybersecurity platform on various cloud providers and air gapped environments excite you youve found the right place!
- As a team we value good written communication skills data driven decisions and a keen eye for continuous improvements. Youll help simplify have a passion for new ideas and know how to execute iteratively towards the final goal. We value candor and collaboration.
What skills and knowledge should you bring
- An extensive and proven track record (e.g. 10 years) in SRE/DevOps with deep experience leading large cross-functional technical projects from inception to completion.
- Deep architectural-level expertise across multiple cloud providers (AWS GCP Azure) with proven experience designing running and troubleshooting highly-available systems in complex multi-cloud and air-gapped environments.
- Great proficiency in one or more mainstream languages (e.g. Go Python) with demonstrable experience building scalable software and automation platforms.
- Strong Production experience with orchestration systems like Kubernetes Nomad or Mesos (We are a Kubernetes shop)
- Proven ability to set technical direction and influence the roadmap of multiple engineering teams without direct authority.
- Experience with SecOps & Compliance processes and their touch points with SRE is desired
- Polyglot experience with other SRE tools we integrate with more tools every day
Apart from the above technical skills following soft skills are required:
- A strong sense of business acumen and the ability to evaluate technical decisions in the context of cost risk and long-term company strategy
- Demonstrated experience in mentoring and growing senior engineers.
- Exceptional communication skills with the ability to articulate complex technical concepts to diverse audiences from junior engineers to executive leadership.
- Curiosity fast-learning pursuit to improvements great communication
- Ability to work in a diverse and distributed team
- A self-starter that is passionate and motivated by new technologies and has empathy for legacy systems
- A quick learner that can navigate through unfamiliar programming languages systems and processes
Why Us
You will be joining a cutting-edge company where you will tackle extraordinary challenges and work with the very best in the industry along with competitive compensation.
- Flexible working hours and hybrid/remote work model
- Flexible Time Off.
- Flexible Paid Sick Days.
- Global gender-neutral Parental Leave (16 weeks beyond the leave provided by the local laws)
- Generous employee stock plan in the form of RSUs (restricted stock units)
- On top of RSUs you can benefit from our attractive ESPP (employee stock purchase plan)
- Gym membership by Cultfit.
- Wellness Coach app with 3000 on-demand sessions daily interactive classes audiobooks and unlimited private coaching.
- Private medical insurance plan for you and your family.
- Life Insurance covered by S1 (for employees)
- Telemedical app consultation (Practo)
- Global Employee Assistance Program (confidential counseling related to both personal and work life matters)
- High-end MacBook or Windows laptop.
- Home-office-setup allowances (one time) and maintenance allowance.
- Internet allowances.
- Provident Fund and Gratuity (as per govt clause)
- NPS contribution (Employee contribution)
- Half yearly bonus program depending on the individual and company performance.
- Above standard referral bonus as per policy.
- Udemy Business platform for Hard/Soft skills Training & Support for your further educational activities/trainings
- Sodexo food coupons.
Required Experience:
Staff IC