What are we looking for
Shape the Future of Observability at Scale: Were seeking a Developer driven by a deep passion for observability. Imagine building the very systems that provide SentinelOne with comprehensive real-time visibility across our vast global platform delivering actionable insights precisely when and where theyre this central role youll leverage your expertise to design and implement robust solutions for high-volume data collection storage and analysis fundamentally enabling us to monitor our own solutions. This is your chance to gain significant ownership over critical infrastructure profoundly impacting engineering teams ability to gain insights and accelerate software delivery.
What will you do
As a Staff Infrastructure Engineer youll be a pivotal technical leader and architect within our Observability team driving strategic initiatives and shaping the future of our critical systems. You will leverage your deep expertise to design implement and optimize solutions that underpin SentinelOnes global platform directly empowering engineering teams across the organization.
Your responsibilities will include:
- Drive exemplary operational efficiency for critical observability services (Grafana Prometheus Thanos OTEL) meticulously balancing unwavering reliability with shrewd cost-effectiveness. This includes expertly optimizing cloud resource utilization and strategically aligning workloads with optimal machine types across our multi-cloud environment.
- Champion automation to drastically reduce operational toil and minimize pager burden freeing up engineering cycles for innovation.
- Cultivate robust operational visibility by rigorously implementing Infrastructure as Code (IaC) embedding comprehensive observability and championing industry best practices.
- Architect and implement robust scalable systems and platforms that directly empower SentinelOne engineers to deliver features with unparalleled safety speed and reliability.
- Serve as a subject matter expert (SME) and actively administrate core observability tools including Grafana Prometheus Thanos/Mimir/Cortex and OTEL collectors/pipelines.
- Operate and innovate across diverse large-scale environments spanning Kubernetes clusters (EKS GKE) and core cloud platforms (AWS GCP).
- Lead swift and effective resolution of highly complex technical incidents and issues ensuring continuous system integrity and peak performance.
- Elevate team quality by meticulously reviewing technical designs and code providing insightful constructive feedback that fosters growth and upholds SentinelOnes high standards.
- Drive impactful cross-functional collaboration strategically engaging with diverse teams to define system requirements and ensure our platform robustly meets the evolving needs of all stakeholders.
- Take end-to-end ownership of critical features from initial requirements refinement through to flawless production deployment and ongoing operational excellence.
- Participate in on-call rotations providing expert-level support to ensure the continuous reliability and readiness of our production systems.
What skills and knowledge should you bring
As a Staff Infrastructure Engineer youre not just bringing a skill set; youre bringing a wealth of proven experience in navigating complex infrastructure challenges and driving impactful scalable solutions. We seek seasoned professionals who can elevate our teams capabilities strategically influence our technical direction and mentor others.
Your background should include:
- A distinguished track record of 7 years in IT or a related technical field demonstrating sustained growth and impact.
- Profound hands-on experience in architecting and optimizing comprehensive observability solutions.
- Demonstrated mastery in cutting-edge infrastructure design and robust cloud architecture.
- Extensive proven experience in ensuring the extreme reliability of high-scale SaaS products.
- Deep expertise with foundational observability technologies including Grafana Prometheus Thanos/Mimir/Cortex OTEL or comparable advanced platforms.
- A strong command of container orchestration systems like Kubernetes proficiently utilizing tools such as Helm Kustomize and similar.
- Valuable familiarity with the unique complexities of on-premises and air-gapped Kubernetes deployments.
- Substantial multi-cloud experience possessing deep expertise in at least one major platform (AWS GCP).
- A solid grasp of modern CI/CD principles and advanced deployment automation tools (e.g. GitHub Actions).
- Proficiency in various sophisticated deployment strategies such as blue-green rolling deployments and canary releases.
- Exceptional programming proficiency in a mainstream language with deep expertise in GoLang being highly desirable (or a strong willingness to master GoLang if proficient in another major programming language).
- Comprehensive understanding and hands-on experience withInfrastructure as Code (IaC) tools like Terraform and Ansible.
Why Us
Join a cutting-edge company tackling extraordinary challenges alongside top industry talent. Enjoy flexible hybrid work in Prague (Karlin) Brno (Clubco) or remotely across CZ/SK. Only Prague-based employees are required to work from the office at least two days per week.
Competitive Benefits Package:
- Stock & Bonuses:Grant of Restricted Stock Units with a 4-year vesting plan annual performance-based bonuses and an employee stock purchase plan.
- Time Off & Well-being:Flexible Time Off on top of the standard 5 weeks vacation flexible paid sick days fully paid Short Term Sick/Nursing Leave 16-week parental leave grandparent leave and additional company holidays.
- Insurance & Health:Pension Insurance Contribution Premium life insurance Private medical care (for you and 1) and a Global Employee Assistance Program.
- Work Perks:Monthly meal and well-being allowance high-end MacBook/Windows laptop work-from-home support and in-office refreshments.
- Growth & Community:LinkedIn Learning internal mentoring educational support generous referral bonuses and optional company events (sports BBQs charity).
Be part of an inclusive innovative workplace that values belonging flexibility and growth!
Required Experience:
Staff IC