Please note that under Federal & FedRAMP regulations hiring for this role is limited to US citizens only.
FedRamp Staff may be subject to customer or third-party background checks up to and including secret clearance if required by their role at SentinelOne.
What are we looking for:
Join SentinelOne as an Infrastructure Engineer and play a crucial role in building infrastructure. SentinelOnes XDR vision of one autonomous cybersecurity platform depends on our ability to collect store and analyze data efficiently. If you are interested in building infrastructure for data platforms that run reliably with 99.99% uptime can scale to petabytes of data ingested every day (and >2 trillion events processed) and return queries with p95 latencies less than 5 seconds you will love this opportunity.
Tools we use: Orchestration tools like Kubernetes (EKS GKE) Jenkins Github Actions ArgoCD Terraform
US Eastern Time Zone preferred due to collaboration requirements with teams in Europe and India.
What will you do
- Lead the design and operation of distributed data servicesincluding Kafka and Redisrunning at massive scale across Kubernetes clusters and multi-cloud environments.
- Unlock complete cloud portability for SentinelOnes services by building a highly automated self-service infrastructure that can run seamlessly across AWS GCP and air-gapped on-prem environments.
- Manage data infrastructure supporting 5 PB/day ingestion ensuring low-latency high-throughput and cost-effective operation at global scale.
- Consolidate and optimize multi-tenant Kafka clusters to reduce cost improve resilience and streamline operations.
- Drive Redis and Kafka lifecycle automation using GitOps principles (ArgoCD Terraform) reducing operational toil and minimizing pager fatigue.
- Define and implement standards for observability HA backup and DR of stateful workloads in Kubernetes.
- Partner with FinOps and engineering stakeholders to continuously optimize performance cost and operational overhead across data platform components.
- Own the end-to-end platform experience for mission-critical open-source systems such as Kafka Redis and Cassandra serving hundreds of product teams.
What skills and knowledge should you bring
- 8 years of experience in infrastructure/platform engineering with a proven track record of operating stateful distributed systems at scale.
- Deep hands-on experience with Kafka and Redis running in Kubernetes including performance tuning scaling partitioning persistence and operator-based lifecycle management.
- Strong understanding of Kubernetes internals and best practices for managing both stateless and stateful workloads in production environments.
- Experience providing Database- or Messaging-as-a-Service (DBaaS/PaaS) for internal development teams or external customers.
- Exposure to multi-cloud environments with strong expertise in at least one major provider: AWS GCP or Azure.
- Experience with Infrastructure as Code and GitOps practices (Terraform ArgoCD Pulumi).
- Familiarity with advanced deployment strategies (blue-green canary rolling).
- Strong scripting or development skills (e.g. Python Go or similar).
- Solid understanding of CI/CD pipelines and workflow automation (GitHub Actions Argo Workflows etc.).
Why us
You will be joining a cutting-edge company where you will tackle extraordinary challenges and work with the very best in the industry.
- Medical Vision Dental 401(k) Commuter Health and Dependent FSA
- Unlimited PTO
- Industry-leading gender-neutral parental leave
- Paid Company Holidays
- Paid Sick Time
- Employee stock purchase program
- Disability and life insurance
- Employee assistance program
- Gym membership reimbursement
- Cell phone reimbursement
- Numerous company-sponsored events including regular happy hours and team-building events
Required Experience:
Staff IC