Staff Site Reliability Engineer, Cloud

Kentik

Not Interested
Bookmark
Report This Job

profile Job Location:

Austin, TX - USA

profile Monthly Salary: $ 165000 - 200000
Posted on: 13 hours ago
Vacancies: 1 Vacancy

Job Summary

Who we are

Kentik is the network intelligence platform for modern infrastructure teams. Unlike traditional monitoring and observability tools we demystify complex network operations enabling organizations to deliver applications and innovation at scale. Built by network experts to make critical insight accessible to every engineer Kentik is the real-time source of truth that understands every network in context from data center to cloud to the internet. This single platform unifies and correlates cloud device flow synthetic data to turn telemetry into action. Market leaders like Akamai Dropbox and Zoom rely on Kentik to run manage and optimize their networks.

What we do

Our platform ingests trillions of records and serves hundreds of thousands of queries for our users each day. You will gain experience building a production quality high performance server-and-client SaaS application that handles uniquely high volumes of data.

We have built a team of world-class engineers network experts and technology thought leaders in a remote-friendly culture from day one. While prior experience in a remote environment is not required we highly value strong collaboration and communication skills as well as a high level of independence and autonomy.

What youll do

Kentik is looking for a Staff level Site Reliability Engineer (Cloud) to join our Product Engineering team to help build and maintain our Synthetics and Cloud product lines. These products have multiple applications deployed in various cloud providers all over the world. We manage these cloud applications using observability tooling automated build processes and adherence to configuration as code best practices.

Were looking for an experienced engineer who will work with engineering teams across the company to help grow our hardware and software infrastructure. We operate a well-organized well-instrumented platform and offer enormous opportunities for employee growth.

  • Make sure our real-time scalable infrastructure is set up for growth and working efficiently. Our infrastructure runs on our own hardware across multiple locations as well as all major cloud vendors
  • Work on tools and processes to better monitor our platform as well as ensuring its stability through our rapid growth
  • Deep-diving into diverse topics from firewalls and IP routing to database replication strategies or automating build processes
  • Collaborate with engineering and infrastructure teams on finding solutions from an operational perspective
  • Assist with expanding our cloud deployments across the major cloud providers
  • Contribute code code reviews and tools or patches to all kinds of existing code
  • Write design documents or collaborate on colleagues docs to introduce new features or changes into our infrastructure
  • Provide valuable feedback on team goals projects and processes. We believe in continuously improving our team

What youll bring

Studies have shown that some candidates tend to apply to jobs only if they meet 100% of the qualifications. We encourage you to apply if you meet most of the criteria - even if you dont match all of the qualifications your skills and experience could be valuable in this role!

  • 8 years of experience in cloud-based Systems Administration IT and/or SRE related projects
  • Expertise in public cloud environments such as AWS GCP Azure or OCI.
  • Strong command of containerization and orchestration using Docker and Kubernetes.
  • Solid programming and automation skills using Bash Python or Go.
  • Proficiency with Infrastructure as Code (IaC) and configuration management platforms such as Terraform Ansible and Puppet.
  • Proficiency in Linux administration and command-line tools (e.g. SSH grep awk).
  • Detailed understanding of major internet protocols (TCP/IP DNS HTTP TLS)
  • Networking administration experience: concepts such as routing firewalls (iptables) peering sound familiar
  • A passion for documenting code processes and infrastructure in runbooks and wikis
  • Worked with metrics monitoring solutions such as grafana prometheus telegraf and OpenTelemetry
  • Experience creating and managing tickets with third party vendors and owning cloud vendor partner relationships

Nice to haves:

  • Familiarity with Kubernetes automation tools specifically managing complex deployments with Helm and Helmfile.
  • Knowledge of scaling Kubernetes workloads and compute infrastructure
  • Experience optimizing CI/CD build and deploy pipelines using GitHub Actions and Jenkins.
  • Exposure to PagerDuty Integrations
  • Knowledge of SRE DevOps and GitOps practices and principles

Our tech stack

  • Our core data engine and platform are primarily written in Go
  • We use Express for application serving and React as our primary UI framework
  • We also use some JS and Python for tooling/scripting
  • In addition to our own database we use Postgres Kafka Mysql and Redis
  • Internal and public APIs expose both rest/json and gRPC endpoints
  • Haproxy Envoy for API traffic routing and balancing
  • Github for source control PRs issues
  • Jenkins for automated builds

What we offer

Kentik is a fully remote company that operates globally. We seek professionals that will help us thrive as an organization and in turn to broaden and enhance your career. Were very thorough in the interview process to understand your skills and how they will relate to your successful growth here at Kentik. Our compensation philosophy encompasses a fair program for all in order to attract engage and retain talented individuals who will drive our business and wow our customers.

The compensation range for this position is: $165000 - $200000.This range reflects the low and high end of the U.S. compensation range Kentik reasonably and generally expects to pay the hired candidate in this role. The actual compensation offered may be lower or higher than the stated range depending on various factors including but not limited to:

  • Experience with the skill sets required for success
  • Demonstrated competencies and potential
  • A geographic market-based approach

In addition to a great career opportunity Kentik offers stellar benefits for our employees which include:

  • 100% of premiums are paid by company for health vision and dental coverage for you and your dependents
  • Additionally an annual Health Reimbursement Account (HRA) of $3000 for an individual or $4500 for a family
  • Paid family & medical leave
  • Open PTO a quarterly Wellness Day and a minimum of 10 paid holidays
  • 401(k) retirement account
  • Home office reimbursement
  • Stock options

Note: Benefits are as listed for all US full-time employees. For compensation international applicants will be treated equitably in relation to the laws applicable within the countries in which we operate.

Come work with us

The true meaning of Kentik is visibility. Were committed to making sure everyone feels empowered to use their voice has a sense of belonging and is represented at Kentik.

We dont look for individuals who fit the culture but those who will continue to add to the culture.
We encourage everyone to apply especially those individuals who are underrepresented in the industry: people of color LGBTQI community women individuals with disabilities (both seen and unseen) veterans and people of any age or family status.

Kentik is committed to creating an inclusive interview process. If you require a reasonable accommodation during the application or interview process please reach out to

Come as you are!
You will be working at a fast-growing well-funded startup alongside industry thought leaders and network aficionados as we build the future of observability and set the high bar for how network operations and digital businesses should run. With a competitive salary and amazing benefits on top of the meaningful and challenging projects youll take on were sure youll enjoy joining the Kentik team.

#li-remote


Required Experience:

Staff IC

Who we areKentik is the network intelligence platform for modern infrastructure teams. Unlike traditional monitoring and observability tools we demystify complex network operations enabling organizations to deliver applications and innovation at scale. Built by network experts to make critical insig...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

Kentik is the network intelligence platform for modern infrastructure teams. Improve network observability, performance, and security. Network performance monitoring and diagnostics for traffic, routing, synthetic testing, and cloud.

View Profile View Profile