Site Reliability Engineer

1 Vacancy

Job Location

Krakow - Poland

Monthly Salary

Not Disclosed

Job Description

Our client, a new, profitable Silicon Valley-based B2C product startup building innovative mobile solutions for the planet, is looking for an experienced Site Reliability Engineer to help build reliable, scalable, and observable systems. You will work closely with backend services (Python/Go), web applications, and databases to ensure performance, stability, and fast recovery in case of failures.

Location: Poland
Type: Remote, Full-time
Start date: ASAP
About the project and position:

Based in Silicon Valley and backed by top-tier VCs, the company is a new mobile innovator delivering exciting new products for consumers across the planet.
Its flagship VPN application, with over 1B downloads, ensures online privacy and anonymity for its users by creating a private network from a public internet connection.

Responsibilities:

  • Design and implement observability solutions (monitoring, logging, alerting, tracing) for backend services, web applications, and databases
  • Develop and maintain cloud and self-hosted infrastructure (AWS, DigitalOcean) using infrastructure-as-code and configuration management tools such as Terraform and Ansible
  • Support developers in improving service reliability and automating deployments
  • Build and maintain CI/CD pipelines (e.g. GitHub Actions, Jenkins)
  • Track and improve SLIs/SLOs; run root cause analyses and post-mortems
  • Promote a strong culture of reliability and continuous improvement

Requirements:

  • 5 years of experience in software engineering, including 2 years in an SRE or DevOps role
  • Experience managing high-availability production systems
  • Hands-on experience managing and operating Kubernetes clusters in production
  • Proficiency in at least one programming language (e.g. Go, Python) with a focus on automation and code quality
  • Strong knowledge of observability platforms (e.g. Datadog, CloudWatch, Prometheus, Grafana, ClickHouse)
  • Experience with cloud (AWS, DigitalOcean) and self-hosted infrastructure
  • Good understanding of incident management, disaster recovery, and monitoring best practices (e.g. DORA metrics, post-mortems, SLOs/SLIs)
  • Solid Linux administration, networking, and basic security knowledge
  • Experience building and maintaining CI/CD pipelines (e.g. Jenkins, AWS CodePipeline)
  • English: intermediate, spoken and written

Nice to have:

  • Security knowledge (e.g. OWASP, threat modeling, vulnerability scanning)
  • Experience with OpenTelemetry or similar tracing tools

Employment Type

Full Time