Senior Site Reliability Engineer (SRE)

TechTellent

Posted on : 03-07-2025

Employer Active

1 Vacancy

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

Nicosia - Cyprus

Monthly Salary

Not Disclosed

Salary Not Disclosed

Vacancy

1 Vacancy

Posted on : 03-07-2025

Job Description

Your Mission

As a Site Reliability Engineer you will own and operate our full observability and monitoring stack ensuring our systems are reliable scalable and performant. You will collaborate closely with development and operations teams to automate processes reduce manual toil and implement engineering-driven solutions that balance innovation velocity with operational stability.

What Youll Do

Design implement and maintain monitoring alerting and logging systems (Prometheus VictoriaMetrics Grafana OpenSearch Dynatrace).
Define and track Service Level Indicators (SLIs) Service Level Objectives (SLOs) and manage error budgets to measure and improve system reliability.
Automate repetitive tasks and build self-healing infrastructure using scripting (Bash Python) and infrastructure-as-code tools (Terraform Terragrunt).
Ensure Kubernetes (EKS) cluster reliability through health checks graceful shutdowns rolling updates and autoscaling.
Develop and maintain CI/CD pipelines using GitLab and Helm charts.
Lead incident response conduct blameless postmortems and implement preventive measures.
Document operational procedures runbooks and observability logic; train internal teams on best practices.
Participate in 24/7 on-call rotations to maintain service availability.

What Youll Bring

3 years of experience working with Linux and AWS environments (AWS certifications a plus).
Hands-on experience with observability tools: Prometheus Grafana OpenSearch/ELK VictoriaMetrics Dynatrace.
Familiarity with messaging and database technologies such as Kafka RabbitMQ PostgreSQL Cassandra Redis Elasticsearch.
Strong skills in containerization and orchestration: Docker Kubernetes (EKS) Helm.
Proficient scripting skills in Bash and Python; experience with Terraform and Terragrunt for infrastructure automation.
Solid understanding of CI/CD processes preferably with GitLab.
Knowledge of SRE principles including SLIs/SLOs error budgets capacity planning and incident management.
Excellent communication skills and ability to collaborate across teams.
English proficiency at intermediate level or higher.

Employment Type

Full Time

Company Industry

Key Skills

Apply Now

About Company

TechTellent

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Senior Site Reliability Engineer (SRE)

TechTellent

Job Description

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

DevOps Engineer

Senior Backend Engineer

Electrical Engineer/ Senior Electrical Engineer- Power System Consulting

Senior Solid Mechanical Engineer - GKN Aerospace

Senior Data Engineer â GÃ¶teborg

Senior Computer Vision Engineer â Lund

Security Engineer â Senior (Ref. 87384) â HÃbrido Argentina

Exchange Engineer â Senior (Ref. 87388) â HÃbrido Argentina