Site Reliability Engineer (SRE) Sofia Bulgaria
Our client is seeking an experienced Site Reliability Engineer (SRE) to join their technology team in Sofia. The role focuses on ensuring the reliability scalability and performance of modern cloud-based and on-premises systems with a strong emphasis on AWS automation and infrastructure as code (IaC).
This position blends software engineering and systems engineering to drive resilience efficiency and high availability across production platforms.
Key Responsibilities
Design build and maintain reliable scalable and performant systems across AWS and on-premises environments (cloud-first approach).
Implement monitoring alerting and observability tools to ensure visibility into system health and performance.
Automate deployments configuration management and operational tasks to increase efficiency and reduce manual effort.
Participate in incident response and postmortems reducing MTTR and strengthening reliability practices.
Collaborate with developers to embed reliability and scalability into the software development lifecycle.
Oversee capacity planning performance tuning and AWS cost optimization.
Ensure compliance with security regulatory and audit requirements.
Requirements
5 years in Site Reliability Engineering DevOps or related roles.
Strong Linux systems administration background.
Proficiency in at least one scripting/programming language (Python Go Bash etc.).
Deep expertise with AWS services (EC2 ECS/EKS RDS S3 IAM networking).
Proven experience with Terraform and configuration management tools (Puppet Chef Ansible).
Strong knowledge of CI/CD pipelines (Jenkins GitLab or similar).
Hands-on experience with monitoring and observability tools (Prometheus Grafana ELK Datadog etc.).
Solid understanding of networking load balancing and DNS.
Excellent troubleshooting and problem-solving skills especially during high-pressure incidents.
Preferred Skills
Experience with Kubernetes or container orchestration systems.
Familiarity with SLOs SLIs and error budgeting.
Previous work in financial systems or other mission-critical environments.
Working Hours
Benefits
Competitive salary plus uncapped quarterly performance bonus
Hybrid work model (3 days office 2 days remote)
Additional health insurance
Food vouchers and fresh fruit in the office
Sports card fitness center and game room on-site
Company-sponsored sports and team events
Budget for professional development (courses certifications conferences)
Exclusive employee discounts and perks
How to Apply
Send your CV in English. All applications will be treated with strict confidentiality. Only shortlisted candidates will be contacted.
InterContinental Recruiting Ltd.
Recruitment License No. 2087/22.07.2016
InterContinental Recruiting
Please contact us with any questions:
Email:
Phone: (w)
Recruitment license from National Agency of Employment No 2087/22.07.2016
Site Reliability Engineer (SRE) Sofia BulgariaOur client is seeking an experienced Site Reliability Engineer (SRE) to join their technology team in Sofia. The role focuses on ensuring the reliability scalability and performance of modern cloud-based and on-premises systems with a strong emphasis on ...
Site Reliability Engineer (SRE) Sofia Bulgaria
Our client is seeking an experienced Site Reliability Engineer (SRE) to join their technology team in Sofia. The role focuses on ensuring the reliability scalability and performance of modern cloud-based and on-premises systems with a strong emphasis on AWS automation and infrastructure as code (IaC).
This position blends software engineering and systems engineering to drive resilience efficiency and high availability across production platforms.
Key Responsibilities
Design build and maintain reliable scalable and performant systems across AWS and on-premises environments (cloud-first approach).
Implement monitoring alerting and observability tools to ensure visibility into system health and performance.
Automate deployments configuration management and operational tasks to increase efficiency and reduce manual effort.
Participate in incident response and postmortems reducing MTTR and strengthening reliability practices.
Collaborate with developers to embed reliability and scalability into the software development lifecycle.
Oversee capacity planning performance tuning and AWS cost optimization.
Ensure compliance with security regulatory and audit requirements.
Requirements
5 years in Site Reliability Engineering DevOps or related roles.
Strong Linux systems administration background.
Proficiency in at least one scripting/programming language (Python Go Bash etc.).
Deep expertise with AWS services (EC2 ECS/EKS RDS S3 IAM networking).
Proven experience with Terraform and configuration management tools (Puppet Chef Ansible).
Strong knowledge of CI/CD pipelines (Jenkins GitLab or similar).
Hands-on experience with monitoring and observability tools (Prometheus Grafana ELK Datadog etc.).
Solid understanding of networking load balancing and DNS.
Excellent troubleshooting and problem-solving skills especially during high-pressure incidents.
Preferred Skills
Experience with Kubernetes or container orchestration systems.
Familiarity with SLOs SLIs and error budgeting.
Previous work in financial systems or other mission-critical environments.
Working Hours
Benefits
Competitive salary plus uncapped quarterly performance bonus
Hybrid work model (3 days office 2 days remote)
Additional health insurance
Food vouchers and fresh fruit in the office
Sports card fitness center and game room on-site
Company-sponsored sports and team events
Budget for professional development (courses certifications conferences)
Exclusive employee discounts and perks
How to Apply
Send your CV in English. All applications will be treated with strict confidentiality. Only shortlisted candidates will be contacted.
InterContinental Recruiting Ltd.
Recruitment License No. 2087/22.07.2016
InterContinental Recruiting
Please contact us with any questions:
Email:
Phone: (w)
Recruitment license from National Agency of Employment No 2087/22.07.2016
View more
View less