Site Reliability Engineer

Sperton Global AS

Not Interested
Bookmark
Report This Job

profile Job Location:

Dublin - Ireland

profile Monthly Salary: Not Disclosed
Posted on: 15 hours ago
Vacancies: 1 Vacancy

Job Summary

Title: Site Reliability Engineer
Location: Dublin Ireland (Hybrid)
Job Type: Permanent

Role Overview:

We are hiring a Site Reliability Engineer for one of our this role you will act as the production readiness steward for versatile Gateway products and integration with other platforms. You will partner with development teams to design implement and support services with a focus on operational resilience automation and compliance.

What Youll Do:

Site Reliability Engineering:

Serve as the primary contact responsible for ensuring application scalability performance and resilience.

Practice sustainable incident response and blameless post-mortems while taking a holistic approach to problem solving and optimizing time to recover.

Automate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability.

DevOps/Automation:

Tackle complex development automation and business process problems. Engage in and improve the whole lifecycle of servicesfrom inception and design through deployment operation and refinement.

Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating and lead Client in DevOps automation and best practices.

Increase automation and tooling to reduce toil and manual intervention

ITSM Practices:

BS degree in Computer Science or related technical field involving coding (e.g. physics or mathematics) or equivalent practical experience.

Coding and/ or scripting exposure.

Appetite for change and pushing the boundaries of what can be done with automation. Be curious about new technology infrastructure and practices to scale our architecture and prepare for future growth.

Experience with algorithms data structures scripting pipeline management and software design

Systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive.

Interest in designing analysing and troubleshooting large-scale distributed systems.

Willingness and ability to learn and take on challenging opportunities and to work as a member of matrix based diverse and geographically distributed project team.

Ability to balance doing things right with fixing things quickly. Flexible and pragmatic while working towards improving the long-term health of the system.

Comfortable collaborating with cross-functional teams to ensure that expected system behavior is understood and monitoring exists to detect anomalies.

Requirements:

3-5 years of experience working with Apache Kafka in a production environment.

Strong knowledge of Kafka architecture including brokers topics partitions and replicas. (Kafka Knowledge is MUST)

Experience with Kafka security including SSL SASL and ACLs.

Proficiency in configuring deploying and managing Kafka clusters in cloud and on-premises environments.

Experience with Kafka stream processing using tools like Kafka Streams KSQL or Apache Flink.

Solid understanding of distributed systems data streaming and messaging patterns.

Proficiency in Java Scala or Python for Kafka-related development tasks.

Familiarity with DevOps practices including CI/CD pipelines monitoring and logging.

Experience with tools like Zookeeper Schema Registry and Kafka Connect.

Strong problem-solving skills and the ability to troubleshoot complex issues in a distributed environment.

Excellent communication and collaboration skills to work effectively with cross-functional teams and stakeholders.

About Sperton:

This Position is Sponsored by Sperton Global a recruitment and consulting company with an international reach. We are committed to helping our clients achieve success in their hiring processes finding the right people for the right positions.

Title: Site Reliability EngineerLocation: Dublin Ireland (Hybrid)Job Type: PermanentRole Overview:We are hiring a Site Reliability Engineer for one of our this role you will act as the production readiness steward for versatile Gateway products and integration with other platforms. You will partner...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting