Senior Site Reliability Engineer
Job Summary
Blip is a leading tech company focused on software engineering solutions for sports operate at scale. As part of Flutter Entertainment we play an essential role in the Groups goal of becoming the global leader in online sports betting and iGaming developing innovative products and platforms for over 14 million monthly customers worldwide. We are serious about Tech. We are problem-solvers with big ambitions keeping a people-first mindset at the core of our work. We prioritize flexibility as we strive to deliver the best technological products and tackle the greatest industry challenges. Recognizing that everyone brings their own strengths backgrounds and new perspectives we empower you to be yourself. That uniqueness shapes the culture of belonging we are so proud of.
The Role
We are seeking a motivated and experienced senior engineer to join our dynamic organisation. As a Senior Site Reliability Engineer you will be responsible for overseeing a group of employees providing direction and support to ensure goals are met and operations run smoothly. If you have a strong background in team management and are ready to take on a new challenge we want to hear from you. Come be a part of our team and make a positive impact on our organisations success.
What Youll Be Doing
- Engage in and improve the full lifecycle of servicesfrom design and deployment to operation and continuous refinement.
- Actively participate in production incident root cause analysis identification and resolution when needed.
- Support services before go-live through system design consulting platform/framework development capacity planning and launch reviews.
- Maintain live services by monitoring availability latency and overall system health.
- Contribute to performance and capacity testing initiatives.
- Optimize reliability through effective monitoring and alerting strategies.
- Scale systems sustainably through automation and drive improvements that enhance reliability and delivery velocity.
- Perform iterative audits of performance and reliability vulnerabilities.
- Define and continuously refine Service Level Indicators (SLIs).
- Practice sustainable incident response and conduct blameless postmortems.
What Youll Bring
- Deep familiarity building and troubleshooting release and build pipelines (ex Jenkins buildkite GitHub actions)
- Experience implementing creative approach in monitoring distributed systems while leveraging industry best practices (ex instrumenting tagging taxonomy across disparate systems)
- Experience building managing and deploying an application utilizing containerized microservices in a distributed infrastructure (ex AWS GCP self hosted cloud)
- Experience leveraging new technologies when it best serves a business need
- Comprehensive understanding of incident management best practices
- Opinionated and knowledgable approach for implementing industry best practices
- Demonstrated experience developing teams encouraging growth serving as a technical mentor and leader
- Shows strength and comprehension in at least one programming languages (ex. Java Python Scala Kotlin)
- Experience making large directional technical decisions (ex. Deciding which technology or pattern to create or leverage)
- Experience being on-call for a service and familiarity with incident notification tooling (ex. Pagerduty Opsgenie)
- Comprehensive understanding of SRE principles (ex. Working knowledge of the Google SRE book)
- Demonstrated strength in leading a project in a agile/scrum environment
Thrives in a diverse work environment
Wed Like Tou To Master In
- Experience managing complex telemetry solutions which directly contributed to overall reliability
- Design greenfield solutions leveraging Configuration - Management/Infrastructure as Code tools (ex. Chef puppet Terraform)
- Create automated tooling that contributed to multiple teams velocity
- Demonstrated experience with project management best practices
- Shows the ability to break down large technical concepts into effective communication with stakeholders from across the organization
- Extensive knowledge of networking best practices tools and observability
- Experiencing developing and deploying automated service configuration at the edge (ex. CDN configuration certificate renewal)
- Work consulting with a team being able to advise on their technology workflows dev tooling monitoring alerting best practices
- Identified need for and lead development of automation that significantly reduced toil (ex Deployment pipelines distributed dev environments)
- Built and maintained a system and culture that supported and implemented SLOs
- Has shown to be a thought leader contributing to the broader industry conversation about SRE principals and topics (ex. Speaking at conferences)
This is what you should have. What do we have you ask can check ouramazing perks & benefitsrighthere!
So ... are you in
Equal opportunities
At Blip we are committed to creating a diverse and inclusive workplace. We strongly encourage people from all backgroundsways of thinking and working to apply.
We are committed to including everyoneregardless of their race disability age gender identity sexual orientation and religion.
Everyone brings different perspectives and experiences; you dont have to meet all the requirements listed to apply for this role.If you need any adjustments to apply for the position and to ensure this role aligns with your needs please send an email to.
We will only respond to inquiries related to disabilities.
Required Experience:
Senior IC
About Company
INTERNATIONAL TEAM. AWESOME TECHNOLOGY. SOFTWARE ENGINEERING AT ITS FINEST. Based in Porto, Blip is a software engineering company with a difference. Founded in 2009, we already have 320 Blippers. And we’re still growing. We’re in the API Billionaire’s Club alongside Twitter, Faceboo ... View more