Senior Site Reliability Engineer (fmd)

Kraken

Not Interested
Bookmark
Report This Job

profile Job Location:

Paris - France

profile Monthly Salary: Not Disclosed
Posted on: 17 days ago
Vacancies: 1 Vacancy

Job Summary

Help us use technology to make a big green dent in the universe!

Krakenpowers some of the most innovative global developments in energy.

Were a technology company focused on creating a smart sustainable energy system. From optimising renewable generation creating a more intelligent grid and enabling utilities to provide excellent customer experiences our operating system for energy is transforming the industry around the world in a way that benefits everyone.

Its a really exciting time in energy. Help us make a real impact on shaping a better more sustainable future.

Our Global Platform Engineering Reliability group is responsible for architecting developing and maintaining the resilient and scalable infrastructure that power and support our platforms.

As a Site Reliability Engineer within the newly created Product Reliability team youll be responsible for ensuring the availability performance and scalability of the products on our platform.

Your proficiency in supporting products that serve millions of customers will ensure stability and high performance for our brands and clients.

Youll keep up with best practices in building products for scale. Your communication skills and attention to detail will be indispensable as you pinpoint areas for enhancement ensure optimal product performance and continuously improve our reliability and efficiency.

What youll do

  • Teach and support product teams on best practices for reliability implementation patterns and effective usage of our existing platforms
  • Support product teams in improving the performance and availability of their systems
  • Be hands-on in code and infrastructure to help product teams with reliability improvements
  • Provide comprehensive feedback to the wider Platform group on improvements to be made to core infrastructure based on observations and first-hand experience in the code base
  • Support the build-out of proof-of-concept requirements in product teams as needed to evolve application deployment architecture to align with business growth as well as enhance scalability and system resilience
  • Collaborate with product teams to support the release of new features and services ensuring adherence to reliability and performance standards
  • Guide product teams in designing systems for resilience and graceful failure under heavy load
  • Assist application teams with post-incident tasks and follow-ups and contribute to the creation and review of post-mortem documentation
  • Analyse incident metrics to identify trends and potential improvements communicating these insights to the product teams
  • Help solve interesting and difficult problems. Theres a great opportunity for disruption in the global energy market
  • What youll have

  • Great communication skills working effectively with developers product managers and other business stakeholders to understand design and deliver impactful projects and reliability improvements
  • Proficient using AWS; we use a lot of different AWS services and not just the standard few
  • Strong Python skills; particularly with Django the Django ORM and Celery
  • Good expertise in multiple of the following areas:
  • PostgreSQL or a similar RDBMS particularly in Amazon RDS at scale
  • Docker and Kubernetes; we use Amazon EKS in production
  • Datadog or a similar logging/monitoring tool
  • Messaging queues event-driven async processing or similar technologies - we use RabbitMQ
  • Terraform or a similar infrastructure-as-code tool
  • Experience with a Linux distribution
  • Previous experience working in small highly-autonomous teams
  • What will help

  • Previous experience as a Site Reliability Engineer
  • Experience working on SaaS platforms including engaging product teams to ensure up-skilling and knowledge sharing across teams
  • Experience managing and supporting a large scale internet facing service
  • Experience in responding to incidents and outages writing technical incident reports and organising incident retrospectives
  • Experience working with very large relational databases
  • Experience in using service level objectives to improve application performance
  • A proactive innovative mindset
  • Kraken is a certified Great Place to Work in France Germany Spain Japan and the UK we are one of the Best Workplaces on Glassdoor with a score of 4.7. Check out our Welcome to the Jungle site (FR/EN) to learn more about our teams and culture.

    Are you ready for a career with us We want to ensure you have all the tools and environment you need to unleash your potential. If you have any specific accommodations or a unique preference please contact us at and well do what we can to customise your interview process for comfort and maximum magic!

    Studies have shown that some groups of people like women are less likely to apply to a role unless they meet 100% of the job requirements. Whoever you are if you like one of our jobs we encourage you to apply as you might just be the candidate we hire. Across Kraken were looking for genuinely decent people who are honest and empathetic. Our people are our strongest asset and the unique skills and perspectives people bring to the team are the driving force of our success. As an equal opportunity employer we do not discriminate on the basis of any protected attribute. We consider all applicants without regard to race colour religion national origin age sex gender identity or expression sexual orientation marital or veteran status disability or any other legally protected status. U.S. based candidates can learn more about their EEO rights here.

    Our (i) Applicant and Candidate Privacy Notice and Artificial Intelligence (AI) Notice (ii) Website Privacy Notice and (iii) Cookie Notice govern the collection and use of your personal data in connection with your application and use of our website. These policies explain how we handle your data and outline your rights under applicable laws including but not limited to the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA). Depending on your location you may have the right to access correct or delete your information object to processing or withdraw consent. By applying you acknowledge that youve read understood and consent to these terms


    Required Experience:

    Senior IC

    Help us use technology to make a big green dent in the universe!Krakenpowers some of the most innovative global developments in energy.Were a technology company focused on creating a smart sustainable energy system. From optimising renewable generation creating a more intelligent grid and enabling u...
    View more view more

    Key Skills

    • Kubernetes
    • FMEA
    • Continuous Improvement
    • Elasticsearch
    • Go
    • Root cause Analysis
    • Maximo
    • CMMS
    • Maintenance
    • Mechanical Engineering
    • Manufacturing
    • Troubleshooting