Staff Engineer, Production Operations

Job Location

Bengaluru - India

Monthly Salary

Not Disclosed

Vacancy

1 Vacancy

Job Description

Greenlight is the leading family fintech company on a mission to help parents raise financially smart kids. We proudly serve more than 6 million parents and kids with our award-winning banking app for families. With Greenlight, parents can automate allowance, manage chores, set flexible spend controls, and invest for their family's future. Kids and teens learn to earn, save, spend wisely, and invest.

At Greenlight, we believe every child should have the opportunity to become financially healthy and happy. It's no small task, and that's why we leap out of bed every morning to come to work. Because creating a better, brighter future for the next generation depends on it.

Greenlight is looking for a Staff Engineer, Production Operations to join our growing team!

As a Staff Engineer, you will be a technical leader and individual contributor within our production operations function. You will be responsible for designing, building, and maintaining highly reliable, scalable, and performant cloud infrastructure and systems. You will play a critical role in driving technical excellence, mentoring junior engineers, and solving our most complex scalability and reliability challenges.

What you will be doing:

    • Lead the design, implementation, and evolution of Greenlight's core cloud infrastructure and SRE practices to ensure high availability, scalability, and performance.
    • Act as a technical authority for complex SRE and cloud engineering challenges, providing expert guidance and solutions.
    • Drive significant architectural improvements to enhance system reliability, resilience, and operational efficiency.
    • Develop, maintain, and optimize our cloud infrastructure using Infrastructure as Code (primarily Terraform) and automation tools.
    • Collaborate closely with development and security teams to embed SRE principles into the software development lifecycle, promoting secure and reliable coding practices.
    • Design and implement robust monitoring, logging, and alerting solutions to provide comprehensive visibility into system health.
    • Participate in and lead incident response, performing deep-dive root cause analysis and driving actionable, blameless postmortems to prevent recurrence.
    • Mentor and provide technical guidance to other SRE and Cloud Engineers, contributing to their growth and the team's overall technical capabilities.
    • Research, evaluate, and advocate for new technologies and tools that can improve our operational posture and efficiency.
    • Contribute to the strategic planning and roadmap development for the SRE and Cloud Engineering functions.
    • Enhance existing services and applications to increase availability, reliability, and scalability in a microservices environment.
    • Build and improve engineering tooling, process, and standards to enable faster, more consistent, more reliable, and highly repeatable application delivery.

What you should bring:

    • Technical Leadership: Lead complex technical projects and mentor engineers.
    • Communication: Articulate complex technical concepts clearly.
    • SRE Expertise: Apply SRE principles (SLIs, SLOs, error budgets) in production.
    • Distributed Systems: Understand and troubleshoot complex issues in distributed systems.
    • Monitoring & Alerting: Design and optimize monitoring, logging, and alerting systems (e.g., Datadog, Prometheus).
    • Cloud Mastery (AWS): Expert-level knowledge of AWS services (e.g., EC2, S3, EKS).
    • Infrastructure as Code (Terraform): Master IaC for cloud infrastructure management.
    • Containerization: Strong experience with Docker and Kubernetes in production.
    • Automation: Bias for automation and building self-healing systems.
    • Problem Solving: Exceptional analytical and problem-solving skills, proactively identifying bottlenecks.

Technologies we use:




Required Experience:

Staff IC

Employment Type

Full-Time
