Senior Site Reliability Engineer (Cloud Platform)

Iterable

Not Interested
Bookmark
Report This Job

profile Job Location:

Lisbon - Portugal

profile Monthly Salary: Not Disclosed
Posted on: 4 days ago
Vacancies: 1 Vacancy

Job Summary

Iterable is the leading AI-powered customer engagement platform that helps leading brands like Redfin SeatGeek Priceline Calm and Box create dynamic individualized experiences at scale. Our platform empowers organizations to activate customer data design seamless cross-channel interactions and optimize engagementall with enterprise-grade security and compliance. Today nearly 1200 brands across 50 countries rely on Iterable to drive growth deepen customer relationships and deliver joyful customer experiences.

Our success is powered by extraordinary people who bring our core valuesTrust Growth Mindset Balance and Humilityto life. We foster a culture of innovation collaboration and inclusion where ideas are valued and individuals are empowered to do their best work. Thats why weve been recognized as one of Incs Best Workplaces and Fastest Growing Companies and were recognized on Forbes list of Americas Best Startup Employers in 2022. Notably Iterable has also been listed on Wealthfronts Career Launching Companies List and has held a top 10 ranking on the Top 25 Companies Where Women Want to Work.

With a global presenceincluding offices in San Francisco New York Denver London and Lisbon plus remote employees worldwidewe are committed to building a diverse and inclusive workplace. We welcome candidates from all backgrounds and encourage you to apply. Learn more about our story and mission on our Culture and About Us pages. Lets shape the future of customer engagement together!

How you will make an impact:

As a Senior Engineer on the Cloud Platform team your impact will be measured by the continuous improvement of our platforms reliability scalability and security posture.

  • SLO Ownership & Error Budget Management: Take direct ownership of the established Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for core platform services (e.g. latency availability error rate). You will manage and use the Error Budget as the primary drivers to prioritize reliability work
  • Scale and HArden the Core Platform: Apply deep technical expertise in Kubernetes AWS traffic management and Infrastructure-as-Code to scale and harden the foundational platform that powers Iterables product workloads.
  • Drive Systemic Improvements: This role centers on hands-on engineering skill technical leadership and systemic reliability improvements within our complex distributed multi-region platform.

What youll do

  • Kubernetes Platform Engineering
    Use your Kubernetes and AWS expertise to evolve EKS lifecycle multi-tenant isolation and regional consistency ensuring clusters remain secure performant and predictable as we scale.
  • Traffic & Ingress Reliability
    Apply advanced knowledge of cloud-native traffic management and API gateways to strengthen routing authentication rate-limiting and secure communication protocols (like mTLS). This focus will dramatically improve both the reliability and security posture of the platforms public and internal service access points.
  • Infrastructure-as-Code at Scale
    Demonstrate mastery in IaC to manage complex multi-region architecture. Use tools like Terraform Cloud to build reusable modules validate changes through policy-as-code and establish safe multi-account patterns our teams can rely on.
  • Security & Access Control
    Drive a zero-trust posture by establishing service guardrails and access controls across the platform: This includes: implementing policy-as-code solutionsbrokering least-privilege access for platform using cloud Identity and Access Management (IAM) best practices and Integrating and managing identity providers to define Role-Based Access Control (RBAC) across environments.
  • Reliability Engineering & Incident Leadership
    Demonstrate strong diagnostic and incident-response leadership to rapidly isolate issues across clusters networks and workloads. Your ultimate responsibility will be to lead and drive systemic long-term fixes and root-cause investigations ensuring all necessary actions are taken to eliminate repeat failures .
  • Collaboration Influence & Mentorship
    Guide and influence engineering teams across the organization through design reviews operational best practices and reliability-focused decision-making.

Required Core Competencies & Proficiencies

  • Core Platform & Infrastructure Expertise
    Demonstrate deep skill in managing complex distributed environments at scale specifically focusing on:
    • Cloud-Native Orchestration: Expertise in Kubernetes.
    • Infrastructure Automation: Master of Infrastructure-as-code (IaC) including Terraform.
    • Advanced Networking & Connectivity: understanding of core networking fundamentals including routing DNS network segmentation () and connectivity services (e.g. transit gateways and network endpoints)
    • Platform Systems: Deep competence in traffic/ingress systems and strong programming fundamentals in Go or Python
  • Security & Reliability Skills
    Fluency with IAM/IRSA Vault mTLS and least-privilege design combined with a proven ability to deliver measurable reliability improvements through automation guardrails and smart engineering.
  • Leadership & Communication
    Demonstrate a strong operational mindset excellent technical communication (both written and verbal) and the ability to influence designs mentor others and elevate platform engineering practices across teams.
  • Experience and Proficiency
    Demonstrate advanced proficiency and technical leadership in managing large-scale resilient production systems. This experience is typically gained through roles such as:
    • Site Reliability Engineer (SRE)
    • Cloud Platform Engineer
    • DevOps Engineer
    • Other closely related infrastructure roles

Perks & Benefits:

  • Competitive salaries & meaningful equity
  • Private Medical Insurance
  • Life/Risk Assurance
  • Meal Allowance: 8.55 per day
  • Community Days (additional paid holidays)
  • Paid Annual Leave (22 days)
  • Paid Sabbatical (after 4 years tenure)
  • Initial laptop workstation setup
  • Teleworking Allowance

Recruitment Disclaimer:

Please be aware that Iterable Inc. (Iterable) and our official professional recruiting agencies and platforms do not:

  • Send job offers from free email services like Gmail Yahoo mail Hotmail etc.
  • Request money fees or payment of any kind from prospective candidates to apply to Iterable for employment or for the recruitment process (e.g. for home office supplies or training etc.).
  • Request or require personal documents like bank account details tax forms or credit card information as part of the recruitment process prior to the candidate signing an engagement letter or an employment contract with Iterable.

You may see all job vacancies on our official Iterable channels:

  • Official Iterable website Careers page: LinkedIn Jobs page: is not affiliated in any way to these impostors and we hereby confirm that such individuals/entities are not authorized encouraged or sponsored to act on behalf of Iterable. Such job opportunities are entirely fake and not valid. Therefore please disregard any written or oral request for a job offer or an interview that you believe is or might be fraudulent or suspicious and immediately reach out to us via email at upon receiving a suspicious job offer.

    Criminal and/or civil liabilities may arise from such actions and Iterable expressly reserves the right to take legal action including criminal action against such individuals/entities whenever such phenomena any case please note that under no circumstances shall Iterable and any of its affiliates be held liable or responsible for any claims losses damages expenses or other inconvenience resulting from or in any way connected to the actions of these impostors.

    Iterable is an Equal Employment Opportunity employer that proudly pursues and hires a diverse workforce. Iterable does not make hiring or employment decisions on the basis of race color religion or religious belief ethnic or national origin nationality sex gender gender-identity sexual orientation disability age military or veteran status or any other basis protected by applicable local state or federal laws or prohibited by Company policy. Iterable also strives for a healthy and safe workplace and strictly prohibits harassment of any kind. Pursuant to the San Francisco Fair Chance Ordinance and other similar state laws and local ordinances and its internal policy Iterable will also consider for employment qualified applicants with arrest and conviction records.


Required Experience:

Senior IC

Iterable is the leading AI-powered customer engagement platform that helps leading brands like Redfin SeatGeek Priceline Calm and Box create dynamic individualized experiences at scale. Our platform empowers organizations to activate customer data design seamless cross-channel interactions and optim...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

The cross channel marketing platform that powers unified customer experiences, and empowers you to create, optimize, and measure every customer interaction.

View Profile View Profile