Lead Site Reliability Engineer
Job Summary
Powering the agentic revolution in travel. Sabre is an AI-native technology leader backed by one of the world's largest travel data clouds. Built on an open, modular, cloud-native architecture, Sabre serves as the backbone for both established leaders and bold new disruptors, guiding them to the next age of travel retailing through intelligent, connected, and personalized experiences. With AI at its core and operating at unparalleled scale, Sabre transforms insights into innovation, empowering airlines, hoteliers, agencies, and other partners to retail, distribute, and fulfill travel worldwide.
Job Title: Site Reliability Engineer (SRE) - Business Intelligence & Analytics Platforms
About the Role
The Site Reliability Engineer (SRE) for BI Platforms manages the reliability, performance, and modernization of a hybrid analytics environment. You will own the project-level Google Cloud Platform (GCP) infrastructure that hosts our established enterprise reporting platforms, IBM Cognos and JasperReports, operating within the architectural guardrails established by our central Cloud engineering team. Concurrently, you will serve as the primary technical lead for our Looker instances and next-generation analytics initiatives.
This role requires a unique blend of Site Reliability Engineering (SRE) practices, Java build engineering, and BI modernization. You will serve as the technical escalation point for our existing BI team, taking ownership of complex Terraform deployments, continuous integration pipelines, and custom Java/Maven configurations. Furthermore, you will help drive the future of our platform by developing custom React-based dashboards and exploring cutting-edge AI capabilities. (Note: We prioritize strong SQL, DevOps, and Java build skills for this role and are fully prepared to train the right candidate on LookML development.)
Key Responsibilities
Infrastructure as Code (IaC) & CI/CD Operations
- Provision and manage project-level GCP resources (e.g., Compute Engine instances, load balancers, project-specific IAM) using Terraform.
- Triage, debug, and refactor complex legacy IaC deployments within our dedicated GCP projects, ensuring all infrastructure is securely versioned.
- Manage and optimize deployment pipelines across Dev, Cert, and Prod environments using GitHub Actions and internal enterprise deployment orchestrators.
- Navigate enterprise release management processes by maintaining integrations between our CI/CD pipelines and ServiceNow (SNOW) for automated change requests and approvals.
- Act as the primary technical liaison with the central GCP Infrastructure team to ensure adherence to organizational security and networking parameters.
Java Ecosystem & Application Tuning
- Manage customized JasperReports deployments, including building, upgrading, and resolving dependencies for core and optional extension JAR artifacts using Maven.
- Perform JVM tuning, dispatcher routing, and OS-level optimizations for the GCP-hosted BI applications.
Looker Custom Dashboards & AI Innovation
- Build highly customized analytics dashboards and embedded data experiences using the React framework and Looker's extension capabilities.
- Reverse-engineer complex Cognos Framework Manager models and Jaspersoft Domains, translating their business logic into efficient, scalable LookML.
- Remain highly flexible with automation, using the Looker API (Looker SDK), Python, or Bash to solve unique operational challenges and automate administrative tasks.
- Stay highly motivated to learn, evaluate, and integrate emerging technologies into our ecosystem, with a specific focus on conversational analytics and agentic AI solutions.
Observability & Incident Management
- Implement unified monitoring: Use Looker System Activity to track query queues and PDT health while using Google Cloud Operations Suite to monitor CPU/memory/network utilization.
- Share operational on-call duties as needed, participating in incident response and leading blameless post-mortems for data platform outages.
Required Qualifications
- Experience: 3 years in Site Reliability Engineering, Cloud Infrastructure, Full-Stack, or Data Operations.
- IaC & CI/CD Mastery: Strong hands-on experience using Terraform for GCP resource deployment within a governed, multi-project environment. Extensive experience orchestrating pipelines with GitHub Actions and enterprise deployment managers, alongside ITSM tools like ServiceNow (SNOW).
- Java Build Engineering: Hands-on experience managing Java builds using Maven, handling dependency conflicts, and compiling JAR artifacts for enterprise applications.
- Custom UI/Front-End: Hands-on experience developing with the React framework to build custom interfaces, dashboards, or web applications.
- Database & SQL: Expert-level SQL skills with a deep understanding of analytical databases, particularly Google BigQuery.
- Enterprise Application Tuning: Proven experience deploying, tuning (JVM/Tomcat), and managing the architecture of enterprise Java applications (experience with IBM Cognos Analytics and JasperReports is highly preferred).
- Scripting & Adaptability: Excellent programming skills in Python or Bash with a demonstrated flexibility to automate ad-hoc operational tasks.
Preferred Qualifications
- Strong intrinsic motivation to research and prototype conversational analytics, LLMs, and agentic AI workflows.
- Prior experience writing LookML or administering Looker-hosted instances. (Candidates lacking this will receive on-the-job training, provided they possess expert-level SQL and engineering skills.)
- Direct experience leading a migration from legacy reporting platforms to modern BI tools.
- Google Cloud Professional Cloud Architect, Professional Cloud DevOps Engineer, or Data Engineer certification.
We will give careful consideration to your application and review your details against the position criteria. You will receive separate notification as your application progresses.
Please note that only candidates who meet the minimum criteria for the role will proceed in the selection process.
#LI-Hybrid #LI-GS1
Required Experience:
IC
About Company
Sabre Corporation is a travel technology company based in Southlake, Texas. It is the largest Global Distribution Systems provider for air bookings in North America. American Airlines founded the company in 1960, and it was spun off in 2000.