Staff Site Reliability Engineer Cloud
Job Summary
Elevate Global Operations as our Next Cloud Site Reliability Engineer (Observability Expert)!
Are you ready to lead an OTel-first strategy and redefine reliability for a global industrial technology leader? Trimble is looking for a visionary Cloud Site Reliability Engineer to manage our massive-scale observability platform, ensuring our digital and physical solutions remain performant and resilient. This is your chance to use cutting-edge automation and OpenTelemetry to make a tangible impact on the world's most critical industries.
About Us:
Trimble is an industrial technology company transforming the way the world works by delivering solutions that enable our customers to thrive. We create technologies that connect the digital and physical worlds, helping our customers increase productivity, quality, safety, and sustainability. From purpose-built products to enterprise-level solutions, our technology empowers professionals in construction, geospatial, government, transportation, and more.
T&L: In the Transportation & Logistics segment, our solutions make it safer, simpler, and more efficient to move freight, bringing together a global network of shippers, carriers, brokers, and 3PLs.
What Makes This Role Great:
In this role, you will be the primary architect of our Observability Centre of Excellence, directly influencing the reliability and uptime of global platforms that keep world industries moving.
Key Exciting Responsibilities:
Lead a global OTel-first strategy, implementing OpenTelemetry at scale across a diverse technological landscape.
Spearhead the development of automation scripts and Infrastructure as Code using Terraform to ensure seamless, reproducible platform delivery.
Optimize platform performance and cost-efficiency, ensuring our observability tools scale economically as our data grows.
Collaborate with engineering teams to embed reliability and security standards into new features from the ground up.
Drive root cause analysis and problem management to proactively prevent incidents and improve the customer experience.
Essential Skills & Experience:
Hands-on experience with the OpenTelemetry Collector, APIs, and SDKs.
Extensive experience with observability tools like New Relic, Datadog, or Splunk.
Strong proficiency in Infrastructure as Code (Terraform, Ansible) and cloud platforms (AWS, GCP, or Azure).
Deep understanding of containerization and orchestration using Docker and Kubernetes.
Advanced coding skills in Python, Go, or Java for building robust automation and monitoring tools.
Bonus Points For:
Experience leveraging AI coding assistants like GitHub Copilot to accelerate development.
How to Apply: Please submit an online application for this position by clicking on the Apply Now button located in this posting.
Join a Values-Driven Team: Belong, Grow, Innovate.
At Trimble, our core values of Belong, Grow, and Innovate aren't just words; they're the foundation of our culture. We foster an environment where you are seen, heard, and valued (Belong); where you have an opportunity to build a career and drive our collective growth (Grow); and where your innovative ideas shape the future (Innovate). We believe in empowering local teams to create impactful strategies, ensuring our global vision resonates with every individual. Become part of a team where your contributions truly matter.
If you need assistance or would like to request an accommodation in connection with the application process, please contact
Required Experience:
Staff IC
About Company
Trimble is transforming the way the world works by delivering products and services that connect the physical and digital worlds. Core technologies in positioning, modeling, connectivity, and data analytics enable customers to improve productivity, quality, safety, and sustainability.