Site Reliability Engineer, UC Operations

8x8

Not Interested
Bookmark
Report This Job

profile Job Location:

Manila - Philippines

profile Monthly Salary: Not Disclosed
Posted on: 5 hours ago
Vacancies: 1 Vacancy

Job Summary

8x8 connects our customers and teams globally empowering CX leaders with performance and insights to make smarter decisions delight customers and drive lasting business impact.

About 8x8 UC Operations

The UC Operations team manages the production infrastructure behind 8x8s Unified Communications platform voice fax messaging and collaboration services used by enterprise customers globally. The team oversees dozens of applications running across more than two thousand service instances worldwide spanning VoIP infrastructure messaging brokers storage systems and cloud workloads across Oracle Cloud Infrastructure and physical datacenters.

UC Ops sits at the operational center of 8x8 taking escalations from the NOC coordinating with Engineering and working alongside Support Sales and Professional Services. The work is complex the systems are live and the stakes are real. We are actively moving from reactive operations to a proactive automation-first SRE model and we are looking for engineers who want to help build that not just maintain the status quo.

What Youll Do

  • Production Operations & Incident Response

  • Own platform reliability across global UC infrastructure driving incident response rather than just resolving in isolation.

  • Triage and resolve complex issues service restarts hung processes infrastructure failures and act as an escalation for the NOC when frontline teams hit their limit.

  • Execute the unglamorous but essential work: scheduled maintenance certificate renewals log rotation the stuff that prevents failure before it happens.

  • Lead blameless post-mortems that produce real follow-through not action items that disappear into a backlog.

Cross-Team Collaboration

  • Work directly with Support Sales Sales Engineering NOC Professional Services and Engineering teams across 8x8 this team sits at the operational center of the company.

  • Translate production events into clear business-readable communication under pressure; stakeholders across the org depend on your judgment during incidents.

  • Feed operational insight back into engineering turning recurring failures and patterns into actionable bug reports and platform improvements.

  • Reliability Engineering & Automation

  • Identify recurring manual work and build automation to eliminate it we treat toil as a bug not a requirement.

  • Participate in 2-week sprint cycles to deliver automation tooling improvements runbook development and infrastructure initiatives from a structured backlog.

  • Address security issues as they arise CVEs misconfigurations access control gaps treated as first-class work alongside incident response.

  • Define and track SLIs SLOs and SLAs to drive honest data-driven conversations about where reliability investment is needed.

  • Build and maintain dashboards (Grafana OCI Log Analytics) that give the team genuine signal; tune alerting to eliminate noise a high-noise on-call is itself a reliability failure.

  • Leverage AI-powered tooling to accelerate diagnostics and reduce cognitive load at scale.

On-Call & Coverage

  • Shared on-call rotation approximately 1 week per month same expectation for every engineer on the team.

  • Escalation is always an option and is encouraged; you are expected to drive the response and know when to pull others in not to hero it alone.

  • Tooling: PagerDuty for alerting Jira for tracking OCI Log Analytics and Grafana for diagnostics.

What Were Looking For

Required

  • 3 years in a site reliability platform operations or infrastructure engineering role you have run production systems and know what that actually means.

  • Solid Linux systems administration: multi-service distributed systems log reading systemctl network diagnostics no GUI required.

  • Hands-on experience with at least one major cloud provider (OCI AWS GCP or Azure) compute storage IAM networking fundamentals.

  • On-call experience: calm under pressure fast triage clear communication during an incident.

  • Scripting in Python or Bash enough to automate a task parse logs or hit an API independently.

  • Strong incident response discipline: structured thinking stakeholder communication post-mortems that actually say something.

  • Familiarity with SRE concepts: SLIs SLOs error budgets toil measurement.

  • AI-forward mindset you use AI tools as a core part of how you work not as a novelty.

  • Preferred

  • Experience with Oracle Cloud Infrastructure (OCI) compute networking Log Analytics Object Storage.

  • Familiarity with VoIP and SIP infrastructure registration trunking call signaling; this is a UC platform and that knowledge matters.

  • Knowledge of observability tooling: Prometheus Grafana PagerDuty OCI Log Analytics.

  • Experience with Ansible for configuration management and deployment automation.

  • Exposure to infrastructure migrations at scale in multi-tenant SaaS environments

What We Offer

  • Dedicated onboarding and shadow period before solo on-call responsibilities.

  • Direct exposure to global-scale production infrastructure serving global enterprise customers

  • A team culture that values operational discipline blameless post-mortems and investing in automation over accepting toil.

Work Arrangement: Hybrid (On-site Tuesdays and Wednesdays)

Office Location: BGC Taguig

Shift: US Business Hours

#LI-Hybrid

#LI-MM1

8x8 is proud to provide equal employment opportunities (EEO) to all employees and applicants for employment without regard to race color religion sex national origin age disability or genetics.

For 8x8 jobs located in the US: 8x8 participates in the E-Verify program.

View the Participant Poster in English and Español.

View the Right to Work Poster in English and Español.

We also provide reasonable accommodation to individuals with disabilities in accordance with applicable laws. Learn more or email us at (Include Reasonable Accommodation in the subject line)

Our Job Applicant Privacy Notice can be found here.

Learn more on our company website at our pages on LinkedIn Twitter Facebook and Instagram.


Required Experience:

IC

8x8 connects our customers and teams globally empowering CX leaders with performance and insights to make smarter decisions delight customers and drive lasting business impact.About 8x8 UC OperationsThe UC Operations team manages the production infrastructure behind 8x8s Unified Communications platf...
View more view more

About Company

Company Logo

The 8x8 unified platform for contact center, business phone, video, chat, and APIs helps companies of any size deliver differentiated customer experiences.

View Profile View Profile