We are looking for a skilled Site Reliability Engineer (SRE) to join our engineering team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and infrastructure. You will work closely with developers and DevOps teams to build and maintain tools that automate operations and ensure 24/7 availability.

Responsibilities:
Design, build, and maintain scalable, resilient, and secure infrastructure in cloud or hybrid environments (e.g. AWS, GCP, Azure).
Monitor system performance and availability using tools like Prometheus, Grafana, or Datadog.
Implement infrastructure as code (IaC) using tools like Terraform or Ansible.
Build and maintain CI/CD pipelines to enable reliable and fast deployments.
Participate in on-call rotation, incident response, and root cause analysis.
Improve system observability and alerting.
Collaborate with software engineers to influence system design for reliability.
Automate repetitive tasks to improve operational efficiency.
Promote SRE best practices across the organization.

Required Experience:
IC