Site-Reliability Engineer (W2 Role) Need Locals to AZ for In-person Interview

Saransh Inc

Job Location:

Scottsdale, AZ - USA

Monthly Salary: Not Disclosed

Posted on: 7 hours ago

Vacancies: 1 Vacancy

Job Summary

Job title: Site-Reliability Engineer

Location: Scottsdale AZ (Onsite from Day 1)

Job Type: Contract (W2)

NOTE: *Local candidates only as the client interview is In-person*

*In-Person Interview can also either be at the client location in Richardson TX*

Interview process:

- 1 level of internal evaluation

- 3 Levels of Client Interviews (2 Telephonic and 1 In person). Last round in person interview can either be in Richardson Texas or Scottsdale Arizona.

Skill Matrix:

Name	Required
Python	Yes
Helm	Yes
Google Cloud Platform	Yes
Kubernetes	Yes

Experience required: 7 years

Mandatory skills:

Google Cloud Platform (GCP) Containerization Kubernetes

Infrastructure as Code (Terraform) CI/CD (GitHub Actions) and Helm

Automation and scripting using Python Ansible and

Monitoring and observability with Prometheus and Grafana

Linux systems and troubleshooting

Required Skills:

Service reliability/operation experience running large-scale high-performance applications in a hybrid environment (on-prem and cloud).

Experience in writing automation scripts and building dashboards for Application Performance management to manage Transaction journeys.

Experience working with Programming languages such as Go Python Java Rust etc.

Working knowledge on with one or more databases- Oracle SQL Server Redis Clickhouse postgres Mongo or any time-series databases

Experience in transitioning platforms to the cloud and Containerization GCPand Rancher

Experience maintaining containerized app in GKE/RKE/AKE environments.

Experience Implementing Cloud observability using OTEL to enable real-time monitoring distributed tracing and incident resolution.

Experience working with specific GraphQL Framework (Apollo Prisma Hasura etc...).

Experience using knowledge of networking protocols such as TCP/IP HTTP DNS Load balancing and service mesh to troubleshoot issues in high pressure situations.

Preferred Skills:

Proven experience managing Application availability building creative solutions to manage repetitive activities improving gating and detect for applications at every touchpoint for a 24 x 7 High availability platform exposed to critical clients and customers.

Working knowledge of Monitoring tools - Splunk App-dynamics grafana/Prometheus and Dynatrace.

Experience with tools like Rally Confluence and other CI/CD extenders.

Hands-on experience with implementing in-memory caching solutions. Experience on Redis DB is a plus.

Excellent debugging skills across variety of integrated technical platforms on API gateway.

Hands-on with GCS Cloud SQL Spanner and Firestore.

Extensive experience in Enterprise level Infrastructure and Operations.

Experience in High Availability and distributed systems Linux and Windows administration troubleshooting and support.

Monitor and troubleshoot HashiCorp Vault environments ensuring minimal downtime and rapid recovery from incidents.

Working knowledge on Vertex AI Gen AI and Bigquery

Job title: Site-Reliability Engineer Location: Scottsdale AZ (Onsite from Day 1) Job Type: Contract (W2) NOTE: *Local candidates only as the client interview is In-person* *In-Person Interview can also either be at the client location in Richardson TX* Interview process: - 1 level of intern...