DevOps SRE

Sysmind LLC

Not Interested
Bookmark
Report This Job

profile Job Location:

Sunnyvale, CA - USA

profile Monthly Salary: Not Disclosed
Posted on: 4 hours ago
Vacancies: 1 Vacancy

Job Summary

Request ID: 52317-1

Start/End Dates: 2/28/2026 - 8/31/2026

Tax Work Location: US Default

Job Title: Information TechnologyUSA - USASystem Administrator

Job Description: **Please strictly adhere to the following resume naming convention:

ALL CAPS NO SPACES B/T UNDERSCORES

PTNUSGBAMSREQIDCandidateBeelineID

i.e. PTNUS9999999SKIPJOHNSON0413

Bill Rate: $80.00

GBaMS ReqID:

MSP Owner: Stefanie

Location: Sunnyvale CA (3x/ week onsite)

Duration: 6 months

DevOps SRE

We are looking for a DevOps SRE proficient in cloud technologies Kubernetes Docker and Python

Role Descriptions: Job SummaryWe are looking for a highly skilled Site Reliability Engineer (SRE) to design build and maintain reliable scalable and high-performance systems. The ideal candidate has strong hands-on experience with AWS cloud services Kubernetes Docker DevOps practices and Python and is passionate about automation system reliability and operational ResponsibilitiesDesign deploy and maintain scalable and highly available infrastructure on AWSManage containerized applications using Docker and orchestrate them with KubernetesBuild and maintain CICD pipelines to automate build test and deployment processesImplement infrastructure as code (IaC) using tools like Terraform or CloudFormationDevelop automation and internal tools using PythonMonitor system performance availability and reliability proactively identify and resolve issuesDefine and track SLIs SLOs and SLAsParticipate in on-call rotations incident response and post-incident reviewsCollaborate closely with development security and operations teamsImprove system resilience fault tolerance and disaster recovery strategiesEnsure best practices for security cost optimization and performance tuningRequired Skills QualificationsStrong experience with AWS services (EC2 EKS S3 RDS IAM VPC CloudWatch etc.)Hands-on expertise with Kubernetes (EKS preferred) and DockerSolid understanding of DevOps principles and practicesProficiency in Python for automation scripting and toolingExperience with CICD tools (Jenkins GitHub Actions GitLab CI ArgoCD etc.)Knowledge of LinuxUnix systemsExperience with monitoring and logging tools (Prometheus Grafana ELK Datadog etc.)Familiarity with networking concepts (DNS TCPIP load balancing)Strong troubleshooting and problem-solving skills Essential Skills: Job SummaryWe are looking for a highly skilled Site Reliability Engineer (SRE) to design build and maintain reliable scalable and high-performance systems. The ideal candidate has strong hands-on experience with AWS cloud services Kubernetes Docker DevOps practices and Python and is passionate about automation system reliability and operational ResponsibilitiesDesign deploy and maintain scalable and highly available infrastructure on AWSManage containerized applications using Docker and orchestrate them with KubernetesBuild and maintain CICD pipelines to automate build test and deployment processesImplement infrastructure as code (IaC) using tools like Terraform or CloudFormationDevelop automation and internal tools using PythonMonitor system performance availability and reliability proactively identify and resolve issuesDefine and track SLIs SLOs and SLAsParticipate in on-call rotations incident response and post-incident reviewsCollaborate closely with development security and operations teamsImprove system resilience fault tolerance and disaster recovery strategiesEnsure best practices for security cost optimization and performance tuningRequired Skills QualificationsStrong experience with AWS services (EC2 EKS S3 RDS IAM VPC CloudWatch etc.)Hands-on expertise with Kubernetes (EKS preferred) and DockerSolid understanding of DevOps principles and practicesProficiency in Python for automation scripting and toolingExperience with CICD tools (Jenkins GitHub Actions GitLab CI ArgoCD etc.)Knowledge of LinuxUnix systemsExperience with monitoring and logging tools (Prometheus Grafana ELK Datadog etc.)Familiarity with networking concepts (DNS TCPIP load balancing)Strong troubleshooting and problem-solving skills Desirable Skills:

Keyword:

Skills: Digital : PythonDigital : DockerDigital : KubernetesDigital : Site Reliability Engineering (SRE) Experience Required: 8-10

Release Comments:

Please use the below link to begin submitting Candidates.

Request ID: 52317-1 Start/End Dates: 2/28/2026 - 8/31/2026 Tax Work Location: US Default Job Title: Information TechnologyUSA - USASystem Administrator Job Description: **Please strictly adhere to the following resume naming convention: ALL CAPS NO SPACES B/T UNDERSCORES PTNUSGBA...
View more view more