DescriptionCloud SREEngineer - Associate
Who We Look For:
Goldman Sachs Engineers are innovators and problem-solvers who thrive in fast-paced global environments. We are seeking a motivated Cloud Site Reliability Engineer (SRE) to support the WM Data Engineering this role you will apply software engineering principles to operational challenges ensuring that our cloud-native services - primarily running onAWSare resilient scalable and cost-optimized. As we transition from on-premises legacy systems to AWS you will be the guardian of system health moving beyond traditional dashboards to implementpredictive remediationandSLOs-as-Code.
Key Responsibilities:
- Reliability & Performance Engineering:
- SLO Management:Define and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs) usingOpenSLOor similar declarative frameworks. Manage Error Budgets to balance the pace of innovation with system stability.
- Predictive Observability:Implement AI-driven observability stacks (e.g.DatadogAmazon CloudWatch Container Insights orOpenTelemetry) to detect p99 latency spikes and subtle configuration drifts before they impact users.
- Incident Response:Lead high-severity incident restoration and conductblameless post-mortemsto identify root causes and automate future prevention.
- Cloud Migration & Orchestration:
- Microservices Migration:Support the migration of on-premises microservices toAmazon ECS (Fargate/EC2). Design and maintain task definitions service discovery viaAWS Cloud Map and inter-service communication usingAmazon ECS Service Connect.
- Infrastructure as Code (IaC):Develop and maintain modular version-controlled infrastructure usingTerraformorAWS CDK ensuring that reliability guardrails are baked into every deployment.
- Automation of Toil:Identify and eliminate repetitive manual tasks (toil) by developing custom automation tools inPythonorGo.
- Modernization:
- Migration Support:Contribute to the migration of on-premises data workloads to AWS.
Qualifications:
Technical Requirements
- Experience:4 years in SRE DevOps or Cloud Engineering roles with a strong focus on production operations for distributed systems.
- Container Orchestration:Deep proficiency inAmazon ECS(Fargate and EC2 launch types). Experience withDockercontainerization and managing service-to-service connectivity.
- Programming:Strong proficiency inPythonorJavafor automation and tool development. Expert-levelSQLfor data-driven reliability analysis.
- Cloud Platforms:Advanced knowledge ofAWScore services (VPC IAM S3 Lambda) and networking (Transit Gateway PrivateLink).
- Observability Tools:Hands-on experience with modern monitoring and tracing tools such as PrometheusGrafanaAWS X-Ray orSplunk.
- CI/CD for Containers:Proven ability to build automated deployment pipelines for ECS usingAWS CodePipelineGitHub Actions orTerraform incorporating blue/green or canary deployment strategies.
- Soft Skills:Strong problem-solving builder mindset and the ability to communicate technical concepts within a team environment.
Education
- Bachelors or Masters degree in computer science Engineering Mathematics or a related field.
ABOUT GOLDMAN SACHS
At Goldman Sachs we commit our people capital and ideas to help our clients shareholders and the communities we serve to grow. Founded in 1869 we are a leading global investment banking securities and investment management firm. Headquartered in New York we maintain offices around the world.
We believe who you are makes you better at what you do. Were committed to fostering and advancing diversity and inclusion in our own workplace and beyond by ensuring every individual within our firm has a number of opportunities to grow professionally and personally from our training and development opportunities and firmwide networks to benefits wellness and personal finance offerings and mindfulness programs. Learn more about our culture benefits and people at
Were committed to finding reasonable accommodations for candidates with special needs or disabilities during our recruiting process. Learn more:
The Goldman Sachs Group Inc. 2023. All rights reserved.
Goldman Sachs is an equal opportunity employer and does not discriminate on the basis of race color religion sex national origin age veterans status disability or any other characteristic protected by applicable law.
Required Experience:
IC
DescriptionCloud SREEngineer - AssociateWho We Look For:Goldman Sachs Engineers are innovators and problem-solvers who thrive in fast-paced global environments. We are seeking a motivated Cloud Site Reliability Engineer (SRE) to support the WM Data Engineering this role you will apply software engi...
DescriptionCloud SREEngineer - Associate
Who We Look For:
Goldman Sachs Engineers are innovators and problem-solvers who thrive in fast-paced global environments. We are seeking a motivated Cloud Site Reliability Engineer (SRE) to support the WM Data Engineering this role you will apply software engineering principles to operational challenges ensuring that our cloud-native services - primarily running onAWSare resilient scalable and cost-optimized. As we transition from on-premises legacy systems to AWS you will be the guardian of system health moving beyond traditional dashboards to implementpredictive remediationandSLOs-as-Code.
Key Responsibilities:
- Reliability & Performance Engineering:
- SLO Management:Define and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs) usingOpenSLOor similar declarative frameworks. Manage Error Budgets to balance the pace of innovation with system stability.
- Predictive Observability:Implement AI-driven observability stacks (e.g.DatadogAmazon CloudWatch Container Insights orOpenTelemetry) to detect p99 latency spikes and subtle configuration drifts before they impact users.
- Incident Response:Lead high-severity incident restoration and conductblameless post-mortemsto identify root causes and automate future prevention.
- Cloud Migration & Orchestration:
- Microservices Migration:Support the migration of on-premises microservices toAmazon ECS (Fargate/EC2). Design and maintain task definitions service discovery viaAWS Cloud Map and inter-service communication usingAmazon ECS Service Connect.
- Infrastructure as Code (IaC):Develop and maintain modular version-controlled infrastructure usingTerraformorAWS CDK ensuring that reliability guardrails are baked into every deployment.
- Automation of Toil:Identify and eliminate repetitive manual tasks (toil) by developing custom automation tools inPythonorGo.
- Modernization:
- Migration Support:Contribute to the migration of on-premises data workloads to AWS.
Qualifications:
Technical Requirements
- Experience:4 years in SRE DevOps or Cloud Engineering roles with a strong focus on production operations for distributed systems.
- Container Orchestration:Deep proficiency inAmazon ECS(Fargate and EC2 launch types). Experience withDockercontainerization and managing service-to-service connectivity.
- Programming:Strong proficiency inPythonorJavafor automation and tool development. Expert-levelSQLfor data-driven reliability analysis.
- Cloud Platforms:Advanced knowledge ofAWScore services (VPC IAM S3 Lambda) and networking (Transit Gateway PrivateLink).
- Observability Tools:Hands-on experience with modern monitoring and tracing tools such as PrometheusGrafanaAWS X-Ray orSplunk.
- CI/CD for Containers:Proven ability to build automated deployment pipelines for ECS usingAWS CodePipelineGitHub Actions orTerraform incorporating blue/green or canary deployment strategies.
- Soft Skills:Strong problem-solving builder mindset and the ability to communicate technical concepts within a team environment.
Education
- Bachelors or Masters degree in computer science Engineering Mathematics or a related field.
ABOUT GOLDMAN SACHS
At Goldman Sachs we commit our people capital and ideas to help our clients shareholders and the communities we serve to grow. Founded in 1869 we are a leading global investment banking securities and investment management firm. Headquartered in New York we maintain offices around the world.
We believe who you are makes you better at what you do. Were committed to fostering and advancing diversity and inclusion in our own workplace and beyond by ensuring every individual within our firm has a number of opportunities to grow professionally and personally from our training and development opportunities and firmwide networks to benefits wellness and personal finance offerings and mindfulness programs. Learn more about our culture benefits and people at
Were committed to finding reasonable accommodations for candidates with special needs or disabilities during our recruiting process. Learn more:
The Goldman Sachs Group Inc. 2023. All rights reserved.
Goldman Sachs is an equal opportunity employer and does not discriminate on the basis of race color religion sex national origin age veterans status disability or any other characteristic protected by applicable law.
Required Experience:
IC
View more
View less