Role: Cloud Infrastructure & SRE Engineer
Duration: Long Term
Location: Dearborn, MI (4 days onsite per week)
Local candidate only
Required:
- Experience operating production cloud platforms and services (e.g. GCP/AWS/Azure) with an SRE mindset.
- Strong fundamentals in Linux, networking, distributed systems, and debugging complex production issues.
- Proficiency with infrastructure as code and automation (e.g. Terraform, Helm/Kustomize, GitOps tooling).
- Experience with containers and orchestration (Docker, Kubernetes) and modern CI/CD.
- Programming and scripting ability (e.g. Go, Python, Java, TypeScript) to build tooling and automate workflows.
- Clear communication, effective incident leadership, and a customer-focused approach to platform work.
Preferred:
- Experience defining SLIs/SLOs and implementing SLO-based alerting and dashboards.
- Observability platform experience (e.g. Prometheus/Grafana, OpenTelemetry, centralized logging).
- Policy-as-code and supply chain security (e.g. OPA/Rego, SLSA concepts, SBOMs, artifact signing).
- Experience building golden paths (container images, templates, reference architectures, paved pipelines) adopted by multiple teams.
- Cost optimization experience (FinOps practices, capacity forecasting, right-sizing, multi-tenant platform controls).