We are seeking a highly skilled Cloud Engineer & Infrastructure Security professional to design build and secure our hybrid infrastructure (cloud on-prem). The ideal candidate will have deep experience with Kubernetes Terraform Helm and a strong background in infrastructure security DevSecOps and on-prem deployments. This role is critical for architecting scalable secure and observable infrastructure supporting mission-critical applications and LLM (Large Language Model) workloads.
Your Responsibilities:
Infrastructure & Cloud Management
Deploy and manage Kubernetes clusters (cloud & on-prem) using Terraform and Helm.
Implement zero trust models IAM and least-privilege access.
Enforce security policies micro-segmentation and secrets management.
DevSecOps & CI/CD Security
Integrate security scanning SBOM and policy-as-code into pipelines.
Automate compliance and security checks during build and deploy.
LLM & Hybrid Deployments
Build and maintain infrastructure for LLM workloads (vLLM KServe).
Support hybrid cloud and on-prem deployments ensuring consistency and security.
Monitoring & Observability
Implement monitoring logging and alerting using Grafana Azure Monitor Prometheus.
Maintain dashboards SLIs/SLOs and performance metrics.
Linux & Automation
Harden Linux systems automate routine tasks and support incident response.
Develop scripts and tools to streamline operations.
Collaboration & Strategy
Partner with engineering security and operations teams.
Mentor teams on cloud best practices and emerging technologies.
What we look for:
Strong experience with Kubernetes including cluster provisioning scaling and security.
Proficient in Terraform and Helm for infrastructure-as-code and deployment automation.
Expertise in infrastructure security zero trust models and IAM best practices.
Hands-on experience with DevSecOps: security scanning SBOM generation secrets management and policy-as-code.
Solid understanding of cloud networking: VPC design VPN and firewall configuration.
Experience with hybrid or on-prem deployments alongside cloud environments.
Skilled in Linux administration scripting and automation for operational efficiency.
Familiarity with monitoring and observability tools (Azure Monitor Grafana Prometheus).
Experience building and managing infrastructure for LLM or AI workloads (vLLM KServe).
Nice - to - have:
Cloud and security certifications (e.g. CKA/CKAD Terraform Associate CISSP).
Experience with GitOps workflows (Argo CD Flux) and CI/CD security pipelines.
Knowledge of policy frameworks (OPA Gatekeeper Kyverno) and workload identity systems (SPIFFE/SPIRE).
Familiarity with GPU/accelerator-based infrastructure for ML/LLM workloads.
Background in SRE practices including SLO/SLI design and incident response.
Contributions to open-source cloud DevSecOps or LLM infrastructure projects.
What we offer:
Competitive salary and performance-based bonuses.
Fully remote flexible work environment.
Modern laptop and hardware provided by us.
Specialized training in AI automation and digital productivity tools.
Global exposurecollaborate with top-tier founders and fast-growing startups.
Continuous learning and career growth opportunities in an international environment.
A Message from Our CEO:We are seeking a highly skilled Cloud Engineer & Infrastructure Security professional to design build and secure our hybrid infrastructure (cloud on-prem). The ideal candidate will have deep experience with Kubernetes Terraform Helm and a strong background in infrastructure s...
A Message from Our CEO:
We are seeking a highly skilled Cloud Engineer & Infrastructure Security professional to design build and secure our hybrid infrastructure (cloud on-prem). The ideal candidate will have deep experience with Kubernetes Terraform Helm and a strong background in infrastructure security DevSecOps and on-prem deployments. This role is critical for architecting scalable secure and observable infrastructure supporting mission-critical applications and LLM (Large Language Model) workloads.
Your Responsibilities:
Infrastructure & Cloud Management
Deploy and manage Kubernetes clusters (cloud & on-prem) using Terraform and Helm.
RAY is on a mission to enable builders & innovators to focus on work only they can do and simultaneously create exciting high-paying jobs in the global south. We envision a world in which busy work is a relic of the past. In which our brightest minds can give 110% to their mission wit
... View more