SRE Engineer
San Jose, CA - USA
Job Summary
Must Have Technical/Functional Skills:
Apache SPARK development Kubenetes CI-CD Pipeline Jenkins Dokcer Kubernetes PL SQL Python
Writing SQL queries and procedures
Writing Python code to automate and develop small functionalities
Creating CI/CD pipelines
Writing Jenkins jobs
Managing applications in Kubernetes environments including deployment configuration and triaging
Hands-on experience with Apache Spark
Roles & Responsibilities:
The candidate will provide technical leadership for the team(s) they are associated with and participate in key technical decisions. They will engage with customers on escalations and ensure that there is continuous improvement in all areas. Participate in technical discussions within the team and with other groups within Business Units associated with specified projects
You design develop and maintain our real time data processing data Lakehouse infrastructure.
You have experience with Python writing data pipelines and data processing layers.
You develop and maintain Ansible playbooks for infrastructure configuration and management
You develop and maintain Kubernetes manifests Helm charts and other deployment artifacts
You have hands-on experience on Docker and containerization and how to manage/prune the images in private registries.
You have hands-on experience on access control in K8S cluster
You have hands-on experience on SPARK and maintaining SPARK CLUSTER
You monitor and troubleshoot issues related to Kubernetes clusters and containerized applications
You drive initiatives to containerize standalone apps to be containerized i n Kubernetes.
You develop and maintain infrastructure as code (IaC) and collaborate with other teams to ensure consistent infrastructure management across the organization
You use observability tools to do capacity management of our services and infrastructure resources.
You are for guiding the development and testing activities of other engineers that involve several inter-dependencies
Experience in AWS ECS and EKS is added advantage
Experience in Dremio is added advantage
Experience in Dynatrace or any tracing infrastructure or real time monitoring tool is added advantage