Description
We have an exciting and rewarding opportunity for you to take your software engineering career to the next level.
As a Lead Site Reliability Engineer at JPMorgan Chase within CCB, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. Drive significant business impact through your capabilities and contributions, and apply deep technical expertise and problem-solving methodologies to tackle a diverse array of challenges that span multiple technologies and applications.
Job responsibilities
- Execute software solutions, design, development, and technical troubleshooting with the ability to think beyond routine or conventional approaches to build solutions or break down technical problems.
- Support the engineering teams in building fault-tolerant, scalable applications by engaging in design discussions, RFCs, and code reviews.
- Drive decisions that influence the product design, application functionality, and technical operations and processes.
- Implement and regularly test disaster recovery (DR) strategies to ensure the highest level of resilience and fault tolerance of the platform.
- Automate the installation, upgrade, scaling, and management of a large and rapidly growing fleet of Kubernetes clusters. Develop custom platform control plane webhooks, CRDs, operators, and more that provide a secure, opinionated platform.
- Maintain and promote high-quality written documentation of assets, processes, and runbooks that the team uses in its day-to-day operations.
- Add to the team culture of diversity, equity, inclusion, and respect.
Required qualifications, capabilities, and skills
- Possess an up-to-date understanding of design patterns relevant to hosting and networking architectures.
- Proactively champion product development, driven by a desire to build truly exceptional products, not just solve immediate challenges.
- A strong background working in Python, Golang, or Java, having used one of these programming languages to execute a significantly sized project or initiative.
- Extensive experience working with Kubernetes and Cloud Platforms (AWS, GCP, or Azure).
- Expertise in one or more of the following areas: Database Administration, Networking, Observability Tools, or automation of infrastructure.
- Ability to tackle design and functionality problems independently with little to no oversight.
- Excellent debugging and troubleshooting skills.
Preferred qualifications, capabilities, and skills
- Experience in infrastructure architecture design.
- Certification in Cloud Platforms (AWS or GCP preferred).
- Certification in Kubernetes.