DescriptionHealthPartners is currently hiring for a Senior Cloud and Platform Engineer Observability Team. This pivotal role is essential for maximizing the efficiency and effectiveness of our Observability product team. As the team expands its Splunk capabilities to enhance organizational insights and performance the engineer will be a key contributor to its growth. The Senior Cloud and Platform Engineer is driven by a passion for observability automation high availability continuous improvement and crafting highquality code.
Required Qualifications:
- Bachelors degree in information technology orrelated field
- 6 years of experience in software development
- 4 years of operations experience
- Exceptional understanding of software principles and ability to code.
- Exceptional understanding of automation and how to apply it to remove barriers and increase reliability.
- Strong understanding and experience with architecting designing and automating within a public cloud (AWS GCP Azure).
- Understanding of Cloud principles.
- Demonstrated experience with cloud technologies like OpenShift Kubernetes Vault Prometheus Grafana Splunk Kafka and others are highly valued.
- Understanding of use and operation of tools like Bit Bucket GitLab Jenkins Confluence.
- Experience with Chef Ansible Jenkins.
- Understanding of infrastructure like storage networking operating systems VMs.
- Understanding of architectures such as Micro services SOA J2EE Enterprise Service Bus.
- Demonstrated ability to learn recent technologies.
- Excellent ability to communicate complex ideas in written and verbal forms.
- Proven ability to work in an Agile environment.
Hours/Location:
- MF; Days
- Participation in an oncall rotation.
Responsibilities:
- Automates relentlessly following continuous integration/continuous delivery practices.
- Participates in the development and communication of DevOps principles and activities.
- Installs automates and operates tools that enable our developers and systems.
- Deploys scales and runs distributed software at scale understanding service dependencies.
- Monitors application performance and exposesmetrics to everyone.
- Designs and conducts platform/application testsincluding functionality availability load/stress and performance.
- Troubleshoot development and production problems across multiple environments and operating platforms.
- Participates within high availability and disaster recovery testing.
- Creates documentation required for smooth operation.
- Participates in a team environment which thrives on transparency and trust.
- Guides and mentors less experienced staff.
- Participates in an oncall rotation.
- Performs other duties as assigned.
Required Experience:
Senior IC