drjobs Cloud Engineer

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Plano, TX - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Cloud Engineer Observability & Monitoring (Azure Splunk AKS)

Overview: A technology-focused organization is seeking a highly skilled Cloud Engineer with deep expertise in monitoring distributed microservices within Azure environments. This role will lead the design and implementation of comprehensive observability solutions using Splunk ensuring robust performance scalability and reliability across a microservices-based architecture.

The ideal candidate will play a key role in shaping monitoring strategies automating incident response and driving operational excellence for mission-critical applications.

Key Responsibilities:

  • Design implement and maintain monitoring solutions for distributed microservices on Azure Kubernetes Service (AKS)

  • Utilize Splunk for log ingestion custom dashboards alerting and advanced analytics

  • Integrate Istio service mesh observability (telemetry tracing logging) into monitoring frameworks

  • Apply Twistlock (Prisma Cloud) policies for container and workload security monitoring

  • Monitor and configure Azure-native services including API Management (APIM) Cosmos DB SQL Server and Azure Networking

  • Use Terraform to manage infrastructure as code embedding observability into deployments

  • Build and optimize CI/CD pipelines in Azure DevOps (AzDO) with integrated monitoring hooks

  • Leverage Azure Chaos Studio to test system resilience and incorporate findings into monitoring improvements

  • Support automated API and performance testing using Karate Labs alongside observability tools

  • Collaborate with development security and operations teams to define and track SLAs SLOs and SLIs

  • Participate in incident response root cause analysis and continuous improvement initiatives

Required Qualifications:

  • Experience in DevOps practices and methodologies

  • Background in Site Reliability Engineering or Cloud Operations roles

  • Hands-on experience with Splunk in a microservices environment

  • Proficiency with Azure Kubernetes Service (AKS) and Istio

  • Strong understanding of Azure services and architecture

  • Experience implementing Twistlock Terraform and Azure DevOps (AzDO) pipelines

  • Familiarity with Azure Chaos Studio and Karate Labs for testing and validation

  • Strong scripting and automation capabilities

Preferred Qualifications:

  • Azure certifications (e.g. AZ-400 AZ-104 AZ-305

  • Experience with other observability tools such as Prometheus Grafana or OpenTelemetry

  • Knowledge of DevSecOps practices and secure CI/CD pipeline implementation

Employment Type

Full Time

Company Industry

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.