Solutions Architect SRE (DYNATRACE)
Job Summary
We are seeking a highly experienced Senior Observability Engineer with deep handson expertise in Dynatrace SaaS to lead enterprisescale observability deployments across AWS and Azure. This role will drive the design automation and rollout of Dynatrace capabilities for large complex environments while partnering closely with DevOps SRE Cloud and Application teams. The ideal candidate has 9 years of Dynatrace implementation experience strong DevOps and automation skills and the ability to mentor engineering teams.
Dynatrace Expertise
Dynatrace Expertise
- Lead large enterprise-scale deployments of Dynatrace observability across distributed microservices serverless workloads and multiregion multi-cloud environments.
- Maintain Dynatrace governance and best practices support multi-tenants fine grained access controls and logical segmentation of teams apps and environments.
- Configure and optimize APM instrumentation Deep codelevel visibility PurePath distributed tracing Smartscape topology mapping and other advanced Dynatrace features to ensure fullstack observability.
- Build and maintain custom dashboards management zones tagging rules and entity metadata strategies.
- Develop and tune alerting profiles anomaly detection rules Davis AI configurations and auto-remediation workflows.
- Leverage Davis AI to automatically identify Root Cause using causal analysis correlate metrics logs traces and events to reduce noise and eliminate false positives.
- Build HTTP and Browser Synthetic Monitoring and performance baselines.
- Configure Real User Monitoring (RUM) for web and mobile applications including User journey analysis User experience insights and performance KPIs.
- Implement and manage log ingest pipelines log processing rules retention policies and Dynatrace Grail/Log Management features
- Integrate with GitHub Actions Jenkins ServiceNow PagerDuty and Teams
- Build OTel integrations and custom plugins.
- Implement CI/CD pipelines using tools such as GitHub Actions AWS CodePipeline and Jenkins.
- Automate infrastructure provisioning through Infrastructure-as-Code (IaC) using Terraform CloudFormation or AWS CDK.
- Develop self-service automation tools using Python or other scripting languages.
- Proficient in ITIL framework and ITSM tools such as ServiceNow.
- Production on-call responder with strong troubleshooting capabilities.
- Develop RCA documentation and Knowledge articles
- Apply SRE principles including SLIs SLOs and error budgets.
- Manage service accounts and access permissions
- Create deploy and manage digital certificates.
- Respond to security incidents and execute remediation tasks effectively.
- Bachelors degree in Computer Science Engineering or related field
- 9 years of Dynatrace implementation experience
- 5 years of experience in DevOps SRE or infrastructure roles
- Knowledge of Linux systems and networking.
- Working in a SAFe Agile delivery environment.
- Excellent written and verbal communication skills.
- Demonstrated ability to work independently and manage priorities.
- Availability to work outside of standard business hours as required.