Job Title: Performance & Observability Engineer (Dynatrace & Splunk)
Location: Plano TX / Charlotte NC (Hybrid 3 days/week onsite)
Type: FTE
Job Description
We are seeking a Performance & Observability Engineer to design build and maintain real time dashboards that deliver actionable insights into application performance infrastructure health and business level KPIs. Using Dynatrace and Splunk you will translate complex monitoring data into clear user friendly views for operations SRE and leadership teams. This role is ideal for an engineer who enjoys turning telemetry logs and metrics into operational clarity and performance improvements.
Core Responsibilities
- Design build and customize Dynatrace dashboards for application performance infrastructure health services and business relevant KPIs.
- Develop and enhance Splunk dashboards and visualizations for log analytics incident monitoring and operational insights (searches charts operational views).
- Collaborate closely with application infrastructure and SRE teams to gather requirements and translate business and technical needs into observability dashboards and alerts.
- Configure and manage technical and business KPIs within Dynatrace and Splunk including service level indicators error rates latency and availability metrics.
- Support proactive monitoring incident RCA and performance troubleshooting by providing dashboard driven data rich investigations for on call and L3 support teams.
- Partner with operations and leadership to refine metrics reduce alert noise and improve incident response time via better visibility.
- Maintain dashboard standards documentation and runbooks to ensure reuse and consistency across teams and environments.
Must Have Skills
- Strong hands on experience with Dynatrace dashboard creation and customization including APM infrastructure service level and KPI views.
- Strong hands on experience with Splunk dashboards SPL searches visualizations and operational views.
- Experience working in production SRE or L3 support environments with a focus on stability performance and incident response.
- Ability to translate raw monitoring and log data into actionable insights for operations teams and leadership (e.g. SLIs SLOs error budgets business impact views).
Good to Have Skills
- Exposure to incident management root cause analysis (RCA) and performance troubleshooting processes.
- Scripting knowledge (e.g. Python Shell) for automating data collection enrichment and observability workflows.
- Familiarity with metrics pipelines log forwarding and alerting frameworks (e.g. integrations between Dynatrace and Splunk or with SIEM/ITSM tools).
Keywords: Performance & Observability Engineer Observability Engineer SRE Monitoring Engineer Dynatrace Splunk Dynatrace dashboard Dynatrace visualization Splunk dashboard Splunk visualization SPL APM application performance infrastructure health log analytics RCA root cause analysis incident management SRE production support L3 support KPIs SLIs SLOs error budgets latency availability dashboard design metrics pipeline log forwarding alerting scripting Python Shell performance tuning operational insights business KPIs real time monitoring cloud operations IT operations telemetry observability pipeline
About VDart Group
VDart Group is a global leader in technology product and talent solutions serving Fortune 500 clients in 13 countries. With over 4000 professionals worldwide we deliver innovation operational excellence and measurable outcomes across industries. Guided by our commitment to People Purpose and Planet VDart is recognized with an EcoVadis Bronze Medal and as a UN Global Compact member reflecting our dedication to sustainable practices.
Job Title: Performance & Observability Engineer (Dynatrace & Splunk) Location: Plano TX / Charlotte NC (Hybrid 3 days/week onsite) Type: FTE Job Description We are seeking a Performance & Observability Engineer to design build and maintain real time dashboards that deliver actionable in...
Job Title: Performance & Observability Engineer (Dynatrace & Splunk)
Location: Plano TX / Charlotte NC (Hybrid 3 days/week onsite)
Type: FTE
Job Description
We are seeking a Performance & Observability Engineer to design build and maintain real time dashboards that deliver actionable insights into application performance infrastructure health and business level KPIs. Using Dynatrace and Splunk you will translate complex monitoring data into clear user friendly views for operations SRE and leadership teams. This role is ideal for an engineer who enjoys turning telemetry logs and metrics into operational clarity and performance improvements.
Core Responsibilities
- Design build and customize Dynatrace dashboards for application performance infrastructure health services and business relevant KPIs.
- Develop and enhance Splunk dashboards and visualizations for log analytics incident monitoring and operational insights (searches charts operational views).
- Collaborate closely with application infrastructure and SRE teams to gather requirements and translate business and technical needs into observability dashboards and alerts.
- Configure and manage technical and business KPIs within Dynatrace and Splunk including service level indicators error rates latency and availability metrics.
- Support proactive monitoring incident RCA and performance troubleshooting by providing dashboard driven data rich investigations for on call and L3 support teams.
- Partner with operations and leadership to refine metrics reduce alert noise and improve incident response time via better visibility.
- Maintain dashboard standards documentation and runbooks to ensure reuse and consistency across teams and environments.
Must Have Skills
- Strong hands on experience with Dynatrace dashboard creation and customization including APM infrastructure service level and KPI views.
- Strong hands on experience with Splunk dashboards SPL searches visualizations and operational views.
- Experience working in production SRE or L3 support environments with a focus on stability performance and incident response.
- Ability to translate raw monitoring and log data into actionable insights for operations teams and leadership (e.g. SLIs SLOs error budgets business impact views).
Good to Have Skills
- Exposure to incident management root cause analysis (RCA) and performance troubleshooting processes.
- Scripting knowledge (e.g. Python Shell) for automating data collection enrichment and observability workflows.
- Familiarity with metrics pipelines log forwarding and alerting frameworks (e.g. integrations between Dynatrace and Splunk or with SIEM/ITSM tools).
Keywords: Performance & Observability Engineer Observability Engineer SRE Monitoring Engineer Dynatrace Splunk Dynatrace dashboard Dynatrace visualization Splunk dashboard Splunk visualization SPL APM application performance infrastructure health log analytics RCA root cause analysis incident management SRE production support L3 support KPIs SLIs SLOs error budgets latency availability dashboard design metrics pipeline log forwarding alerting scripting Python Shell performance tuning operational insights business KPIs real time monitoring cloud operations IT operations telemetry observability pipeline
About VDart Group
VDart Group is a global leader in technology product and talent solutions serving Fortune 500 clients in 13 countries. With over 4000 professionals worldwide we deliver innovation operational excellence and measurable outcomes across industries. Guided by our commitment to People Purpose and Planet VDart is recognized with an EcoVadis Bronze Medal and as a UN Global Compact member reflecting our dedication to sustainable practices.
View more
View less