Observability Engineer – SignalFX to Datadog Migration (Remote)
Jersey, NJ - USA
Job Summary
The Observability Engineer will play a key role in driving a value-based migration from SignalFX (Splunk Observability Cloud) to Datadog. This role focuses on rationalization, optimization, and stakeholder enablement, ensuring that monitoring solutions are cost-efficient, scalable, and aligned with business needs.
Key Responsibilities
1. Assessment & Rationalization
* Evaluate existing dashboards, alerts, and metrics in SignalFX.
* Identify what should be:
  * Migrated
  * Refactored
  * Retired
* Prioritize based on business impact and usage.
2. Migration & Optimization
* Migrate observability workloads from SignalFX to Datadog.
* Map legacy dashboards to Datadog-native dashboards and templates.
* Leverage AWS integrations and out-of-the-box features to reduce custom development.
* Optimize dashboards and metrics for performance and cost efficiency.
3. Stakeholder Enablement
* Collaborate with Infrastructure, Application, and DevOps teams.
* Educate stakeholders on:
  * Datadog capabilities
  * Best practices
* Provide training, documentation, and support for long-term ownership.
*Required Skills & Experience*
*Core Requirements*
* Strong hands-on experience with Datadog (dashboards, integrations, alerting).
* Experience with SignalFX / Splunk Observability Cloud.
* Proven experience in migration projects (SignalFX to Datadog preferred).
* Strong understanding of:
  * Distributed systems monitoring
  * SRE principles
  * Cloud-native observability
*Technical Skills*
* Python (minimum 3 years of experience)
* Ansible (working knowledge)
* Familiarity with OpenTelemetry
* Experience with:
  * AWS architecture and services
  * Kubernetes / OpenShift
  * Infrastructure-as-Code (IaC)
*Collaboration Skills*
* Experience working across:
  * Infrastructure teams
  * Application teams
  * DevOps teams
* Strong communication and stakeholder management skills.
*Preferred Qualifications*
* Certifications in:
  * Datadog
  * AWS
  * Observability platforms
* Experience with enterprise-scale monitoring transformations
*Nice-to-Have*
* Experience optimizing observability costs
* Exposure to automation-driven migrations (Ansible, IaC)
* Hands-on experience with OpenTelemetry instrumentation
*Must-Haves*
* Datadog (mandatory)
* SignalFX / Splunk Observability (mandatory)
* Migration experience (strongly preferred)
* Python, Ansible
* AWS, Kubernetes basics
* OpenTelemetry awareness