Key ResponsibilitiesSet up and maintain monitoring dashboards for ETL jobs using Datadog including metrics logs and daily ETL workflows and proactively detect and resolve data pipeline failures or performance Datadog Monitors for job status (success/failure) job duration resource utilization and error closely with Data Engineering teams to onboard new pipelines and ensure observability best Datadog with root cause analysis of ETL failures and performance thresholds baselines and anomaly detection settings in Datadog to reduce false incident handling procedures and contribute to improving overall ETL monitoring in on call rotations or scheduled support windows to manage ETL Skills & Qualifications3 years of experience in ETL/data pipeline monitoring preferably in a cloud or hybrid in using Datadog for metrics logging alerting and understanding of ETL concepts and tools (e.g. Airflow Informatica Talend AWS Glue or dbt).Familiarity with SQL and querying large working with Python Shell scripting or Bash for automation and log of cloud platforms (AWS/GCP/Azure) and services like S3 Redshift BigQuery of CI/CD and DevOps principles related to data infrastructure QualificationsExperience with distributed tracing and APM in experience monitoring Spark Kafka or streaming with ticketing tools (e.g. Jira ServiceNow) and incident management workflows.