Job Summary (List Format):
- Lead the design implementation and maintenance of monitoring observability and incident management solutions for cloud-based infrastructure and applications.
- Oversee integration of Datadog ServiceNow and AWS platforms.
- Manage deployment optimization and administration of AWS cloud services.
- Develop and maintain complex Datadog dashboards integrations and custom metrics.
- Integrate Datadog with ServiceNow for incident management event management and CMDB processes.
- Guide and mentor teams or projects within cloud operations or DevOps environments.
- Apply strong scripting and automation skills (Python Bash or similar).
- Ensure adherence to networking and security best practices; troubleshoot complex distributed cloud architectures.
- Support hybrid work environment with occasional on-site presence in Reston VA.