We are seeking a Subject Matter Expert (SME) with strong hands-on experience in tools administration and maintenance to provide job scheduling, infrastructure observability, configuration management, and automation capabilities for IT infrastructure management. This role will be responsible for the design, administration, troubleshooting, and automation of critical enterprise tools, including Control-M, SolarWinds, BMC BladeLogic, and StackStorm, ensuring stable operations and improved operational efficiency.
Note: We are looking for a tools administrator, not a tool user/operator.
Key Responsibilities
Job Scheduling & Workload Automation (Control-M)
Administer and support the Control-M Server and Control-M EM environments
Create, modify, and maintain batch jobs, calendars, and dependencies
Troubleshoot job failures and performance issues
Partner with application teams to optimize batch processing and reduce failures
Observability (SolarWinds)
Administer and maintain the SolarWinds Orion platform
Configure and maintain SolarWinds to monitor servers, network devices, databases, and applications
Set up alerts, thresholds, dashboards, and reports
Analyze performance trends and proactively identify issues
Ensure monitoring coverage aligns with SLA and business requirements
Automation & Orchestration (StackStorm)
Design, build, and maintain automated workflows for infrastructure and operational tasks
Integrate StackStorm with monitoring, ticketing, and cloud platforms (e.g., ServiceNow, AWS, Azure)
Develop reusable actions, rules, and workflows to reduce manual effort
Support event-driven automation for incident response and self-healing use cases
Experience with cloud platforms (AWS and/or Azure) is a plus
Familiarity with ServiceNow integrations is desirable
Operational Knowledge
Strong understanding of IT operations, monitoring, automation, and job scheduling
Experience working in 24x7 enterprise environments
Knowledge of ITIL processes (Incident, Change, and Problem Management)
Soft Skills
Strong troubleshooting and analytical skills
Ability to work independently and handle complex issues
Clear communication skills with technical and non-technical teams
Detail-oriented, with a focus on reliability and stability
Role: Tools SME
Duration: 6 Months
Location: Dallas, TX (Remote)