Role : Tools SME
Duration : 6 Months
Location : Dallas TX - Remote
Role Summary:
We are seeking a Subject Matter Expert (SME) with strong hands on experience on tools administration and maintenance to provide job scheduling infrastructure observability configuration management and automation capabilities for IT Infrastructure management. This role will be responsible for the design administration troubleshooting and automation of critical enterprise tools including Control M SolarWinds BMC BladeLogic and StackStorm ensuring stable operations and improved operational efficiency.
Note: we are looking for tools administrator and not the tool user/operator.
Key Responsibilities
Job Scheduling & Workload Automation (Control M)
- Administer and support Control M Server and Control-M EM environment.
- Create modify and maintain batch jobs calendars and dependencies
- Troubleshoot job failures and performance issues
- Partner with application teams to optimize batch processing and reduce failures
Observability (SolarWinds)
- Administration and Maintenance of SolarWinds Orion platform.
- Configure and maintain SolarWinds configuration to monitor for servers network devices databases and applications
- Set up alerts thresholds dashboards and reports
- Analyze performance trends and proactively identify issues
- Ensure monitoring coverage aligns with SLA and business requirements
Automation & Orchestration (StackStorm)
- Design build and maintain automated workflows for infrastructure and operational tasks
- Integrate StackStorm with monitoring ticketing and cloud platforms (e.g. ServiceNow AWS Azure)
- Develop reusable actions rules and workflows to reduce manual effort
- Support event driven automation for incident response and self healing use cases
Configuration Management & Compliance (BMC BladeLogic)
- Administration and configuration of BMC BladeLogic and BMC TrueSight reporting server.
- Manage patch policies and configuration for OS and databases using BladeLogic
- Automate patching compliance checks and remediation
- Ensure configuration consistency across environments (Prod / Non Prod)
- Support audit security and compliance initiatives
- Packaging of third-party applications and deployment through bladelogic.
Operational & Governance Responsibilities
- Participate in incident problem and change management processes (ITIL aligned)
- Provide Tier 3 support and root cause analysis for tool related issues
- Create and maintain documentation SOPs and runbooks
- Mentor junior engineers and provide cross training
- Collaborate with security cloud network and application teams
Required Skills & Experience
Technical Skills
- Strong hands on experience with:
- Control M
- SolarWinds
- BMC BladeLogic
- StackStorm
- Solid understanding of Linux and Windows server environments
- Scripting experience (Python Bash PowerShell preferred)
- Experience with cloud platforms (AWS and/or Azure) is a plus
- Familiarity with ServiceNow integrations is desirable
Operational Knowledge
- Strong understanding of IT operations monitoring automation and job scheduling
- Experience working in 24x7 enterprise environments
- Knowledge of ITIL processes (Incident Change Problem Management)
Soft Skills
- Strong troubleshooting and analytical skills
- Ability to work independently and handle complex issues
- Clear communication skills with technical and non technical teams
- Detail oriented with a focus on reliability and stability
Role : Tools SME Duration : 6 Months Location : Dallas TX - Remote Role Summary: We are seeking a Subject Matter Expert (SME) with strong hands on experience on tools administration and maintenance to provide job scheduling infrastructure observability configuration management and automation capa...
Role : Tools SME
Duration : 6 Months
Location : Dallas TX - Remote
Role Summary:
We are seeking a Subject Matter Expert (SME) with strong hands on experience on tools administration and maintenance to provide job scheduling infrastructure observability configuration management and automation capabilities for IT Infrastructure management. This role will be responsible for the design administration troubleshooting and automation of critical enterprise tools including Control M SolarWinds BMC BladeLogic and StackStorm ensuring stable operations and improved operational efficiency.
Note: we are looking for tools administrator and not the tool user/operator.
Key Responsibilities
Job Scheduling & Workload Automation (Control M)
- Administer and support Control M Server and Control-M EM environment.
- Create modify and maintain batch jobs calendars and dependencies
- Troubleshoot job failures and performance issues
- Partner with application teams to optimize batch processing and reduce failures
Observability (SolarWinds)
- Administration and Maintenance of SolarWinds Orion platform.
- Configure and maintain SolarWinds configuration to monitor for servers network devices databases and applications
- Set up alerts thresholds dashboards and reports
- Analyze performance trends and proactively identify issues
- Ensure monitoring coverage aligns with SLA and business requirements
Automation & Orchestration (StackStorm)
- Design build and maintain automated workflows for infrastructure and operational tasks
- Integrate StackStorm with monitoring ticketing and cloud platforms (e.g. ServiceNow AWS Azure)
- Develop reusable actions rules and workflows to reduce manual effort
- Support event driven automation for incident response and self healing use cases
Configuration Management & Compliance (BMC BladeLogic)
- Administration and configuration of BMC BladeLogic and BMC TrueSight reporting server.
- Manage patch policies and configuration for OS and databases using BladeLogic
- Automate patching compliance checks and remediation
- Ensure configuration consistency across environments (Prod / Non Prod)
- Support audit security and compliance initiatives
- Packaging of third-party applications and deployment through bladelogic.
Operational & Governance Responsibilities
- Participate in incident problem and change management processes (ITIL aligned)
- Provide Tier 3 support and root cause analysis for tool related issues
- Create and maintain documentation SOPs and runbooks
- Mentor junior engineers and provide cross training
- Collaborate with security cloud network and application teams
Required Skills & Experience
Technical Skills
- Strong hands on experience with:
- Control M
- SolarWinds
- BMC BladeLogic
- StackStorm
- Solid understanding of Linux and Windows server environments
- Scripting experience (Python Bash PowerShell preferred)
- Experience with cloud platforms (AWS and/or Azure) is a plus
- Familiarity with ServiceNow integrations is desirable
Operational Knowledge
- Strong understanding of IT operations monitoring automation and job scheduling
- Experience working in 24x7 enterprise environments
- Knowledge of ITIL processes (Incident Change Problem Management)
Soft Skills
- Strong troubleshooting and analytical skills
- Ability to work independently and handle complex issues
- Clear communication skills with technical and non technical teams
- Detail oriented with a focus on reliability and stability
View more
View less