Must Have
Linux; Oracle; bash scripting; Splunk Dynatrace Ansible
Job Summary:
The role is responsible for supporting an IT applications operations team with effective monitoring and event management solutions. Ensure that processes and solutions meet the demanding nature of distributed applications across local cloud-based and virtualized environments.
Primary Responsibilities:
- Maintain tools automation and frameworks supporting monitoring batch and reporting functions.
- Maintain install and configure all deployments.
- Perform scheduled maintenance install patches and upgrade software packages as needed in a large mixed platform environment.
- Implement efficient event management processes and automation;
- Configure and maintain central monitoring platforms;
- Perform daily health checks and break/fix support on monitoring platforms;
- Engineering and support of monitoring tools.
- Engineer develop and integrate in-house solutions to collect monitor and present data to a variety of reporting structures.
- Maintain and support third party monitoring solutions.
- Troubleshooting Incident escalation and problem solving
- Design upgrade and support production monitoring tools.
- Periodically evaluate new batch and monitoring industry solutions and technologies.
Qualifications:
- Technical Degree or related work experience
- Experience in IT operations
- Experience with Systems Administration.
- Experience with scripting/programming and engineering of infrastructure batch and monitoring tools.