- Manage and maintain server infrastructure across cloud (AWS Azure or GCP) and on-premise environments.
- Monitor system performance availability and security using monitoring tools (e.g. Nagios Zabbix Prometheus).
- Perform regular system updates patch management and configuration management using automation tools (e.g. Ansible Puppet or Chef).
- Design implement and manage backup and disaster recovery strategies to ensure data integrity and business continuity.
- Troubleshoot and resolve server-related issues promptly minimizing downtime and impact on business operations.
- Collaborate with development and security teams to support application deployment scaling and secure configuration.
- Enforce security policies conduct vulnerability assessments and ensure compliance with data protection standards.
- Document system architecture procedures and incident resolutions for knowledge sharing and audit readiness.
- Participate in capacity planning and infrastructure optimization to support business growth and scalability.
- Provide technical guidance and mentorship to junior IT staff as needed.
Requirements
- Bachelors degree in Computer Science Information Technology or a related field.
- 25 years of hands-on experience in server infrastructure management system administration or IT operations.
- Proven experience with Linux/Unix and Windows Server environments.
- Strong knowledge of virtualization technologies (e.g. VMware Hyper-V) and containerization (e.g. Docker Kubernetes).
- Familiarity with cloud platforms (AWS Azure or GCP) and cloud-native services.
- Proficiency in scripting languages (e.g. Bash Python PowerShell) for automation and system management.
- Experience with configuration management and infrastructure-as-code tools (e.g. Ansible Terraform).
- Understanding of networking fundamentals firewalls load balancers and DNS management.
- Demonstrated ability to manage and troubleshoot complex system issues under pressure.
- Strong analytical problem-solving and communication skills.
Manage and maintain server infrastructure across cloud (AWS Azure or GCP) and on-premise environments.Monitor system performance availability and security using monitoring tools (e.g. Nagios Zabbix Prometheus).Perform regular system updates patch management and configuration management using automat...
- Manage and maintain server infrastructure across cloud (AWS Azure or GCP) and on-premise environments.
- Monitor system performance availability and security using monitoring tools (e.g. Nagios Zabbix Prometheus).
- Perform regular system updates patch management and configuration management using automation tools (e.g. Ansible Puppet or Chef).
- Design implement and manage backup and disaster recovery strategies to ensure data integrity and business continuity.
- Troubleshoot and resolve server-related issues promptly minimizing downtime and impact on business operations.
- Collaborate with development and security teams to support application deployment scaling and secure configuration.
- Enforce security policies conduct vulnerability assessments and ensure compliance with data protection standards.
- Document system architecture procedures and incident resolutions for knowledge sharing and audit readiness.
- Participate in capacity planning and infrastructure optimization to support business growth and scalability.
- Provide technical guidance and mentorship to junior IT staff as needed.
Requirements
- Bachelors degree in Computer Science Information Technology or a related field.
- 25 years of hands-on experience in server infrastructure management system administration or IT operations.
- Proven experience with Linux/Unix and Windows Server environments.
- Strong knowledge of virtualization technologies (e.g. VMware Hyper-V) and containerization (e.g. Docker Kubernetes).
- Familiarity with cloud platforms (AWS Azure or GCP) and cloud-native services.
- Proficiency in scripting languages (e.g. Bash Python PowerShell) for automation and system management.
- Experience with configuration management and infrastructure-as-code tools (e.g. Ansible Terraform).
- Understanding of networking fundamentals firewalls load balancers and DNS management.
- Demonstrated ability to manage and troubleshoot complex system issues under pressure.
- Strong analytical problem-solving and communication skills.
View more
View less