Role: Server Specialist III
Location: Houston, TX (Onsite)
Duration: Contract
Client: Enterprise Products
12 Exp
Description:
Must Have:
- Clear, Actionable Communication: Demonstrated ability to communicate operational status, incident impact, and resolution steps across technical and business stakeholders, especially during shift handoffs and escalations.
- Hands-on NOC Experience: Proven experience in a Network Operations Center with direct involvement in alert triage and incident response.
- Infrastructure Monitoring: Strong familiarity with tools like SolarWinds Orion, Dynatrace, or equivalent platforms to monitor system health and suppress noise during maintenance windows.
- Reliability and Shift Discipline: Consistent performance during assigned shifts, including punctuality, accountability, and participation in on-call rotations.
- Server Troubleshooting Skills: Ability to diagnose and resolve issues across Windows and Linux environments.
Nice To Have:
- Basic Scripting Awareness: Familiarity with PowerShell or Bash for simple automation or log parsing tasks (not required, but helpful for efficiency).
- Certifications: Industry credentials such as CompTIA Server+, Microsoft Certified: Azure Administrator, or Red Hat Certified System Administrator.
- ITSM and Ticketing: Experience with Helix Remedy or similar platforms for incident tracking and change control.
- Virtualization Experience: Exposure to VMware or Hyper-V environments for server provisioning and troubleshooting.
JOB DESCRIPTION:
We're looking for a dedicated, detail-oriented Analyst to join our Network Operations Center (NOC) team within the IT Server Operations group. This role is ideal for someone who thrives in a structured, fast-paced environment and brings hands-on experience in NOC workflows, server troubleshooting, and infrastructure support. The position involves set shift work, participation in an on-call rotation, and a key role in monitoring, alerting, and incident response. Success in this role means keeping systems stable, communicating clearly across teams, and driving operational efficiency through timely reporting and stakeholder engagement.
Key Responsibilities:
Monitoring, Alerting, and Incident Response:
o Monitor infrastructure health using tools like SolarWinds Orion, Dynatrace, or similar platforms.
o Respond to alerts with urgency and precision, escalating appropriately and resolving issues within SLA targets.
o Log incidents thoroughly, capturing actions and outcomes to support root cause analysis, post-mortems, and knowledge sharing.
o Participate in a structured on-call rotation for after-hours support.
o Execute maintenance window tasks by validating application checkouts, confirming maintenance mode, and suppressing alert noise per the implementation schedule.
Server Operations and Troubleshooting:
o Perform hands-on diagnostics and remediation for Windows and Linux servers in both physical and virtual environments.
o Maintain accurate documentation of assets, configurations, and operational standards.
o Troubleshoot technical issues, create and manage support tickets, and coordinate onsite assistance with vendor teams.
Reporting, Communication, and Engagement:
o Provide clear, concise updates during shift handoffs and operational briefings to ensure continuity and transparency.
o Collaborate with cross-functional teams to align on incident priorities, escalation paths, and service impact.
o Partner with stakeholders to understand key performance metrics and tailor reporting and alerting solutions that align with specific application needs and infrastructure environments.
o Track key operational metrics for internal stakeholders, highlighting areas of improvement and risk.
Security and Compliance:
o Apply server security best practices and respond to vulnerability alerts in a timely manner.
o Ensure all operational activities comply with internal policies and external regulatory standards.
Operational Excellence & Reliability:
o Identify recurring issues and contribute to preventive measures that improve system reliability and reduce noise.
o Help refine and maintain runbooks and escalation workflows to support consistent execution.
o Uphold high standards of punctuality, ownership, and accountability during assigned shifts.
Required Qualifications:
- 3 years of hands-on experience in NOC or server operations roles.
- Strong working knowledge of Windows Server and Linux environments.
- Experience with infrastructure monitoring and alerting tools.
- Familiarity with data center operations including hardware support.
- Solid understanding of networking fundamentals (TCP/IP, DNS, DHCP).
- Excellent troubleshooting, documentation, and communication skills.
- Comfortable working set shifts and participating in an on-call rotation.
Preferred Qualifications:
- Experience with SolarWinds Orion, Dynatrace, or similar observability platforms.
- Exposure to virtualization technologies such as VMware or Hyper-V.
- Familiarity with ITSM practices and ticketing systems (e.g., ServiceNow, Remedy).
- Relevant certifications (Microsoft, CompTIA Server+, Red Hat, etc.).
Top Daily Tasks:
- Orion Alerting Management: Manage and tune SolarWinds Orion alerts to ensure clear, actionable signals and suppress noise during maintenance windows.
- Email and Notification Triaging: Prioritize incoming alerts and system notifications, escalating critical issues and maintaining stakeholder and NOC-wide awareness.
- Failover and Failback Execution: Perform and validate system failovers and failbacks, ensuring service continuity and proper documentation.
- NOC Phone Support: Provide responsive phone support for infrastructure incidents, service requests, and operational escalations.
- DNS Entry Management: Create and update DNS records to support infrastructure changes, update DNS based on failovers and application status, and set forwarding zones in Infoblox.
Note: Onsite support only; no remote/hybrid option. If the candidate is not local, remind them that relocation is at their own expense.