In the role as Linux Operations Engineer in Digital Data &IT R&D Solutions you will contribute to our purpose of Bringing People Closer. You will own and operate GNs Linux infrastructure and high-performance simulation cluster as well as on-premises Kubernetes environments. You will ensure reliability performance and security of the environment administer SLURM scheduling automate with Ansible and provide responsive support to R&D and engineering teams running compute-intensive simulations. This role blends daily operations incident/problem management and continuous improvement through automation and observability.
The team you will be part of
You will be part of Engagement & Tech Services where we collaborate closely with R&D and engineering teams and external vendors/consultants. We value a customer-centric mindset clear communication with technical and non-technical stakeholders ownership accountability bias for action and continuous improvement. We promote a team culture that values inclusivity respect and a collaborative approach to problem-solving and operational practices.
Your contribution is appreciated and you will:
Operate and support the Linux server estate Kubernetes environments and the HPC simulation cluster to maintain high availability and performance. This includes servers supporting development build pipelines git database (PostgreSQL) and general lab environments.
Administer SLURM clusters: partitions QoS job policies fair-share accounting (sacct slurmdbd) reservations and user quotas.
Administer Kubernetes clusters: configure RBAC and set up GitOps automation (ArgoCD/Helm).
Manage Kubernetes clusters to ensure optimal performance scalability and security in production.
Provision and maintain compute nodes (kickstart) Kubernetes clusters and SLURM clusters.
Oversee patching and vulnerability management for Linux and Kubernetes environments ensuring compliance and timely remediation
Monitor and tune performance (CPU memory I/O network job throughput) and lead capacity planning across Linux and Kubernetes.
Integrate Linux hosts with identity services (LDAP/AD/Entra) and manage access controls (sudo policies SSH keys).
Maintain observability stacks (Prometheus/Grafana/ELK/OpenSearch) creating actionable alerts and dashboards.
Own incident problem and change management processes (ITIL) including root cause analysis and post-incident reviews.
Document standards runbooks architecture and procedures; keep knowledge current and accessible.
Coordinate vendor engagements and license services for simulation tools and infrastructure components.
Participate in an on-call rotation for critical Linux/HPC and Kubernetes services.
To perform well in the role we imagine that you:
Bring an extensive background in Linux systems administration/operations and Kubernetes management across a diverse server fleet
Have experience with HPC clusters and Kubernetes environments (e.g. SLURM).
Are skilled in Ansible for configuration management and automation (roles playbooks CI for lint/test preferred).
Are proficient in scripting (Bash/Python) to automate routine tasks and diagnostics.
Understand networking fundamentals (TCP/IP DNS DHCP routing firewalls) in Linux/Kubernetes contexts.
Have practical experience with monitoring and logging in production (Prometheus/Grafana ELK/OpenSearch or equivalent)
Demonstrate troubleshooting across OS scheduler storage applications and Kubernetes environments.
Are familiar with ITSM practices (incident/problem/change) and value disciplined documentation and runbooks.
Have professional working proficiency in English.
It is beneficial to have experience with:
Kubernetes certification or equivalent practical experience.
CI/CD for infrastructure (GitLab CI/GitHub Actions) GitOps practices; Infrastructure as Code with Ansible.
Containerization (Docker Kubernetes) supporting simulation workflows.
Source control systems like GitHub or GitLab.
License server administration for engineering/simulation tools and Kubernetes services; vendor management.
Security frameworks and hardening (CIS benchmarks) vulnerability scanning and remediation.
A Bachelors degree in Computer Science Engineering or equivalent practical experience.
Tools & Technologies: SLURM Ansible Bash/Python Kubernetes Git/GitLab/GitHub Prometheus/Grafana or ELK/OpenSearch PXE/Kickstart LDAP/Active Directory/Entra Docker PostgreSQL Linux MUNGE.
At GN we pride ourselves on encouraging flexible working whenever possible. We trust our people to carry out their tasks to know when in-person collaboration is better than hybrid and to be present when its needed most.
We encourage you to apply
Even if you dont match all the above-mentioned skills we welcome your application if you think you have transferrable skills. We highly value a mindset and motivation that align with our core values to not only ensure growth for you but for your team and the wider GN organization as well.
We are focused on an inclusive recruitment process
All applicants will receive equal consideration for employment. As such we encourage you to submit your CV without a photo to ensure an equal and fair application process.
Should you have any special requirements for the interview please let the Hiring Manager know upon accepting invitation to interview.
How to apply
Use the APPLY link. Applications are assessed on a continuous basis so dont wait to send yours.
On a time crunch Feel free to only submit your up-to-date CV including a few sentences outlining your motivation for applying quick and easy.
Join us in bringing people closer
GN brings people closer through our advanced intelligent hearing audio video and gaming solutions. Inspired by people and motivated by innovation we deliver technology that enhance the senses of hearing and sight. We enable people with hearing loss overcome real-life problems improve communication and collaboration for businesses and provide great experiences for audio and gaming enthusiasts.
We hope you will join us on this journey and look forward to receiving your application.
#LI-GNGroup
SteelSeries is a leading manufacturer of gaming peripherals and accessories, including headsets, keyboards, mice, and mousepads.