In this role you will maintain and deploy critical infrastructure components and work across Linux systems and network infrastructure to ensure uptime and scalability. Youllcollaborate with developers SREs and network engineers to set guardrails (testingsecurity rollout policy) instrument systems for visibility and continuously improverelease practices. You will also share responsibility for on-call support responding toincidents in real time and helping drive post-incident improvements to strengthenreliability.
- Good understanding of Linux systems basic production troubleshooting and core networking concepts (TCP/IP HTTP DNS DHCP Proxy)
- Some hands-on experience with observability tools like Prometheus Grafana and Splunk
- Familiar with containerization and orchestration technologies (Docker Kubernetes)
- Comfortable supporting on-call rotations assisting with alerting improvements and contributing to post-incident reviews
- Familiarity with CDN DNS and proxies/load balancers
- Exposure to CI/CD platforms (GitHub Actions GitLab CI Jenkins etc.)
- Some experience with physical server hardware including configuration deployment firmware updates or hardware troubleshooting
- Bachelors degree in CS/Engineering or equivalent practical experience; certifications such as RHCSA/LPIC and CKA is a plus.
In this role you will maintain and deploy critical infrastructure components and work across Linux systems and network infrastructure to ensure uptime and scalability. Youllcollaborate with developers SREs and network engineers to set guardrails (testingsecurity rollout policy) instrument systems fo...
In this role you will maintain and deploy critical infrastructure components and work across Linux systems and network infrastructure to ensure uptime and scalability. Youllcollaborate with developers SREs and network engineers to set guardrails (testingsecurity rollout policy) instrument systems for visibility and continuously improverelease practices. You will also share responsibility for on-call support responding toincidents in real time and helping drive post-incident improvements to strengthenreliability.
- Good understanding of Linux systems basic production troubleshooting and core networking concepts (TCP/IP HTTP DNS DHCP Proxy)
- Some hands-on experience with observability tools like Prometheus Grafana and Splunk
- Familiar with containerization and orchestration technologies (Docker Kubernetes)
- Comfortable supporting on-call rotations assisting with alerting improvements and contributing to post-incident reviews
- Familiarity with CDN DNS and proxies/load balancers
- Exposure to CI/CD platforms (GitHub Actions GitLab CI Jenkins etc.)
- Some experience with physical server hardware including configuration deployment firmware updates or hardware troubleshooting
- Bachelors degree in CS/Engineering or equivalent practical experience; certifications such as RHCSA/LPIC and CKA is a plus.
View more
View less