Site Reliability Engineer (SRE DevOps) Engineering Productivity

Not Interested
Bookmark
Report This Job

profile Job Location:

Any - Poland

profile Monthly Salary: Not Disclosed
Posted on: 17 hours ago
Vacancies: 1 Vacancy

Department:

Software Engineering

Job Summary

Who Youll Work With

Arista Networks is looking for a skilled professional for our Engineering Productivity (EngProd) team to help maintain and support our rapidly expanding infrastructure and internal user base. The ideal candidate is someone who can wear many hats is versatile and is enthusiastic about learning new technologies. As a part of the software engineering team you will work with other team members to design build and administer secure scalable and fault-tolerant tools and infrastructure in a hybrid cloud environment.

Working in the EngProd group you will collaborate and work with other engineers to design build scale and operate the systems used by Aristas product development teams.  These systems are based on industry-standards including  Ansible Artifactory Gerrit Jenkins Kubernetes Grafana Spinnaker MySQL ElasticSearch Google Cloud Varnish Perforce Gerrit etc 3rd party storage appliances as well as internal systems developed from the ground-up to automate CI/CD testing analysis and visualization.

What Youll Do

  • Build deploy safely and incrementally and operate critical production systems with focus on scalability reliability observability performance and security.
  • Monitor support and enhance developer experience across services.
  • Build automation to remove toil and efficiently operate production systems.
  • Proactively monitor respond to and enhance alerts and set up automated alert handling
  • Create and maintain the incident response runbooks.
  • Triage platform/infrastructural issues and help Arista software engineers in their triages. Engage with 3rd party vendor support.
  • Write postmortem documents and build solutions to avoid incidents from repeating.
  • Plan and communicate maintenance windows on production systems.
  • Work with Aristas product development teams to identify infrastructural issues that are causing bottlenecks and limitations in their workflows. Design and implement solutions to resolve them.
  • Survey and adopt best practices around infrastructure/platform to maintain secure scalable and fault-tolerant systems.
  • Study the design and sufficient implementation details of OSS systems for better triage and fix resolution.

Qualifications :

Essential Skills

  • At least BSc Computer Science or Engineering 3 years experience MS Computer Science or Engineering 3 years experience or equivalent work experience.
  • Knowledge of one or more of Go Python shell scripting to be able to implement medium complexity automation workflows.
  • Knowledge of Linux (or UNIX) from administration and debugging perspective
  • Hands-on experience in operating software systems (infrastructure complex applications etc) at scale
  • Experience in server provisioning (esp from storage and networking perspective).
  • Strong problem solving and software troubleshooting skills
  • Experience with infrastructure-as-code

 Desired Skills

  • Experience managing databases - mariadb postgres mongodb etc
  • Experience with docker and virtualization technologies - kvm qemu kata-containers etc
  • Experience managing monitoring stack - Prometheus Loki Tempo InfluxDB Grafana Thanos etc
  • Experience managing ElasticSearch clusters
  • Experience managing Artifactory docker registry etc
  • Experience managing CI/CD systems like ArgoCD Spinnaker etc
  • Experience managing version control systems like Perforce Gerrit etc
  • Experience with infrastructure-as-code frameworks like Ansible
  • Experience managing large Java applications
  • Experience in storage infrastructure management eg: NAS SAN Ceph etc

#LI-SZ1


Remote Work :

Yes


Employment Type :

Full-time

Who Youll Work WithArista Networks is looking for a skilled professional for our Engineering Productivity (EngProd) team to help maintain and support our rapidly expanding infrastructure and internal user base. The ideal candidate is someone who can wear many hats is versatile and is enthusiastic ab...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and sof ... View more

View Profile View Profile