Site Reliability Engineer – UDF

F5 Networks

Not Interested
Bookmark
Report This Job

profile Job Location:

Seattle, OR - USA

profile Monthly Salary: $ 137600 - 206400
Posted on: Yesterday
Vacancies: 1 Vacancy

Job Summary

At F5 we strive to bring a better digital world to life. Our teams empower organizations across the globe to create secure and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity from protecting consumers from fraud to enabling companies to focus on innovation.

Everything we do centers around people. That means we obsess over how to make the lives of our customers and their customers better. And it means we prioritize a diverse F5 community where each individual can thrive.

Position Summary

This role will be a new member of our UnifiedDemo Framework(UDF)platformteamsupporting thelaunch and managementoftheF5 Guardrails andRedteamproduct linesinto UDF.Therole will focus on designing deploying and supportingKubernetes environmentsthat support a wide variety of usecases across many F5 teams. As a technical expert the SRE will work closely with cross-functional teams to instantiate AI featuresoptimizesystem performance and ensure reliability in production environments.

The ideal candidate will have deepexpertiseinKubernetesorchestration containerized architecturesandbuilds and runs systems with an operational excellence mindset. This individual will play a critical role in advancing the operational maturity and scalabilityof the UDF platform andensureour abilityto incorporate new F5 product lines and features.

Key Responsibilities

Kubernetes Orchestration and Management

  • Design deploy and manage Kubernetes clusters and ensure efficient container orchestration to support AI workloads.

  • Implement andmaintainKubernetes-based deployment pipelines

  • Optimizeresource allocation within Kubernetesclusterswhile reducing costs and maximizing performance.

  • Develop andmaintainhigh-availability and fault-tolerant Kubernetes architectures to ensure service continuity

Observability and Monitoring

  • Design and implement observability pipelines for real-time monitoring ofKubernetes clusters including metrics collection forscaling resourceutilization and system health.

  • Leverage tools such asCloudwatchDataDogGrafana or similar platforms to ensure visibility into Kubernetes-managedworkloads

  • Establish logging tracing and alerting strategies to enable proactive identification and resolution of performance or reliability issues.

Automation and Scalability

  • Automate infrastructure management tasks to support the efficient deployment and operation of AI functionalities including upgrades scaling and provisioning.

  • Support Infrastructure-as-Code (IaC) methodologies for the provisioning and configuration of environmentsleveragingtools such as Terraform or Helm.

  • Contribute to the development of CI/CD workflows tailoredfor automatic scaling and effective change management practices

Collaboration and Process Improvement

  • Collaborate withproduct teams and sales engineering to integrate F5 products into the UDF platform and ensure effectiveutilizationbythe sales organization.

  • Support root cause analysis (RCA) processes for issues affectingthe UDF platform driving long-term corrective actions to improve system reliability.

  • Provide technicalexpertiseto designoperational workflows and procedures that improve the agility and stability ofthe UDF platform.

Required Qualifications

  • Education:Bachelors degree in Computer Science Software Engineering or a related technical field (or equivalent experience).

  • Experience:

  • 4 years of experience in Site Reliability Engineering (SRE) DevOps or similar roles with a focus oncontainer management and AWS usage.

  • Strongexpertisein managing Kubernetes clusters and containerized workloads in production environments.

  • Hands-on experience deploying and managingKubernetes environments in AWS especially using EKS as well asin self-hosted ecosystems such ason-premisedatacenters.

  • Proficient in monitoring and observability tools includingCloudWatch GrafanaFluentdDataDog or equivalent platforms.

  • Expertisewith Infrastructure-as-Code (IaC) tools such as TerraformHelm orCloudFormation and CI/CD frameworks.

  • Solid understanding of networking storage andcomputeinfrastructure within containerized environments.

  • Proficiencyin coding and scripting languages including Python Go or Bash withfocuson automation and system integration.

  • Expertisein applying security best practices to Kubernetes environments including data protection and resource access controls.

  • Familiarity with GPU-based workloads in Kubernetes environments and optimization strategies for AIbased workloads.

  • Experience with orchestrating troubleshootingbest practicesandoptimizing complex network environments in AWSand GCP VPCs.

  • Experience working with hypervisors in GCP VPCs

Preferred Qualifications

  • Certifications:

  • Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD).

  • Relevant cloud certifications such as AWS Certified Solutions Architect orGCP Cloud Architectcertifications.

  • Familiarity with advanced Kubernetes tools and techniques such as service mesh technologies (IstioLinkerd) or Kubernetes operators for machine learning workflows.

  • Knowledge of distributed computing concepts and experience supporting large-scale AI workloads.

  • Practical experience integrating observability and monitoring into pipelines for inference engines and machine learning models.

#LI-Hybrid #LI-EM1

The Job Description is intended to be a general representation of the responsibilities and requirements of the job. However the description may not be all-inclusive and responsibilities and requirements are subject to change.

The annual base pay for this position is: $137600.00 - $206400.00

F5 maintains broad salary ranges for its roles in order to account for variations in knowledge skills experience geographic locations and market conditions as well as to reflect F5s differing products industries and lines of business. The pay range referenced is as of the time of the job posting and is subject to change.

You may also be offered incentive compensation bonus restricted stock units and benefits. More details about F5s benefits can be found at the following link: F5 reserves the right to change or terminate any benefit plan without notice.

Please note that F5 only contacts candidates through F5 email address (ending with @) or auto email notification from Workday (ending with or @).

Equal Employment Opportunity

It is the policy of F5 to provide equal employment opportunities to all employees and employment applicants without regard to unlawful considerations of race religion color national origin sex sexual orientation gender identity or expression age sensory physical or mental disability marital status veteran or military status genetic information or any other classification protected by applicable local state or federal laws. This policy applies to all aspects of employment including but not limited to hiring job assignment compensation promotion benefits training discipline and termination. F5 offers a variety of reasonable accommodations for candidates. Requesting an accommodation is completely voluntary. F5 will assess the need for accommodations in the application process separately from those that may be needed to perform the job. Request by contacting .


Required Experience:

IC

At F5 we strive to bring a better digital world to life. Our teams empower organizations across the globe to create secure and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity from protecting consumers from fraud to enabling companies ...
View more view more

About Company

Company Logo

F5 application services ensure that applications are always secure and perform the way they should—in any environment and on any device.

View Profile View Profile