Sr. Site Reliability Engineer
OrangePeople

Employer Active

1 Vacancy
The job posting is outdated and position may be filled

Job Location

others - USA

Monthly Salary

Not Disclosed

Job Description

Req ID : 1734472
The Sr. SRE in the Ref. Arch. team creates solutions that streamline delivery, improve quality, and reduce support costs for our applications. We make it easier to deliver high-quality apps. You should be confident designing and engineering your own solutions, knowing that there is more than one way to solve a problem, and working through which path to take on your projects. We are looking for people who are engineers at heart. Willingness to learn new technologies and help improve our organization is essential for success on this team: one day you could be working on a deployment pipeline, the next implementing a containerized build system, and the day after triaging an automation issue in production.
Expectations:
  • Experience with multiple operating systems, including OS performance monitoring, setup, configuration, tuning, and troubleshooting.
  • Experience with at least one web server and application server technology, including setup, configuration, performance monitoring, tuning, clustering, and debugging.
  • Experience with configuration management tools such as Chef or Puppet.
  • Experience with build and automation tools such as Rundeck and Jenkins.
  • Working experience with one or more load balancer platforms, including setting up pools, VIPs, and layer 7 routing, and debugging (F5 LTM preferred).
  • Design, test, and implement basic automation workflows for deployments and operational activities.
  • Participate in a 24x7x365 on-call rotation.
  • Able to use scripts and tools built by others, including the ability to troubleshoot or debug issues with these tools.
  • Able to interpret error messages from scripts, tools and applications to identify root cause.
  • Ability to author and update moderately complex scripts to automate repeatable production tasks (using scripting languages like Bash, ksh, Perl, PowerShell), with advanced skills in at least one programming language (e.g. Python, Ruby, Java, C#).
  • Demonstrates exceptional troubleshooting methodology, including the ability to author new methodologies and teach them to the Systems Engineering team.
  • Demonstrates the ability to independently triage complex incidents.
  • Independently resolve moderately to highly complex system and application incidents.
  • Able to identify and propose system and application fixes for performance bottlenecks.
  • Able to evaluate new application requirements for capacity and run-time best practices.
  • Able to evaluate new system and/or infrastructure solutions for technical feasibility against known requirements and standards.
  • Comfortable presenting issues concisely to management as well as peers, both in writing and verbally.
  • Able to receive feedback in a constructive manner and consistently apply it to tasks.
  • Able to create system and production documentation, adhering to organization standards.
  • Evaluate technology solutions through research and lab work to select tools to solve a problem.
  • Engage with our customers to hear their needs, collect feedback, and feed that back into tangible solutions.
  • We are a service provider, so being receptive to feedback and to ways to improve is essential.
  • Ensure the solutions you develop are valuable and being leveraged by the organization.
  • Ensure deliverables across engineering teams are of high quality and clearly documented.
  • Challenge the status quo through intellectual curiosity and natural inquisitiveness to look beyond the obvious for continuous improvement opportunities backed with factual arguments.
  • Be able to work collaboratively with local and remote team members.
  • Have ownership over your project and provide appropriate status to leadership on progress and key decisions.
  • Provide thought leadership and apply problem-solving and analytical skills to resolve difficult production issues impeding the availability and performance of applications.
Basic Qualifications:
  • 3-5+ years of experience supporting and/or deploying web-based products or services.
  • Strong interpersonal, organizational, and communication skills.
  • Able to resolve matters/issues in a positive manner.
  • Understand basic application design and dependencies for the applications the team supports.
  • Able to create concise and accurate documentation for Level 1 and 2 staff for the resolution of simple to moderate incidents/issues.
  • Demonstrated inclusive leadership that embraces diversity.
  • Ability to successfully operate in a highly matrixed organizational system where partnership and influence are key drivers of success.
  • Has shown the ability to initiate change and act with integrity when tough decisions have to be made.
  • Demonstrated experience with software development lifecycle methodologies such as Agile/Scrum and Waterfall.
  • Proven experience with system analysis and design, development, and testing.
  • Demonstrated strong analytical and problem-solving skills to achieve business results.
  • Ability to manage and prioritize multiple projects simultaneously.
  • Excellent organizational, communication and time management skills.
Must Haves:
  • Strong proficiency with networking and cloud network architecture, including HTTP, TCP/IP, DNS (AWS Route 53), subnetting, VPC, gateways, and firewalls, and the ability to formulate architectures and migration plans related to cloud networking.
  • The ability to create a cloud architecture from greenfield through production, going through proof of concept, documentation, and building the entire lifecycle of an architecture.
Preferred Qualifications:
  • Demonstrated release management experience with large, multi-platform implementations and technology stacks.
  • Demonstrated experience with cloud deployment (e.g. AWS, Microsoft Azure).
  • Demonstrated experience with container technologies (e.g. Docker, Kubernetes).
Preferred Education:
  • Related equivalent experience
Required Education:
  • Bachelor's Degree in Computer Science or IS
Additional Responsibilities:
  • Participate in OrangePeople monthly team meetings and team-building efforts.
  • Contribute to OrangePeople technical discussions, peer reviews, etc.
  • Contribute content and collaborate via the OP-Wiki/Knowledge Base.
  • Provide status reports to OP Account Management as requested.
About us:
OrangePeople is an Enterprise Architecture and Project Management solutions company. Our most valuable asset is our people: dynamic, creative thinkers who are passionate about doing quality work. As a member of the OrangePeople team, you will have access to industry-leading consulting practices, strategies & technologies, and innovative training & education. An ideal OrangePeople Person is a technology leader with a proven track record of technical achievements and a strong process/methodology orientation.

Employment Type

Full Time

About Company

100 employees