Senior Manager, Site Reliability Engineering

Oracle


Job Location:

Reston, VA - USA

Yearly Salary: $118,300 - $251,600
Posted on: 13 days ago
Vacancies: 1 Vacancy

Job Summary

Description

Capacity Ingestion and Management:

- Supports team members designing and architecting infrastructure and/or services, sharing guidance on practices and terms for reliability and functionality.

- Supervises team members and provides direction to ensure accurate forecasting of demands for infrastructure and response to capacity needs, ensuring systems have sufficient resources to handle current and future workloads, and identifying resource gaps.

- Maintains a collaborative relationship with the software development team to develop infrastructures, ensuring features are reliable and scalable according to deployment requirements.

- Implements expectations for identifying opportunities for prototyping and manages prototyping initiatives (e.g., testing new applications or infrastructures, assisting in onboarding) to explore novel approaches.

Incident and Service Lifecycle Management:

- Monitors data collection, triage, technical analysis, and redirection, ensuring team members maintain and optimize operations and infrastructure reliability.

- Provides support to team members monitoring services, ensuring they maintain up-to-date knowledge of performance and document their condition.

- Leverages advanced knowledge to aid team members in performing incident response, root cause analyses, and/or maintenance on assigned services (e.g., software installs, version upgrades, security updates, backup and recovery).

- Monitors comprehensive health and performance reporting and ensures team members take appropriate actions based on trends in data.

- Ensures team members adhere to procedures when performing provisioning to support infrastructure, applications, and services.

- Encourages team members to experiment with new approaches for, and perform, decommissioning (e.g., shutting down servers, removing data from databases) to remove objects that are no longer needed.

Automation:

- Implements standards for identifying and recommending opportunities for automation and assesses potential benefits to enhance operational efficiency.

- Takes a proactive role in reviewing and offering feedback on the design of automation tools or scripts, acting as a leader during implementation.

- Shares strategies for conducting testing on automations to ensure they perform tasks correctly and produce expected results.

Technical Communication and Guidance:

- Reviews and provides feedback on release notes and ensures team members communicate comprehensive information about the scale, capacity, security, performance attributes, and requirements of services and technology with customers and immediate and related teams.

- Proactively anticipates and articulates the potential impact of infrastructure, feature, and tool changes, considering their effect across team operations.

- Serves as a resource to team members on what information to communicate and how to communicate.

Troubleshooting and Resolution:

- Serves as a senior management escalation point for incidents and complex issues arising within Oracle services.

- Monitors the resolution of technical issues spanning multiple services, ensuring effective investigation and debugging techniques are leveraged to achieve service level objectives (SLOs).

- Shares expectations for documenting incidents and performing root cause analyses, guiding team members to capture essential information for analysis and future reference.

- Implements guidelines for post-mortem procedures to prevent incident recurrence.

- Ensures team members adhere to service level agreements (SLAs) made with customers.

Innovation and Improvement:

- Sets expectations for conducting experiments and evaluating cutting-edge tools and technologies to optimize infrastructure performance and reliability, taking proactive steps to adhere to security standards.

- Manages and contributes to the prioritization of initiatives to address performance bottlenecks and improve deployments, ensuring efficient resource usage, speed, and scalability.

- Implements standards for developing and maintaining knowledge of site reliability trends and sharing valuable insights and information with team members, management, and beyond to promote innovative building, testing, deploying, and running of services.

- Leverages analyses and data from teams to contribute to business development decisions (e.g., design changes).



Responsibilities

Planning & Execution:

- Manages multiple medium- to large-scale projects or initiatives across teams, ensuring timelines, deliverables, and budgets, when applicable, are monitored and met. Provides direction to teams on project work, setting priorities and aligning with business needs. Guides teams on adjusting plans to accommodate resource or timeline changes.

Collaboration & Partnership:

- Drives cross-functional partnerships to align expectations and shared objectives across multiple teams. Coaches team members to develop strategic relationships with business leaders, stakeholders, and external partners to foster collaboration and long-term success. Promotes inclusivity by actively seeking and listening to diverse perspectives, ensuring others feel heard and respected.

Problem Solving:

- Provides direction to multiple teams on addressing complex operational and/or technical issues, and provides guidance on analyzing complex data and/or information to identify solutions. Reviews and provides insights into unresolved or critical issues, helping the team to identify potential solutions.

Continuous Learning:

- Models continuous learning to deepen expertise and stay ahead of industry trends, integrating best practices into strategic planning. Leverages feedback to drive personal and team skill improvements. Identifies skill gaps across teams, empowers team members to pursue learning and knowledge-sharing opportunities that build their expertise in new areas, and coaches them to apply learnings to advance the organization.

Continuous Improvement:

- Drives team to collaborate on, develop, and implement ideas to increase the efficiency and effectiveness of processes, protocols, and workflows within and across teams, providing oversight. Guides team to adopt new ideas for alternative approaches and methods and encourages feedback for continued improvement.

Performance and Development:

- Drives performance across teams by providing feedback and coaching in alignment with performance management processes, guidelines, and expectations. Discusses development goals with team members, shares opportunities to facilitate career development, and ensures individual goals are aligned with broader organizational goals. Develops and manages talent acquisition pipeline by leading candidate interviews, monitoring promotion eligibility, and/or orchestrating talent resources.

Minimum Job Qualifications
Education and/or Experience:
9 years of experience in software engineering, infrastructure management, or related field

OR

Bachelor's degree in Computer Science, Engineering, or related field AND 5 years of experience in software engineering, infrastructure management, or related field

OR

Master's degree in Computer Science, Engineering, or related field AND 3 years of experience in software engineering, infrastructure management, or related field

OR

Doctorate in Computer Science, Engineering, or related field AND 1 year of experience in software engineering, infrastructure management, or related field.

Job Skills:
Same skills as prior level

Automation Experience:
5 years of experience in automation.

Programming Experience:
5 years of experience in programming and/or scripting.

Preferred Job Qualifications
Education and/or Experience:
11 years of experience in software engineering, infrastructure management, or related field

OR

Bachelor's degree in Computer Science, Engineering, or related field AND 7 years of experience in software engineering, infrastructure management, or related field

OR

Master's degree in Computer Science, Engineering, or related field AND 5 years of experience in software engineering, infrastructure management, or related field

OR

Doctorate in Computer Science, Engineering, or related field AND 3 years of experience in software engineering, infrastructure management, or related field.

People Leadership / Management Experience:
2 years of experience in a leadership role with direct reports.

Budget Experience:
2 years of experience working with operating budgets and/or project financials.

Automation Experience:
7 years of experience in automation.

Programming Experience:
7 years of experience in programming and/or scripting.

The position is located in Reston, VA / Austin, TX.



Qualifications
Disclaimer:

Certain U.S.-based or U.S. customer- or client-facing roles may be required to comply with applicable requirements, such as immunization/occupational health mandates and/or drug testing requirements.

Range and benefit information provided in this posting is specific to the stated locations only.

US: Hiring Range in USD from: $118,300 to $251,600 per annum. May be eligible for bonus, equity, and compensation deferral.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions, and locations, as well as reflect Oracle's differing products, industries, and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short-term disability and long-term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner, and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

Career Level - M3





Required Experience:

Senior Manager


About Company


As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry leaders in almost every sector, and continue to thrive after 40+ years of change by operating with integrity. We know that true innovation starts when eve ...
