drjobs FLEX Senior Systems Engineer - SRE

FLEX Senior Systems Engineer - SRE

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Bethesda, MD - USA

Hourly Salary drjobs

$ 52 - 88

Vacancy

1 Vacancy

Job Description

Description

The Senior Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability scalability and performance of mission-critical cloud and on-prem services that support millions of Marriot customers globally. This role involves overseeing incident management driving automation efforts and working closely with cross-functional teams to ensure alignment between SRE strategy and business objectives. Partners closely with Product Teams Applications teams Infrastructure and the broader Applications and Infrastructure Delivery teams to develop key metrics and KPIs to improve applications stability availability and performance. The ideal candidate will bring strong communication skills collaborating with key stakeholders across the company to optimize cloud infrastructure and uphold the highest standards of operational excellence in a dynamic fast-paced environment.

Job Responsibilities:

  • Ensure the reliability availability and performance of mission-critical cloud services implementing best practices for monitoring alerting and incident management.
  • Oversee the management of high-severity incidents driving quick resolution and post-incident analysis to identify root causes and prevent recurrence.
  • Drive the automation of operational processes and ensure systems can scale effectively to support growing user demand optimizing cloud and on-prem infrastructure and resource usage.
  • Develop and execute the SRE strategy aligned with business goals and communicate service health reliability and performance metrics to senior leadership and stakeholders

Drive Applications Performance Management and Monitoring:

  • Assess application architectures to identify key monitoring points
  • Identify Key Performance Indicators apply monitoring and report out on compliance.
  • Gather information to develop reporting metrics and KPIs
  • Ensure that all applications adhere to appropriate monitoring standards based on their technology/business process
  • Determine forums and cadence to provide regular monitoring updates

Building Successful Relationships:

  • Collaborates with Enterprise Application and Architecture and Infrastructure teams to continuously improve processes and procedures.
  • Liaises with vendors and Service Providers to select services and tools that best meet company goals

Managing Projects and Priorities:

  • Functions as a strategic senior technical expert within the department.
  • Develops specific goals and plans to prioritize organize and accomplish work.
  • Champions leaders vision for product and service delivery.
  • Makes and executes the necessary decisions to keep moving forward toward achievement of goals.
  • Determines priorities schedules plans and necessary resources to promote completion of any projects on schedule.
  • Generates and provides accurate and timely results in the form of reports presentations etc.
  • Plans develops implements and evaluates the quality of operations.

Delivering on the Needs of Key Stakeholders:

  • Understands and meets the needs of key stakeholders.
  • Communicates concepts in a clear and persuasive manner that is easy to understand.
  • Demonstrates an understanding of business priorities.
  • Supports achievement of performance goals budget goals team goals etc.

Providing Technical Support and Consultation:

  • Provides technical expertise and technical leadership within own and other teams.
  • Provides recommendations to improve the effectiveness of processes and programs.
  • Demonstrates advanced knowledge of job-relevant issues products systems and processes.
  • Demonstrates advanced knowledge of function-specific procedures.
  • Applies knowledge/judgment to achieve business goals.
  • Foresees identifies and resolves problems.
  • Keeps up-to-date technically and applies new knowledge to job.
  • Performs other reasonable duties as required for this position.

Skill and Experience:

  • 10-12 years experience in information technology process and/or technical project management including:
  • 5 years of experience as a Site Reliability Engineer (SRE) building and managing highly available and mission critical systems
  • Deep understanding of SRE practices such as Service Level Objectives Error Budgets Toil Management Observability & Monitoring Blameless Postmortems Incident Response Process Capacity Planning
  • Deep knowledge of public cloud platforms (AWS Azure etc) and cloud-native services with 5 years of hands-on experience in designing implementing and maintaining highly available scalable and secure infrastructure and services.
  • Proven automation and programming experience in one or more of the following languages: Java Python Go Perl Bash PowerShell
  • Strong working knowledge of modern continuous development techniques and pipelines (Agile Kanban Jira CI/CD Helm Jenkins Git Artifactory Vault)
  • Production level expertise with containerization orchestration engines such as Kubernetes (EKS AKS ACK)
  • Experience with Infrastructure as Code (Iac) tools like Terraform and CloudFormation.
  • Experience with configuration management and automation tools such as Ansible.
  • Strong hands-on experience with Linux(RHEL Ubuntu CentOS) and Windows administration.
  • Solid understanding of Virtualization Technologies (VMware vSphere KVM etc)
  • Good hands-on experience of Windows Failover Clustering and Linux HA solutions such as Pacemaker Red Hat Cluster Suite etc
  • Deep understanding and/or experience with Cloud Native Relational and NoSQL databases like RDS MySQL PostgreSQL Cassandra or Couchbase
  • Experience with Monitoring and Observability tools such as Dynatrace Splunk Prometheus Grafana etc ensuring visibility into service health.
  • Experience with deploying monitoring and troubleshooting large-scale distributed applications in cloud environments such as AWS
  • Experience in vulnerability assessment patching security compliance of infrastructure applications and databases
  • Experience in implementing OS and cloud hardening guidelines and perform regular vulnerability remediation.
  • Familiarity with security frameworks such as ISO27001 SOCII PCI-DSS and/or HIPAA
  • Experience working with SaaS IaaS and PaaS offerings
  • Ability to work with global teams located in US and India
  • 8 years experience in a technical discipline role with experience in planning implementing and evaluating processes systems and/or initiatives
  • Broad technical acumen across multiple disciplines applications with a solid understanding of current technologies
  • Experience applying measurement processes/methods for assessing program outputs and outcomes or progress toward goals and objectives.
  • Extremely high level of analytical ability with complex problems
  • Ability to work across organizational boundaries to help lead and influence change
  • Ability to command the process across all levels to ensure customer focus; including being assertive and self-starting
  • Demonstrated leadership experience in influence and garnering alignment from external organizations
  • Ability to align change management strategies with projects
  • Skilled in conceptualizing creative solutions documenting them and presenting/selling them to senior management
  • Very high level of interpersonal skills to work effectively with others motivate employees and elicit work output in a team environment

Education and Certifications: Undergraduate degree in Computer Science or related technical field or equivalent experience/certification

The pay range for this position is $52.06 to $88.99 per hour.

FLEX opportunities offer coverage for medical dental vision health care flexible spending account dependent care flexible spending account life insurance disability insurance accident insurance adoption expense reimbursements paid parental leave 401(k) plan stock purchase plan discounts at Marriott properties commuter benefits employee assistance plan and childcare discounts. Benefits are subject to terms and conditions which may include rules regarding eligibility enrollment waiting period contribution benefit limits election changes benefit exclusions and others.

Marriott HQ is committed to a hybrid work environment that enables associates to Be connected. Headquarters-based positions are considered hybrid for candidates within a commuting distance to Bethesda MD.

At Marriott International we are dedicated to being an equal opportunity employer welcoming all and providing access to opportunity. We actively foster an environment where the unique backgrounds of our associates are valued and greatest strength lies in the rich blend of culture talent and experiences of our are committed to non-discrimination on any protected basis including disability veteran status or other basis protected by applicable law.




Required Experience:

Senior IC

Employment Type

Full-Time

Company Industry

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.