Principal Site Reliability Engineer

Not Interested
Bookmark
Report This Job

profile Job Location:

Boston, NH - USA

profile Monthly Salary: $ 131000 - 185000
Posted on: 12 hours ago
Vacancies: 1 Vacancy

Job Summary

Our world is transforming and PTC is leading the software brings the physical and digital worlds together enabling companies to improve operations create better products and empower people in all aspects of their business.

Our people make all the difference in our success. Today we are a global team of nearly 7000 and our main objective is to create opportunities for our team members to explore learn and grow all while seeing their ideas come to life and celebrating the differences that make us who we are and the work we do possible.

Principal Site Reliability Engineer-Hybrid Boston MA

About the Role

We are looking for a Principal Site Reliability Engineer (SRE) to play a critical role in ensuring the longterm reliability scalability and operational excellence of our platform.

As a Principal SRE you will operate with a high degree of autonomy and influence. You will lead complex crossorganization reliability initiatives shape reliability strategy and serve as a technical authority and trusted advisor across engineering.

Your work will directly shape the experience of our customers by ensuring the platform is fast resilient and dependable. As a Principal SRE you will help protect customer trust by driving reliability across the entire system lifecycle.

This role is ideal for engineers who enjoy solving ambiguous highimpact problems at scale influencing system design across teams and raising the reliability bar for an entire organization.

What Youll Do

Own Reliability at Scale

  • Lead design implementation and evolution of reliability availability and resiliency strategies for largescale distributed systems written primarily in Java
  • Apply deep experience operating complex distributed systems to guide architectural decisions reliability strategies and longterm system evolution
  • Identify systemic risks in application architecture data flows and infrastructure and drive architectural improvements that measurably improve availability performance and scalability
  • Set and evolve reliability standards best practices and operational principles across R&D

Drive Operational Excellence

  • Lead efforts to prevent detect and mitigate incidents through technical improvements and operational maturity
  • Serve as a senior coordination point during major incidents helping manage response and guide longterm remediation
  • Champion blameless post-incident reviews and ensure learnings translate into durable system improvements

Reduce Toil Through Engineering

  • Apply advanced software engineering practices to eliminate manual work reduce operational load and improve system observability
  • Design and build internal platforms automation and tooling that support Javabased services and their operational needs
  • Raise the bar on monitoring alerting and SLO/SLI adoption across systems

Lead Through Influence and Collaboration

  • Partner deeply with product engineers architects and engineering leadership to ensure reliability and operability are firstclass concerns in system design
  • Review and influence designs for complex systems involving technologies such as datastores messaging systems and coordination services
  • Serve as a technical mentor and coach for SREs and other engineers raising overall engineering and operational maturity

Shape Strategy and Direction

  • Contribute to longerterm reliability and infrastructure strategy aligned with business growth
  • Stay current with industry trends in SRE distributed systems and the Java ecosystem turning insights into practical improvements
  • Help define what great reliability looks like for the organization and how we measure it

What Were Looking For

Required Experience & Expertise

  • US Citizenship or Green Card holder only for this role due to ITAR requirements.
  • Ability to commute to the Seaport Boston office 2-3 days a week.
  • 7 years of experience in software engineering site reliability engineering or systems engineering roles
  • Extremely strong proficiency with the Java programming language and its ecosystem including building debugging and operating production Java services
  • Deep experience operating complex distributed systems in production environments
  • Strong software engineering background with a track record of delivering highquality maintainable code

Technical Strength

  • Expert understanding of incident management service reliability and performance engineering
  • Strong handson experience with observability (metrics logs traces) capacity planning and SLOdriven reliability
  • Deep familiarity with modern cloudbased infrastructure CI/CD pipelines and infrastructureascode practices
  • Ability to reason about failure modes across application data and infrastructure layers

Leadership & Influence

  • Demonstrated ability to lead complex initiatives that span teams and organizational boundaries
  • Comfortable making highimpact technical decisions in ambiguous environments
  • Strong communicator who can influence design and operational decisions across a wide range of stakeholders

Mindset

  • Systems thinker focused on rootcause analysis and durable fixes
  • Calm and effective under pressure especially during highseverity incidents
  • Curious datadriven and committed to continuous improvement

Nice to Have

  • Experience operating or supporting systems using technologies such as MongoDB ZooKeeper and RabbitMQ
  • Background in performance tuning and scalability optimization of Java services
  • Experience setting or influencing engineering standards at the organization level
  • Prior involvement in evolving SRE or platform practices in a growing engineering organization
  • Experience designing operating or scaling systems in cloud environments such as AWS (preferred) including familiarity with core services networking models and reliability features

Why This Role Matters

The Principal SRE role is critical to ensuring our platform continues to meet the reliability expectations of our customers as we grow. Your work will have a broad and lasting impactnot only on individual systems but on how reliability and operational excellence are approached across engineering.

PTC carefully considers a wide varietyof factors when determining compensation. The anticipated annual salary range for this position is between $131000 - 185000. The salary range reflects a good-faith estimate of compensation at the time of posting. Actual compensation may vary based on a candidates skills qualifications experience and location. Additionally candidates may be eligible for a performance-based bonus. Employees also have the opportunity to become a PTC shareholder through our employee share purchase program (ESPP) which allows for the purchase of discounted PTC stock. Certain roles may also be eligible for participation in our equity programs. Employees may be eligible for medical dental and vision insurance paid time off and sick leave tuition reimbursement 401(k) contributions and employer match flexible spending accounts life insurance disability coverage and if you are an office-assigned employee a generous commuter subsidy. All total rewards and benefits programs are

subject to applicable plan eligibility and other terms and conditions.

For more information about PTCs comprehensive benefits please visit our Careers Page.

Applications will be accepted on an on-going basis.

At PTC we believe in the power of diverse ideas and perspectives. As a global company that values and respects all identities cultures and perspectives we strive to create an inclusive PTC for ALL through an environment where everyone feels like they belong and are empowered to bring their true authentic selves to work. Proud to be an Equal Opportunity Employer we welcome applicants from all backgrounds and hire without regard to race national origin religion age color ethnicity ancestry marital status sex (including pregnancy) sexual orientation gender identity gender expression genetic information disability veteran status or any other characteristic protected by local state or federal laws rules or regulations. It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

PTC endeavors to make accessible to all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process please contact PTCs Talent Acquisition team at This contact information is for accommodation requests only and cannot be used to inquire about the status of applications.

Life at PTC is about more than working with todays most cutting-edge technologies to transform the physical world. Its about showing up as you are and working alongside some of todays most talented industry leaders to transform the world around you.

If you share our passion for problem-solving through innovation youll likely become just as passionate about the PTC experience as we are. Are you ready to explore your next career move with us

We respect the privacy rights of individuals and are committed to handling Personal Information responsibly and in accordance with all applicable privacy and data protection laws. Review our Privacy Policy here.


Required Experience:

Staff IC

Our world is transforming and PTC is leading the software brings the physical and digital worlds together enabling companies to improve operations create better products and empower people in all aspects of their business.Our people make all the difference in our success. Today we are a global team...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

PTC accelerates digital transformation with a new generation of technology that powers the ways you engineer, manufacture, and service your physical products.

View Profile View Profile