Site Reliability Engineer

Point72

Not Interested
Bookmark
Report This Job

profile Job Location:

Bengaluru - India

profile Monthly Salary: Not Disclosed
Posted on: Yesterday
Vacancies: 1 Vacancy

Job Summary

JOB TITLE

Site Reliability Engineer

A Career with Point72s Technology Team

As Point72 reimagines the future of investing our Technology group is constantly improving our companys IT infrastructure positioning us at the forefront of a rapidly evolving technology landscape. Were a team of experts experimenting discovering new ways to harness the power of open source solutions and embracing enterprise agile methodology. We encourage professional development to ensure you bring innovative ideas to our products while satisfying your own intellectual curiosity.

What youll do

- Design and implement automated operational workflows to improve system reliability and reduce manual intervention

- Build and maintain observability solutions using tools such as Datadog to deliver metrics monitoring alerting and dashboards

- Partner with development teams to improve application reliability deployment safety and performance through SRE best practices

- Develop and maintain CI/CD pipelines and deployment automation using Bitbucket/Jenkins GitHub Actions and related tooling

- Engineer scalable solutions for production environments across Linux and Windows systems

- Automate infrastructure and operational tasks using Python PowerShell Bash or similar scripting languages

- Support and enhance reliability of database platforms such as SQL Server and MongoDB from an SRE perspective

- Participate in incident response drive root cause analysis and implement longterm reliability improvements

- Define and enforce SLOs SLIs and error budgets in partnership with application teams

- Collaborate with Networking Platform and Security teams to ensure endtoend system reliability

- Enable selfservice and standardized operational patterns for development teams

Whats required

- Strong handson experience with Linux and Windows operating systems

- Proven experience building automation and tooling using Python or similar languages

- Deep understanding of observability and monitoring preferably with Datadog

- Experience with CI/CD pipelines and deployment automation (Bitbucket GitHub Actions Jenkins etc.)

- Operational and performance knowledge of SQL Server and MongoDB

- Familiarity with cloud platforms (AWS or similar) and hybrid architectures

- Solid understanding of networking concepts such as DNS load balancing and TCP/IP

- Experience working closely with application development teams in an SRE or DevOps role

- Experience with Kubernetes OpenShift and containerized workloads

- Knowledge of infrastructureascode tools (Terraform CloudFormation ARM)

- Experience implementing automated scaling and performance tuning

- Background in reliability engineering or DevOps in an enterprise environment

- Familiarity with security and compliance considerations in production systems

- Strong bias toward automation over manual processes

- Focus on improving longterm reliability rather than reactive firefighting

- Comfortable owning systems endtoend and driving improvements

-Clear communication skills with the ability to work effectively across engineering platform and operations teams

-Commitment to the highest ethical standards

About Point72

Point72 is a leading global alternative investment firm led by Steven A. Cohen. Building on more than 30 years of investing experience Point72 seeks to deliver superior returns for its investors through fundamental and systematic investing strategies across asset classes and geographies. We aim to attract and retain the industrys brightest talent by cultivating an investor-led culture and committing to our peoples long-term growth. For more information visithttps:// Experience:

IC

JOB TITLE Site Reliability EngineerA Career with Point72s Technology TeamAs Point72 reimagines the future of investing our Technology group is constantly improving our companys IT infrastructure positioning us at the forefront of a rapidly evolving technology landscape. Were a team of experts experi...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

We invest in Discretionary Long/Short, Macro, and Systematic strategies. We’re inventing the future of finance by revolutionizing how we develop our people and how we use data to shape our thinking. Join our team to innovate, experiment, and be the best at what you do.

View Profile View Profile