At Roblox, we believe that careers are dynamic and should be a continuous progression through challenges and opportunities that build new skills and strengths.
As part of our commitment to fostering growth and development, we encourage you to learn more about this opportunity and apply if you see a potential fit!
The Roblox Storage team plays a fundamental role in enabling the company's success by designing and running highly scalable and secure data storage systems across geo-regions globally. As a Software Engineer on the team, you will lead the development of next-generation data security and availability architecture, designing distributed software and tools to manage storage systems that support exabyte-scale data and handle hundreds of millions of transactions per second. Additionally, this role is highly multi-functional, requiring close collaboration with analytics, security, and product teams to understand customer requirements and develop integrated solutions.
You will:
- Partner with Security, Product, and Engineering teams to collect requirements and to drive and influence the strategy that defines data security for all Roblox storage systems, including OLTP databases, Object store, Queue, Search, etc.
- Have a leading role in designing, implementing, and running our storage Infra-as-a-Service offerings, particularly hardening the data security and availability aspects.
- Improve & scale our large distributed 24x7 services and deliver features with urgency, cost efficiency, zero downtime, and high reliability
- Design and build frameworks or tools to automate development, testing, deployment, management, and monitoring of mission-critical services
- Collaborate with partner teams, producing project work plans, measurable metrics, delivery milestones, rollout plans, on-call alerts, and runbooks while leveraging the existing technology stack
- Pay close attention to writing high-quality & reusable code, and keep development moving continuously without compromising site reliability
- Improve the SLAs of the services we offer and the end-to-end rollout time of our suite of software solutions
You have:
- Strong interest in designing & delivering large-scale distributed systems handling millions of real-time requests per second.
- Data management knowledge in one or more of the following technologies is a plus: RDBMS (CockroachDB, SQL Server, Postgres, MySQL, RDB), Caching (Redis), Kafka, KV store (DynamoDB, Cassandra), OLAP (ClickHouse), Object Storage (Ceph)
- Experience building deployment pipelines on top of container orchestrators like Kubernetes or Nomad and service discovery systems like Consul
- Experience with programming languages like Rust, Go, Java, or C
- Scripting and test automation abilities
- Experience with telemetry stacks like Grafana, Prometheus monitoring, AlertManager, and Kibana
- BS degree (or equivalent professional experience) in Computer Science, and 1-3 years of hands-on experience