Distributed Systems Engineer Data Platform Analytics and Alerts

Cloudflare

Not Interested
Bookmark
Report This Job

profile Job Location:

San Francisco, CA - USA

profile Monthly Salary: Not Disclosed
Posted on: 30+ days ago
Vacancies: 1 Vacancy

Job Summary

About Us

At Cloudflare we are on a mission to help build a better Internet. Today the company runs one of the worlds largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware installing software or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network which gets smarter with every request. As a result they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazines Top Company Cultures list and ranked among the Worlds Most Innovative Companies by Fast Company.

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us!

Locations Available: London (UK) Lisbon (Portugal) Austin (US) Denver (US) Atlanta (US)

About Role

We are looking for experienced and highly motivated engineers to join our DATA Org and help build the future of data at Cloudflare. Our organisation is responsible for the entire data lifecycle - from ingestion and processing to storage and retrieval - powering the critical logs and analytics that provide our customers with real-time visibility into the health and performance of their online properties.

Our mission is to empower customers to leverage their data to drive better outcomes for their business. We build and maintain a suite of high-performance scalable systems that handle more than a billion events in a second. As an engineer in our organisation you will have the opportunity to work on complex distributed systems challenges across different parts of our data stack.

Our Data Organisation is strategically composed of several key teams each focusing on a distinct aspect of our comprehensive data platform:

  • Data Delivery / Data Pipeline: This team is responsible for the design development and operation of our distributed data delivery pipeline. This system is a high-throughput low-latency powerhouse primarily written in Go and is tasked with ingesting processing and intelligently routing massive volumes of data originating from across Cloudflares vast global network to multiple core destinations. This involves handling diverse data types and ensuring reliable timely delivery to various downstream systems.

  • Analytical Database Platform:

  • Data Retrieval (Customer-Facing Products): This department is focused on building and continuously improving our customer-facing products making data not only accessible but also genuinely actionable for our users. This department comprises two main groups:
    • Analytics and Alerts: Members of this group are at the forefront of developing our public APIs such as the GraphQL Analytics API providing customers and internal Cloudflare teams with flexible access to their data. They will also work on our alerting platform empowering users to configure and receive near real-time alerts based on the critical logs and metrics observed by our robust data platform. This includes designing intuitive alerting mechanisms and ensuring the reliability of notification systems.
    • Logs and Audit Logs: This specialised team is dedicated to building a robust and easy-to-use logging platform that powers reliable data delivery and seamless integrations with customer destinations. The teams mission is to make it simple for customers to access manage and use their log data ensuring that critical datasets including comprehensive audit logs are delivered securely and efficiently to their preferred storage and analysis platforms. The work spans developing intuitive connectors ensuring data integrity optimising delivery pipelines and upholding strict standards for compliance performance and usability.

Responsibilities

This role is focusing on the Analytics and Alerts group. As a Software Engineer you will focus on the following areas:

  • Develop and enhance our customer-facing APIs focusing on performance reliability and an intuitive user experience.
  • Design build and maintain our near real-time alerting platform from data processing and anomaly detection to reliable notification delivery.
  • Optimise the performance of complex analytical queries that power our APIs and dashboards working closely with the database platform team.
  • Create intuitive and powerful tools that allow customers to explore their data and configure meaningful alerts based on logs and metrics.
  • Scale our API and alerting infrastructure to support a growing number of internal and external use cases.
  • Collaborate with front-end engineers and product managers to define API contracts and deliver a seamless data experience for our users.
  • Ensure the operational health of our APIs and alerting systems by developing comprehensive monitoring and participating in an on-call rotation (with the flexibility to be on-call outside of standard working hours as needed).

Key Qualifications

  • 3 years of experience working in software development covering distributed systems and scalable APIs.
  • Strong programming skills (Go is preferable) with a deep understanding of software development best practices for building performant customer-facing services.
  • Hands-on experience with modern observability stacks including Prometheus Grafana and a strong understanding of handling high-cardinality metrics at scale.
  • Strong knowledge of SQL including extensive experience with complex query optimisation.
  • A solid foundation in computer science including algorithms data structures distributed systems and concurrency.
  • Strong analytical and problem-solving skills with a willingness to debug troubleshoot and learn about complex problems at high scale.
  • Ability to work collaboratively in a team environment and communicate effectively with other teams across Cloudflare.
  • Experience developing and scaling APIs particularly GraphQL is a strong plus.
  • Experience with data streaming technologies (e.g. Kafka Flink) for real-time processing is a plus.
  • Experience with Infrastructure as Code tools like SALT or Terraform is a plus.
  • Experience with Linux container technologies such as Docker and Kubernetes is a plus.

If youre passionate about building scalable and performant data platforms using cutting-edge technologies and want to work with a world-class team of engineers then we want to hear from you! Join us in our mission to help build a better internet for everyone!

What Makes Cloudflare Special

Were not just a highly ambitious large-scale technology company. Were a highly ambitious large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

Project Galileo: Since 2014 weve equipped more than 2400 journalism and civil society organizations in 111 countries with powerful tools to defend themselves against attacks that would otherwise censor their work technology already used by Cloudflares enterprise customers--at no cost.

Athenian Project: In 2017 we created the Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free so that their constituents have access to election information and voter registration. Since the project weve provided services to more than 425 local government election websites in 33 states.

1.1.1.1: We released 1.1.1.1 to help fix the foundation of the Internet by building a faster more secure and privacy-centric public DNS resolver. This is available publicly for everyone to use - it is the first consumer-focused service Cloudflare has ever released. Heres the deal - we dont store client IP addresses never ever. We will continue to abide by our privacy commitment and ensure that no user data is sold to advertisers or used to target consumers.

Sound like something youd like to be a part of Wed love to hear from you!

This position may require access to information protected under U.S. export control laws including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

Cloudflare is proud to be an equal opportunity employer. We are committed to providing equal employment opportunity for all people and place great value in both diversity and inclusiveness. All qualified applicants will be considered for employment without regard to their or any other persons perceived or actual race color religion sex gender gender identity gender expression sexual orientation national origin ancestry citizenship age physical or mental disability medical condition family care status or any other basis protected by law. We are an AA/Veterans/Disabled Employer.

Cloudflare provides reasonable accommodations to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include but are not limited to changing the application process providing documents in an alternate format using a sign language interpreter or using specialized equipment. If you require a reasonable accommodation to apply for a job please contact us via e-mail at or via mail at 101 Townsend St. San Francisco CA 94107.

About UsAt Cloudflare we are on a mission to help build a better Internet. Today the company runs one of the worlds largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and...
View more view more

Key Skills

  • Apache Hive
  • S3
  • Hadoop
  • Redshift
  • Spark
  • AWS
  • Apache Pig
  • NoSQL
  • Big Data
  • Data Warehouse
  • Kafka
  • Scala

About Company

Company Logo

Make employees, applications and networks faster and more secure everywhere, while reducing complexity and cost.

View Profile View Profile