Software Engineer, Web Crawling

Exa

Job Location:

Singapore - Singapore

Yearly Salary: SGD 90,000 - 300,000
Posted on: 5 days ago
Vacancies: 1

Department:

Engineering

Job Summary

Exa is building a search engine from scratch to serve every AI agent. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to process it, and design super high-performance vector databases in Rust to search over it. If you like compute, we also own a $5M H200 GPU cluster (soon to be 5x that size) and regularly spin up batch jobs with tens of thousands of machines.

As a Web Crawler engineer, you'd be responsible for crawling the entire web. Basically, you'd build Google-scale crawling!

Who You Are

  • You have extensive experience building and scaling web crawlers, or would be excited to ramp up very quickly

  • You have experience with a high-performance language (C, Rust, etc.)

  • You are familiar with TypeScript, Playwright, modern web design, and CDP (Chrome DevTools Protocol)

  • You're comfortable optimizing a system to an exceptional degree

  • You care about the problem of finding high-quality knowledge and recognize how important this is for the world

What You Could Do

  • Build a distributed crawler that can handle 100M pages per day

  • Optimize crawl politeness and rate limiting across thousands of domains

  • Design systems to detect and handle dynamic content, JavaScript rendering, and anti-bot measures

  • Create intelligent crawl scheduling and prioritization algorithms for maximum coverage efficiency

This is an in-person opportunity in Singapore. We're happy to sponsor international candidates.

In addition to premium healthcare benefits (medical, dental, vision), we also offer fertility benefits and a monthly wellness stipend to all of our employees.


Required Experience:

IC

About Company

Real-time AI search engine with a powerful web search API, web crawling API, SERP API, and deep research tools. Search and extract structured content from websites and live data.
