Scraper Developer, Online Brand Protection
Department:
Job Summary
Position Overview:
We are looking for a Scraper Developer to join our Online Brand Protection this role you will be responsible for maintaining the performance of our web scraping systems and expanding our data collection capabilities across marketplaces websites and webshops domain registries and social media platforms. The data you gather will directly support the analysis and enforcement efforts that safeguard our clients brands online.
This is a hands-on engineering role where you will primarily focus on building maintaining and enhancing scrapers including handling challenges related to anti-bot defenses proxies and browser fingerprinting. You will collaborate closely with developers data scientists and analysts who depend on the data you deliver. You will work within a modern tech stack that includes Python Playwright Selenium Google Cloud Firebase BigQuery SQL and a growing set of anti-detection libraries as part of a small focused team reporting to the Technical Lead and working closely with cross-functional stakeholders.
Youll love this job if you enjoy:
- Maintaining and enhancing existing scrapers across marketplaces (Amazon Alibaba and similar) websites and webshops domain and WHOIS sources and social media platforms.
Building new scrapers for uncovered sourcesfrom initial site analysis to full production deployment.
Ensuring scraper reliability against anti-bot defenses using techniques such as rotating proxies browser fingerprinting request pacing and header/cookie management.
Monitoring scraper health diagnosing failures and quickly resolving breakages due to site changes.
Structuring and storing collected data effectively to ensure downstream pipelines receive clean complete and well-organized data.
Running scrapers at scale using scheduling and orchestration tools with proper retry mechanisms and queue management.
Packaging and deploying scrapers in a containerized version-controlled environment.
Writing clean maintainable code and actively participating in code reviewsboth giving and receiving.
Documenting scrapers to ensure they are easy to run debug and extend by other team members.
What youll need to be successful:
- Experience with Selenium alongside Playwright.
Experience with CAPTCHA-solving approaches.
Experience working with Google Cloud and virtual machines.
Experience scraping marketplaces or social media platforms.
Experience reverse-engineering mobile or web APIs.
Familiarity with browser fingerprinting libraries (e.g. undetected-chromedriver playwright-stealth or similar).
Exposure to Java or TypeScript is beneficial as the platform evolves.
Qualifications Required:
- 3 years of relevant experience in web scraping or a similar domain.
Strong Python experience including real-world application of Playwright.
Hands-on experience with scraping at scale against sites with active anti-scraping mechanisms (beyond static HTML).
Solid understanding of HTTP HTML DOM browser developer tools and modern data-loading techniques (XHR fetch dynamic rendering).
Experience managing proxies request headers cookies and maintaining scraper performance against anti-bot defenses.
Proficiency with Git and experience working in containerized environments such as Docker.
Fluent in written and verbal English.
Anaqua Inc. is a premium provider of integrated intellectual property (IP) management technology solutions and services. Anaquas AQX platform combines best practice workflows with big data analytics and tech-enabled services to create an intelligent environment designed to inform IP strategy enable IP decision-making and streamline IP operations. Today nearly half of the top 100 U.S. patent filers and global brands as well as a growing number of law firms worldwide use Anaquas solutions. Over one million IP executives attorneys paralegals administrators and innovators in large and medium-sized companies use the platform for their IP management needs. The companys global operations are headquartered in Boston with offices across the U.S. Europe and Asia. For additional information please visit or on LinkedIn.
Required Experience:
Senior IC
About Company
Unify innovation and IP docketing, prosecution, renewals, and portfolio management on a single powerful platform. Contact Anaqua today!