Software Engineer (Backend, Python) Content Understanding
San Francisco, CA - USA
Department:
Job Summary
Scribd Inc. is on a mission to advance human understanding. Our four products Scribd Slideshare Everand and Fable help billions of people across the globe move beyond access and into insight application and expertise.
Culture at Scribd Inc.
We support a culture where our employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.
We believe the best work happens when individual flexibility is balanced with meaningful community connection. Scribd Flex empowers employees to choose the workstyle and location that support their best performance while committing to intentional in-person moments that strengthen collaboration and culture. Occasional in-person attendance is required for all Scribd Inc. employees regardless of location.
So what are we looking for in new team members At Scribd Inc. we hire for GRIT. Traditionally defined as the intersection of passion and perseverance toward long-term goals GRIT reflects the mindset we expect from every employee. For us it also serves as a practical framework for how we work: setting and achieving Goals delivering Results within your role contributing Innovative ideas and solutions and strengthening the broader Team through collaboration and attitude.
This posting reflects an approved open position within the organization.
About the team:
The ML Content Understanding team powers metadata extraction enrichment and content understanding across all Scribd brands. We process hundreds of millions of documents billions of images and deliver high-quality metadata to enable content discovery and trust for millions of users worldwide.
Our systems operate at massive scale supporting diverse datasets like user-generated content (UGC) ebooks audiobooks and more. We work at the intersection of machine learning data engineering and distributed systems collaborating closely with applied research and product teams to deploy scalable ML and LLM-powered solutions in production.
Role Overview:
Were seeking a Software Engineer II with strong backend development experience and a passion for solving complex data this role youll design build and optimize distributed systems that extract enrich and process metadata for a wide range of content. Youll work closely with ML engineers product managers and cross-functional partners to integrate machine learning models and LLM-based services into production pipelines and deliver impactful high-performance solutions. This role offers the opportunity to work on cutting-edge generative AI and metadata enrichment problems at a truly global scale.
Tech Stack:
Our team uses various technologies. The following are the ones that we use on a regular basis: Python Scala Ruby on Rails Airflow Databricks Spark HTTP APIs AWS (Lambda ECS SQS ElastiCache Sagemaker Cloudwatch Datadog) and Terraform.
Key Responsibilities:
Design and build scalable systems to extract enrich and process metadata from millions of documents images and audio content.
Leverage LLMs to integrate capabilities like summarization classification extraction and enrichment into metadata pipelines.
Collaborate with cross-functional teams including ML engineers and product managers to deliver scalable efficient and reliable metadata solutions.
Optimize and refactor existing systems for performance scalability and reliability.
Ensure data accuracy integrity and quality through automated validation and monitoring.
Participate in code reviews ensuring best practices are followed and maintaining high-quality standards in the codebase.
Manage and maintain data pipelines security and infrastructure
Requirements:
4 years of professional software engineering experience
Proficiency in Python Scala Ruby or similar languages
Experience designing and building distributed systems at scale
Hands-on experience building deploying and optimizing solutions using ECS EKS or AWS Lambda
Experience with infrastructure-as-code tools like Terraform (or similar)
Experience working with a public cloud provider (AWS Azure or Google Cloud)
Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads
Proven ability to test profile and optimize systems for performance scalability and reliability
Bachelors degree in Computer Science or equivalent professional experience
Bonus: Experience working with LLMs or integrating ML models into production systems
At Scribd Inc. your base pay is one part of your total compensation package and is determined within a range. Our pay ranges are based on the local cost of labor benchmarks for each specific role level and geographic location. San Francisco is our highest geographic market in the United States.
In the state of California the reasonably expected salary range is between $126000 minimum salary in our lowest geographic market within California to $196000 maximum salary in our highest geographic market within California.
In the United States outside of California the reasonably expected salary range is between $103500 minimum salary in our lowest US geographic market outside of California to $186500 maximum salary in our highest US geographic market outside of California.
In Canada the reasonably expected salary range is between $131500 CADminimum salary in our lowest geographic market to $174500 CADmaximum salary in our highest geographic market.
We carefully consider a wide range of factors when determining compensation including but not limited to experience; job-related skill sets; relevant education or training; and other business and organizational needs. The salary range listed is for the level at which this job has been the event that you are considered for a different level a higher or lower pay range would apply. This position is also eligible for a competitive equity ownership and a comprehensive and generous benefits package.
Working at Scribd Inc.
Are you currently based in a location where Scribd Inc. can employ you
Employees must have their primary residence in or near one of the following cities. This includes surrounding metro areas or locations within a typical commuting distance:
United States:
Atlanta Austin Boston Dallas Denver Chicago Houston Jacksonville Los Angeles Miami New York City Phoenix Portland Sacramento Salt Lake City San Diego San Francisco Seattle Washington D.C.
Canada:
Ottawa Toronto Vancouver
Mexico:
Mexico City
Benefits at Scribd Inc.
Scribd Flex (flexible work model)
Comprehensive health dental and vision coverage
Mental health support and disability coverage
Generous paid time off including vacation sick time holidays winter break volunteer time and sabbaticals
Paid parental leave and family support benefits
Retirement matching and employee equity
Learning and development programs and professional growth opportunities
Wellness and home office stipends
Complimentary access to the Scribd Inc. suite of products
Enterprise access to leading AI tools
Get to Know Scribd Inc.
About Scribd Inc.
Life at Scribd Inc.
We want our interview process to be accessible to everyone. You can inform us of any reasonable adjustments we can make to better accommodate your needs by emailing about the need for adjustments at any point in the interview process.
Scribd Inc. is committed to equal employment opportunity regardless of race color religion national origin gender sexual orientation age marital status veteran status disability status or any other characteristic protected by law. We encourage people of all backgrounds to apply and believe that a diversity of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.
Required Experience:
IC
About Company
Explore over 195M documents from a global community. Share information, and find inspiration on Scribd.