AI ML Developer

Sumeru Solutions

Not Interested
Bookmark
Report This Job

profile Job Location:

Atlanta, GA - USA

profile Monthly Salary: Not Disclosed
Posted on: 6 hours ago
Vacancies: 1 Vacancy

Job Summary

Job Responsibilities - Identity Resolution

  • Design and lead end-to-end identity resolution architecture combining probabilistic models ML and embedding-based techniques to build the authoritative customer identity graph
  • Build and optimize large-scale entity matching systems across billions of records and multiple data domains - ensuring every US adult is accurately represented in CDP
  • Architect advanced candidate generation and blocking strategies (LSH phonetic encoding semantic similarity) that balance precision with computational feasibility at population scale
  • Design high-precision matching pipelines using ensemble approaches (rules ML LLM-based validation) to maximize accuracy of golden customer profiles
  • Develop scalable clustering and graph-based approaches for unified customer identity resolution with clear confidence scoring and auditability
  • Lead implementation of embedding pipelines and similarity search systems using transformer models for semantic-level identity matching

Job Responsibilities - AI/LLM

  • Architect and build LLM-powered systems for entity resolution including zero-shot and few-shot classification workflows that handle edge cases traditional models miss
  • Design and implement RAG-based architectures for enriching and contextualizing customer data from unstructured sources
  • Lead development of NLQ-to-SQL platforms enabling business users to query CDP - the authoritative source of truth - using natural language
  • Translate ambiguous business questions into structured queries with schema awareness semantic layers and guardrails that protect data integrity
  • Define best practices for prompt engineering evaluation and LLM observability - ensuring AI outputs meet the trust standards CDP demands
  • Design and optimize vector search architectures (Pinecone Qdrant pgvector) for large-scale retrieval across customer data
  • Evaluate and integrate emerging frameworks such as LangChain LangGraph and agentic workflows where they strengthen CDP capabilities

Education and Work Experience

  • Bachelors or Masters degree in Computer Science Data Science or related field
  • 6 years of experience in ML/AI engineering
  • Proven experience building production-grade entity resolution or identity graph systems at scale
  • Experience designing LLM-based applications in enterprise environments with high accuracy and trust requirements

Technical Skills

  • Advanced programming: Python
  • Deep expertise in ML algorithms for similarity classification and clustering - particularly in identity resolution contexts
  • Strong experience with transformer models embeddings and semantic search at population scale
  • Hands-on experience with LLM APIs and orchestration frameworks
  • Strong SQL and experience with distributed data processing (Spark Dask)
  • Experience with vector databases and ANN search systems (FAISS Pinecone etc.)
  • Expertise in ML lifecycle management (MLflow or equivalent)
  • Understanding of data governance privacy and security requirements for customer identity data

Knowledge Skills and Abilities

  • Strong system design and architectural thinking for AI/ML systems at population scale
  • Ability to balance precision recall and scalability in identity resolution systems - understanding that accuracy directly impacts CDPs authority as the source of truth
  • Strong understanding of data semantics and customer domain modeling across diverse data sources
  • Leadership in driving AI engineering best practices standards and quality benchmarks
  • Ability to collaborate across data engineering product security and business teams to deliver trusted customer intelligence
Job Responsibilities - Identity Resolution Design and lead end-to-end identity resolution architecture combining probabilistic models ML and embedding-based techniques to build the authoritative customer identity graph Build and optimize large-scale entity matching systems across billions of record...
View more view more