Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailWe are looking for a Senior Data Scientist with experience in AIdriven document processing OCR and generative AI applications. This role requires handson expertise in designing training and deploying machine learning models as well as operationalizing AI agents within production systems. You will be working closely with LLM frameworks orchestration tools and AWS infrastructure to build scalable intelligent AI agents to assist XOi users executing a variety of workflows.
Data Analysis & Pattern Discovery: Analyze large complex and multimodal datasets to uncover insights that drive AI model development and business strategy.
Model Development: Design develop and optimize stateoftheart LLMbased and NLP models using deep learning frameworks like PyTorch and TensorFlow.
OCR & NLP Solutions: Build and implement OCR and NLP solutions to extract process and analyze textual information from various document types in realworld enterprise settings.
Machine Learning Pipelines: Develop and manage endtoend machine learning pipelines for data ingestion preprocessing model training evaluation and deployment with CI/CD and MLOps best practices.
AI Agent Development: Implement AI agents using LangChain LangSmith MCP and other tools to automate diagnosis symptomissueresolution identification summarization and task optimization.
LLM FineTuning & Deployment: Orchestrate and integrate large language models via AWS Bedrock SageMaker and other cloudnative services for providing intelligent assistance to users executing domainspecific workflows.
Experimentation & Optimization: Conduct experiments to finetune model parameters evaluate different architectures and optimize performance for specific use cases like document processing accuracy entity recognition and semantic understanding.
Prompt Engineering: Design prompt templates and implement prompt chaining for multistep agents and leverage frameworks such as MCP in targeted retrievalaugmented generation (RAG) pipelines.
ThirdParty & API Integration: Collaborate on integrating opensource models thirdparty APIs and proprietary tools into enterprise production systems.
Collaboration & CrossFunctional Work: Partner with software engineers data engineers and product managers to deploy AI models into scalable reliable production applications.
Research & Continuous Learning: Stay at the forefront of advancements in LLMs OCR NLP and AI continuously evaluating new tools frameworks and techniques for adoption.
Mentorship: Provide technical leadership guidance and mentorship to junior data scientists and machine learning engineers within the team.
Masters or PhD in Computer Science Data Science Machine Learning or a related technical field.
2 years of professional experience in data science or machine learning with a focus on LLMs OCR and NLP applications.
Experience applying NLP and computer vision techniques to largescale enterprise document processing tasks including data extraction classification and workflow automation.
Proficiency in Python and ML libraries such as Scikitlearn TensorFlow or PyTorch.
Strong handson experience with OCR frameworks (Tesseract AWS Textract or similar).
Direct experience working with LangChain LangSmith or other LLM orchestration tools and exposure to providing MCPbased data contexts for LLMs.
Skilled in deploying ML models via AWS SageMaker Bedrock or other cloudnative services.
Solid grasp of LLM prompting techniques vector databases agentdriven AI architectures and RAG pipelines.
Familiarity with MLOps principles reproducibility practices model governance and CI/CD for AI systems.
Experience integrating generative AI solutions (Gemini OpenAI Anthropic Mistral) into document workflows is highly desirable.
Knowledge of model finetuning transfer learning and multimodal data processing preferred.
Thrive in a fastpaced environment and are driven by a passion to excel.
Value collaboration and believe in the power of teamwork.
Possess strong analytical and organizational skills with a focus on delivering highquality results on time and within budget.
Are passionate about mentoring and sharing your expertise with other team members.
Believe in a teamoriented approach to success and are committed to serving our customers.
Show deep curiosity about the ever changing landscape of AI and Data Science
XOi offers a comprehensive benefits package that includes medical dental and health with eligibility the 1st of the next month following your date of hire
XOi offers a 401(k) with eligibility at 90 days
XOi offers Discretionary Time Off
All new employees receive a one time $500 New Hire Stipend to support any updates needed for your home office space
All employees receive a $50/monthly stipend to be used for personal wellness and $50/ monthly stipend towards internet expenses
Required Experience:
Senior IC
Full-Time