drjobs Sr. GEN AI Engineer with RAG, LLM, and Cloud

Sr. GEN AI Engineer with RAG, LLM, and Cloud

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Costa Mesa, CA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Gen AI/Python AWS Pagemaker

Location : Preferred: Costa Mesa Dallas Pheonix

Hybrid: 3 days/week in-office preferred

End client Nam - Experian

We are looking for a Senior Software Engineer specializing in Retrieval-Augmented Generation (RAG) systems with experience in large language models (LLMs) vector databases and cloud-based microservices. Your role will focus on building integrating and optimizing LLM workflows using LangChain and managing complex infrastructure with AWS services like Lambda and ECS. Youll bring expertise in containerized environments using Docker and work with vector databases to power data-driven applications. You will report to a Staff Software Engineer and work remote in the United States or hybrid based on proximity to our office.

Youll Have Opportunity To

  • RAG Workflow Development: Design and deploy LLM-driven RAG workflows using LangChain and vector databases to provide high-accuracy data retrieval and enhanced content generation.
  • Vector Database Management: Integrate and manage vector databases like Qdrant for optimized high-speed vector searches and data retrieval.
  • Cloud Computing: Use AWS services including Lambda and ECS to build serverless architectures and scalable containerized applications.
  • API & Backend Development: Build APIs with FastAPI and Uvicorn to support low-latency interactions and handle high traffic volumes.
  • Monitoring & Observability: Implement observability best practices using Datadog ddtrace and logging tools to maintain performance and troubleshoot complex workflows.

Qualifications

Required Skills:

  • Proficiency in LLM and RAG Workflows: experience with LangChain and vector databases applying RAG techniques for intelligent data retrieval and generation.

Python Proficiency (>3.11 < 3.12): Advanced Python skills including experience with asynchronous programming.

  • Proficient in AWS environment
  • Understanding of MCP Servers

Note: Momento USA is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race color religion sex pregnancy sexual orientation gender identity national origin age protected veteran status or disability status.

Employment Type

Full-time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.