Job Title: Data Scientist
Job Location: Charlotte, NC (Hybrid). Local candidates only.
Project Details:
The team has implemented an internal chatbot with a bank-built interface, and users can now interact with it.
It is a RAG framework, so instead of building it into the actual applications, you can prompt it to give you pre-vectorized queries.
You are able to feed it documents even if they don't know what your team is or who you are. It should have 1,000 users by the end of the year and another 2,000 next year.
Must Haves / Required Skills:
LLMs & Inference:
Experience with major LLMs, specifically Llama 3, Mistral, and possibly Qwen.
Direct experience with vLLM (an inference engine) is a perfect match, as it is the core technology they are using to handle batched requests.
Experience with NVIDIA Triton is a big bonus, as it is a key part of their model serving infrastructure.
Core Development:
Python: A mandatory skill. They are using Python 3.12 but experience with 3.10 and above is sufficient.
Web Frameworks: Knowledge of Flask or FastAPI is required, as they are using a Python endpoint to host the LLM.
Java: A secondary preferred skill used to create the REST service that interacts with the front-end UI.
Database & Data Management:
Vector Databases: Experience with Redis and other vector databases is essential for the RAG component.
SQL: Required.
RAG Skills: The candidate needs to understand how to handle the business-side parameters from the product team and push back if they are technically infeasible. They need to be a critical thinker, not just a code-jockey.
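For candidates gauging the vector-database requirement, the retrieval step of a RAG pipeline can be sketched roughly as follows. This is a minimal illustration only, not the team's implementation: the in-memory `store`, the three-dimensional toy embeddings, and the helper names `cosine_similarity` and `top_k` are all hypothetical, and a real deployment would use a vector database such as Redis with embeddings produced by a model.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)

def top_k(query_vec, store, k=2):
    """Return the ids of the k stored documents most similar to the query."""
    scored = [(cosine_similarity(query_vec, vec), doc_id)
              for doc_id, vec in store.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:k]]

# Hypothetical toy embeddings standing in for a real vector store.
store = {
    "loan_policy": [0.9, 0.1, 0.0],
    "hr_handbook": [0.1, 0.9, 0.1],
    "gpu_runbook": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]

# The retrieved ids would then be used to pull text into the LLM prompt.
print(top_k(query, store, k=2))
```

Unlike a relational lookup, which matches rows on exact column values, this kind of search ranks documents by how close their embeddings are to the query.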
Infrastructure & Operations (MLOps):
Containers & Orchestration: Knowledge of containers and OpenShift (a Kubernetes platform) for CI/CD.
CI/CD Tools: Experience with XLR and Datical for pipeline deployments is required.
Hardware: A solid understanding of GPUs is necessary as they are the most critical and challenging component of their infrastructure.
Agile: The team uses Agile methodology.
Scaling: The project's growth is tied to hardware availability. The initial deployment will be capped at 1,000 users, and scaling will only happen with more budget, so efficient resource management is important.
Skills / Experience That Are a Plus (nice to have but not necessarily required):
Interviews are experience-based: the hiring manager likes to dig deep into the candidate's experience.
General awareness of vector databases as opposed to relational and other databases.
Experience pushing code to a controlled environment and seeing something go into production. Better if it is an AI application.
Any professional modeling experience, such as quantitative models built in the past, or having written white papers (required at BofA).
Full-time