Data Scientist with Python
Location: Dallas TX
Job description below.
Data Science Skills
- Deploy and maintain knowledge ingestion pipelines and integration into API-based services.
- Strong knowledge of data cleansing NLP parsing tools regex and optimization strategies for RAG.
- Experience fine-tuning and deploying LLMs.
- Stay current on the latest GenAI trends frameworks and coding practices.
Software Development Skills
- Solve complex problems by writing and testing application code developing and validating data pipelines and automating tests and deployment.
- Proficiency in designing and building in cloud environments such as Azure GCP or AWS.
- Uphold quality through DevOps pipeline unit- integration- and end-to-end testing.
Required Qualifications:
- 5 years of experience programming with Python.
- 1 years of experience in Git and version control.
- 1 years of experience with a public cloud (
- 1 years using Scikit-learn
- 1 years of Langchain experience.
- 1 years of LLM experience building RAG systems
- 1 year experience with vector databases such as MongoDB Atlas or Pinecone
- Portfolio of LLM applications and sample projects
- 2 years of NLP experience using tools such as NLTK SpaCy and Beautiful Soup
- 1 years of LLM experience building RAG systems at scale (10000 documents).
- 1 years experience in DevOps with GitHub Actions or similar CI/CD tools.
- 1 years of writing and deploying microservices
- 1 years of Kubernetes experience
- Strong code refactoring skills
- Strong debugging skills.
- Familiarity with concepts such as Agents adding memory to LLMs routing moderating content