Job Title: Generative AI Engineer
Location: Orlando FL
Domain: Healthcare
Duration: Long Term Contract
Looking for W2 Candidates. No C2C
Key Responsibilities:
- Design build and fine-tune LLMs and multimodal generative models using state-of-the-art architectures.
- Lead the development of generative AI applications including chatbots content generation tools summarizers and automated document processors.
- Integrate external tools and APIs using agentic workflows (CrewAI LangChain agents) to build reasoning-capable AI solutions.
- Optimize model inference using quantization distillation and hardware-specific acceleration (ONNX Triton).
- Ensure secure ethical and compliant AI use through robust evaluation pipelines and fairness metrics.
Required Experience:
- Experience in AI/ML with focused on generative AI or large language models.
- Solid understanding of transformer-based architectures and vector similarity search.
- Experience deploying LLMs (e.g. GPT LLaMA Falcon Claude) in production settings.
- Proven record of using embedding models and retrieval-augmented generation (RAG) to improve contextual accuracy.
Technical Proficiencies:
- Proficient in Python with strong experience in PyTorch Transformers and LangChain.
- Hands-on experience with OpenAI Azure OpenAI Anthropic Claude and other LLM providers.
- Familiarity with vector databases (Pinecone FAISS) orchestration tools and scalable inference.
Soft Skills:
- Strong communication and leadership qualities.
- Proven ability to collaborate with cross-functional engineering and product teams.
- Comfortable working in fast-paced iterative environments.
Best Regards:
Tanuja P
Phone: 1-
Email: