REQUIREMENTS:
- Total experience: 10 years.
- Experience with LLMs (e.g. DeepSeek, LLaMA) and inference frameworks (Ollama, vLLM).
- Proficiency in OpenCV, PyTorch, YOLO, or TensorFlow, and in model conversion workflows.
- Strong experience in Docker, DevOps, and CI/CD pipeline integration.
- Programming skills in Python with solid experience in Linux and shell scripting.
- Understanding of edge AI hardware (Jetson/NXP/Qualcomm) and embedded deployment.
- Familiarity with Yocto OS and custom Linux builds.
- Strong grasp of model optimization and compression techniques.
- Experience with LangChain, AI agents, and RAG pipelines.
- Good knowledge of inference acceleration using CUDA and GPU-specific kernels.
- Excellent communication and collaboration skills.
RESPONSIBILITIES:
- Understanding functional requirements thoroughly and analyzing the client's needs in the context of the project.
- Envisioning the overall solution for defined functional and non-functional requirements, and defining the technologies, patterns, and frameworks to realize it.
- Determining and implementing design methodologies and tool sets.
- Enabling application development by coordinating requirements, schedules, and activities.
- Leading or supporting UAT and production rollouts.
- Creating, understanding, and validating the WBS and estimated effort for a given module/task, and being able to justify it.
- Addressing issues promptly and responding positively to setbacks and challenges, with a mindset of continuous improvement.
- Giving constructive feedback to the team members and setting clear expectations.
Qualifications:
Bachelor's or master's degree in computer science, information technology, or a related field.
Remote Work:
Yes
Employment Type:
Full-time