Data ScienceGen AI Specialist

Ford Motor

Not Interested
Bookmark
Report This Job

profile Job Location:

Chennai - India

profile Monthly Salary: Not Disclosed
Posted on: Yesterday
Vacancies: 1 Vacancy

Job Summary

Description

Youll be working alongside leading technical experts from all around the world on a variety of products involving Sequence/token classification QA/chatbots translation semantic/search and summarization among others.



Responsibilities

Responsibilities:

  • Design NLP/LLM/GenAI applications/products by following robust coding practices
  • Explore SoTA models/techniques so that they can be applied for automotive industry usecases
  • Conduct ML experiments to train/infer models; if need be build models that abide by memory & latency restrictions
  • Deploy REST APIs or a minimalistic UI for NLP applications using Docker and Kubernetes tools.
  • Showcase NLP/LLM/GenAI applications in the best way possible to users through web frameworks (Dash Plotly Streamlit etc.)
  • Converge multibots into super apps using LLMs with multimodalities.
  • Develop agentic workflow using Autogen Agentbuilder langgraph
  • Build modular AI/ML products that could be consumed at scale.


Qualifications

Qualifications:

Education: Bachelors or masters degree in computer science Engineering Maths or Science

Performed any modern NLP/LLM courses/open competitions is also welcomed.

Technical Requirements:

Soft Skills:

  • Strong communication skills and do excellent teamwork through Git/slack/email/call with multiple team members across geographies.

GenAI Skills:

  • Experience in LLM models like PaLM GPT4 Mistral (open-source models)
  • Work through the complete lifecycle of Gen AI model development from training and testing to deployment and performance monitoring.
  • Developing and maintaining AI pipelines with multimodalities like text image audio etc.
  • Have implemented in real-world Chat bots or conversational agents at scale handling different data sources.
  • Experience in developing Image generation/translation tools using any of the latent diffusion models like stable diffusion Instruct pix2pix.
  • Expertise in handling large scale structured and unstructured data.
  • Efficiently handled large-scale generative AI datasets and outputs.

ML/DL Skills:

  • High familiarity in the use of DL theory/practices in NLP applications
  • Comfort level to code in Huggingface LangChain Chainlit Tensorflow and/or Pytorch Scikit-learn Numpy and Pandas
  • Comfort level to use two/more of open source NLP modules like SpaCy TorchText farm-haystack and others

NLP Skills:

  • Knowledge in fundamental text data processing (like use of regex token/word analysis spelling correction/noise reduction in text segmenting noisy unfamiliar sentences/phrases at right places deriving insights from clustering etc.)
  • Have implemented in real-world BERT/or other transformer fine-tuned models (Seq classification NER or QA) from data preparation model creation and inference till deployment.

Python Project Management Skills

  • Familiarity in the use of Docker tools pipenv/conda/poetry env
  • Comfort level in following Python project management best practices (use of logging pytests relative module importssphinx docsetc.)
  • Familiarity in use of Github (clone fetch pull/pushraising issues and PR etc.)

Cloud Skills and Computing:

  • Use of GCP services like BigQuery Cloud function Cloud run Cloud Build VertexAI
  • Good working knowledge on other open-source packages to benchmark and derive summary.
  • Experience in using GPU/CPU of cloud and on-prem infrastructures.
  • Skillset to leverage cloud platform for Data Engineering Big Data and ML needs.

Deployment Skills:

  • Use of Dockers (experience in experimental docker features docker-compose etc.)
  • Familiarity with orchestration tools such as airflow Kubeflow
  • Experience in CI/CD infrastructure as code tools like terraform etc.
  • Kubernetes or any other containerization tool with experience in Helm Argoworkflow etc.
  • Ability to develop APIs with compliance ethical secure and safe AI tools.

UI:

  • Good UI skills to visualize and build better applications using Gradio Dash Streamlit React Django etc.
  • Deeper understanding of javascript css angular html etc. is a plus.

Miscellaneous Skills:

Data Engineering:

  • Skillsets to perform distributed computing (specifically parallelism and scalability in Data Processing Modeling and Inferencing through Spark Dask RapidsAI or RapidscuDF)
  • Ability to build python-based APIs (e.g.: use of FastAPIs/ Flask/ Django for APIs)
  • Experience in Elastic Search and Apache Solr is a plus vector databases.




Required Experience:

IC

DescriptionYoull be working alongside leading technical experts from all around the world on a variety of products involving Sequence/token classification QA/chatbots translation semantic/search and summarization among others.ResponsibilitiesResponsibilities:Design NLP/LLM/GenAI applications/product...
View more view more

Key Skills

  • GIS
  • Computer Data Entry
  • Facilities Management
  • ADMA
  • Fleet
  • Key Account

About Company

Company Logo

Ford® is Built for America. Discover the latest lineup in new Ford vehicles! Explore hybrid & electric vehicle options, see photos, build & price, search inventory, view pricing & incentives & see the latest technology & news happening at Ford.

View Profile View Profile