The ideal Senior Data Scientist candidate for this role will have insurance industry experience with Natural Language Processing (NLP) and perhaps Computer Vision (CV) especially applied to data extraction problems. They will be skilled in transfer learning and possess an understanding of and capability in Deep Learning (DL). They can perform work in each of those areas by leveraging a cloud environment like Databricks on Azure with Python in a notebook and IDE environment with version control leveraging git. They can navigate API based generative AI models like those from OpenAI to develop solutions. They can perform these tasks collaboratively transparently and seek to improve the skill of the team overall.
Job Responsibilities
- Deliver automated document extraction and data enrichment solutions.
- Manipulate data using Python or SQL; develop ad hoc queries to investigate data anomalies and to summarize data for pattern detection.
- Manually label and validate small training data sets.
- Develop roundtrip data flows that prepare data for IDP and format the results for analytical purposes.
- Build predictive models and analytic solutions using Python; apply NLP and machine learning techniques.
- Use AI technologies such as natural language processing to extract business data from unstructured and semistructured sources (e.g. loss history reports insurance applications claim files application scraping using OCR/ generative AI etc.).
- Create technical documentation to archive endtoend processes.
Qualifications :
- 5 years of experience as Data Scientist
- 2 years of experience for NLP
- Experience applying quantitative methods in a corporate environment
- Experience with Python from a functional programming paradigm able to manage dependencies virtual environments and version control in git
- Experience with cloud computing platforms such as Azure
- Expertise in supervised learning and unsupervised learning along with experience in deep learning and transfer learning
- Experience with generative algorithms (e.g. GAN VAE etc.) as well as foundation models (e.g. GPT4o SAM Mistral)
- Experience developing solutions from inception through deployment
Preferred Qualifications
- Graduate degree in a quantitative field
- Experience with sequential algorithms (e.g. LSTM RNN GRU etc.)
- Experience evaluating ethical implications of AI and considerations around controlling them
Additional Information :
- BS in Computer Science Information Technology degree Data Analytics or equivalent
Remote Work :
Yes
Employment Type :
Fulltime