Senior Data Scientist
Req number:
R5797
Employment type:
Full time
Worksite flexibility:
Hybrid
Who we are
CAI is a global technology services firm with over 8500 associates worldwide and a yearly revenue of $1 billion. We have over 40 years of excellence in uniting talent and technology to power the possible for our clients colleagues and communities. As a privately held company we have the freedom and focus to do what is rightwhatever it takes. Our tailor-made solutions create lasting results across the public and commercial sectors and we are trailblazers in bringing neurodiversity to the enterprise.
Job Summary
Were searching for an experienced Senior Data Scientist who excels at statistical analysis feature engineering and end to end machine learning operations. Your primary mission will be to build and productionize demand forecasting models across thousands of SKUs while owning the full model lifecyclefrom data discovery through automated re training and performance monitoring. This is a Full-time and Hybrid position.
Job Description
What Youll Do
- Advanced ML Algorithms: Design train and evaluate supervised & unsupervised models (regression classification clustering uplift).
Apply automated hyperparameter optimization (Optuna HyperOpt) and interpretability techniques (SHAP LIME). - Data Analysis & Feature Engineering: Perform deep exploratory data analysis (EDA) to uncover patterns & anomalies.
Engineer predictive features from structured semistructured and unstructured data; manage feature stores (Feast).
Ensure data quality through rigorous validation and automated checks. - TimeSeries Forecasting (Demand): Build hierarchical intermittent and multiseasonal forecasts for thousands of SKUs.
Implement traditional (ARIMA ETS Prophet) and deeplearning (RNN/LSTM TemporalFusion Transformer) approaches.
Reconcile forecasts across product/category hierarchies; quantify accuracy (MAPE WAPE) and bias. - MLOps & Model Lifecycle: Establish model tracking & registry (MLflow SageMaker Model Registry).
Develop CI/CD pipelines for automated retraining validation and deployment (Airflow Kubeflow GitHub Actions).
Monitor data & concept drift; trigger retuning or rollback as needed. - Statistical Analysis & Experimentation: Design and analyze A/B tests causal inference studies and Bayesian experiments.
Provide statisticallygrounded insights and recommendations to stakeholders. - Collaboration & Leadership: Translate business objectives into datadriven solutions; present findings to exec & nontech audiences.
Mentor junior data scientists review code/notebooks and champion best practices.
What Youll Need
- M.S. in Statistics (preferred) or related field such as Applied Mathematics Computer Science Data Science.
- 5 years building and deploying ML models in production.
- Expertlevel proficiency in Python (Pandas NumPy SciPy scikitlearn) SQL and Git.
- Demonstrated success delivering largescale demandforecasting or timeseries solutions.
- Handson experience with MLOps tools (MLflow Kubeflow SageMaker Airflow) for model tracking and automated retraining.
- Solid grounding in statistical inference hypothesis testing and experimental design.
- Experience in supplychain retail or manufacturing domains with highgranularity SKU data.
- Familiarity with distributed data frameworks (Spark Dask) and cloud data warehouses (Big Query Snowflake).
- Knowledge of deeplearning libraries (PyTorch TensorFlow) and probabilistic programming (PyMC Stan).
- Strong datavisualization skills (Plotly Dash Tableau) for storytelling and insight communication.
Physical Demands
- This role involves mostly sedentary work with occasional movement around the office to attend meetings etc.
- Ability to perform repetitive tasks on a computer using a mouse keyboard and monitor.
Reasonable accommodation statement
If you require a reasonable accommodation in completing this application interviewing completing any pre-employment testing or otherwise participating in the employment selection process please direct your inquiries to or (888).
Required Experience:
Senior IC