This 6-month internship focuses on designing and implementing a robust data pipeline from raw data ingestion to clean structured datasets and visualisation. This will enable the R&D team to improve algorithms and explore predictive analysis using machine learning.
We have a database with the raw data from our users pumping or training sessions. Currently we analyse specific points of interest by opening session graphs in this database manually and coming to a conclusion. The aim of this internship is to create the connection between our database and Big Query to import users raw data and aggregate it to make analysis automatic large-scale faster modifiable and always up to date.
Expected outputs
Scalable and documented data collection and storage pipeline
Automated data cleaning and preprocessing scripts
Centralized dashboards for visualization
Example datasets and analysis notebooks for core projects
Recommendations for future predictive or AI-based studies
Work with Big Query and Metabase
Required Experience:
Intern
Perifit is the health technology brand for all womankind. From pelvic floors to breastfeeding, and beyond, we’re here to help.