Senior Data Engineer, Pharma R&D
Job Summary
At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted, and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop, and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.
The Position
Data Engineer
Experience - 4 to 8 years
Location - Pune
Job description
The Senior IT Data Engineer is responsible for leading, designing, developing, and maintaining scalable and robust data pipelines and infrastructure. This role involves independently building ETL/ELT processes, optimizing data storage solutions (such as data warehouses and data lakes), ensuring data quality and reliability, and monitoring data systems. You will collaborate closely with data scientists and analysts to meet their specific data requirements, utilizing strong skills in Tableau, Snowflake, Talend, and Python or Scala, expert SQL knowledge, and proficiency in big data technologies like Spark.
The ideal candidate will possess a strong background in the pharmaceutical or biotechnology industry, with experience working within Regulatory Affairs, Clinical Operations, or Pharmacovigilance / Safety teams and a solid understanding of the end-to-end (E2E) process flow across R&D. Additionally, a proven track record of navigating the stringent requirements of GxP environments (GCP, GMP, GVP) and managing complex, cross-functional data workflows is highly desirable.
Job Responsibilities
End-to-End Pipeline Delivery: Independently leads the design, build, and maintenance of scalable data pipelines, managing specific data engineering projects autonomously from inception to deployment.
Performance Optimization & Problem Solving: Solves complex data ingestion and processing challenges, actively optimizing data flows to enhance overall system performance and reliability.
Stakeholder Alignment & Integration: Partners directly with business units and data scientists to understand data requirements, effectively bridging technical execution with non-technical business needs.
Strategic Infrastructure Impact: Owns large-scale data engineering initiatives, implementing robust strategies that significantly modernize and strengthen the organization's data infrastructure.
Complex Data Integration: Manages large, intricate data ecosystems by seamlessly integrating multiple diverse data sources to ensure efficient, secure cross-platform data flows.
Qualifications
Education / Experience
Large-Scale Data Systems Management: Demonstrated experience owning major data engineering initiatives and managing complex, high-volume enterprise data systems.
Autonomous ETL/ELT Pipeline Development: Proven track record of independently architecting, building, and maintaining automated ETL/ELT data ingestion and transformation processes.
Storage Solution Optimization: Hands-on experience designing and optimizing modern data storage environments, including data warehouses and data lakes, for peak performance and cost efficiency.
Data Quality & Reliability Assurance: Expert capability in implementing rigorous data quality checks, data cleansing rules, and reconciliation frameworks across all pipelines.
System Monitoring & Observability: Strong experience building robust monitoring, alerting, and logging systems to ensure continuous high availability and minimal downtime of data workflows.
Technical Skills
Programming & ETL/ELT Mastery: Advanced proficiency in SQL, Python, and Scala, combined with expert use of tools like Talend to build, ingest, and process complex structured and unstructured data streams.
Cloud & Big Data Architecture: Deep expertise leveraging distributed computing frameworks (Spark, Hadoop) and cloud-native data platforms (Snowflake) to manage and scale high-volume, enterprise-level data systems.
Optimization & Performance Engineering: Proven capability to solve complex data processing bottlenecks, tune analytical environments for tools like Tableau, and continuously optimize end-to-end data flows for maximum efficiency and reliability.
Good to have: AI expertise
Additional Qualifications
Pharma & GxP Compliance: Extensive experience in pharma or biotech architecting data pipelines that strictly comply with GxP frameworks (GCP/GMP/GVP), data integrity principles, and computer systems validation (CSV).
Compliant Data Delivery: Proven capability to build scalable data solutions within regulated R&D environments, aligning technical execution with critical Clinical, Regulatory, and Safety milestones.
Workflow & Data Optimization: Skilled at identifying data bottlenecks, eliminating operational silos, and optimizing fragmented workflows to ensure automated, streamlined cross-platform data transfers.
Who we are
A healthier future drives us to innovate. Together, more than 100,000 employees across the globe are dedicated to advancing science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.
Let's build a healthier future together.
Roche is an Equal Opportunity Employer.
Required Experience:
Senior IC
About Company
F. Hoffmann-La Roche AG is a Swiss multinational healthcare company that operates worldwide under two divisions: Pharmaceuticals and Diagnostics. Its holding company, Roche Holding AG, has bearer shares listed on the SIX Swiss Exchange. The company headquarters are located in Basel.