Work Schedule
Standard (Mon-Fri)
Environmental Conditions
Office
Job Description
How will you make an impact
Thermo Fisher Scientific is seeking a Data Engineer to work with the Data Science Center of Excellence and Data Architecture team! The data platform is primarily based on Oracle Exadata database AWS Redshift and Databricks-based Delta and Unity Catalog technologies toward Lakehouse transition to enable Data Science Data Analytics Customer Analytics and Data Services for critical Application and Business enablement.
What will you do
- Design develop test deploy support enhance data integration solutions seamlessly to connect and integrate Thermo Fisher enterprise systems in our Data Science and Enterprise Data Platform.
- Innovate for data integration in Apache Spark-based Platform to ensure the technology solutions integrate modern capabilities.
- Facilitate capturing requirements and process mapping workshops review business/functional requirement documents author technical design documents testing plans and scripts.
- Assist with implementing standard operating procedures facilitate review sessions with functional owners and end-user representatives and leverage technical knowledge and expertise to drive improvements.
- Defining designing and documenting reference architecture and leading the implementation of BI and analytical solutions.
- Follow agile development methodologies to deliver solutions and product features by following DevOps practices.
Experience Knowledge Skills Abilities
- 4-year degree with major in computer science engineering (or equivalent) from an accredited university (preferred) will substitute for minimum 4-5 years professional IT experience
- Experience in ETL (Data extraction data transformation and data load processes)
- 6 years working experience in data integration and pipeline development.
- Proven track record using Databricks and Apache Spark.
- Data lake and Delta lake and Unity Catalog experience with AWS Glue and Athena.
- Demonstrated ability with AWS Cloud on data integration with Apache Spark Glue Kafka Elastic Search Lambda S3 Redshift RDS MongoDB/DynamoDB ecosystems.
- Strong real-life experience in Python development especially in pySpark in AWS Cloud environment
- Design develop test deploy maintain and improve data integration pipeline.
- Strong analytical experience with database in writing complex queries query optimization debugging user defined functions views indexes etc.
- Solid experience with source control systems including Git and Jenkins build and continuous integration tools.
- Understanding of development methodology and actual experience writing functional and technical design specifications.
- Must be willing to learn Generative AI.
Thermo Fisher Scientific is an EEO/Affirmative Action Employer and does not discriminate on the basis of race color religion sex sexual orientation gender identity national origin protected veteran status disability or any other legally protected status.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process to perform essential job functions and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Work ScheduleStandard (Mon-Fri)Environmental ConditionsOfficeJob DescriptionHow will you make an impactThermo Fisher Scientific is seeking a Data Engineer to work with the Data Science Center of Excellence and Data Architecture team! The data platform is primarily based on Oracle Exadata database AW...
Work Schedule
Standard (Mon-Fri)
Environmental Conditions
Office
Job Description
How will you make an impact
Thermo Fisher Scientific is seeking a Data Engineer to work with the Data Science Center of Excellence and Data Architecture team! The data platform is primarily based on Oracle Exadata database AWS Redshift and Databricks-based Delta and Unity Catalog technologies toward Lakehouse transition to enable Data Science Data Analytics Customer Analytics and Data Services for critical Application and Business enablement.
What will you do
- Design develop test deploy support enhance data integration solutions seamlessly to connect and integrate Thermo Fisher enterprise systems in our Data Science and Enterprise Data Platform.
- Innovate for data integration in Apache Spark-based Platform to ensure the technology solutions integrate modern capabilities.
- Facilitate capturing requirements and process mapping workshops review business/functional requirement documents author technical design documents testing plans and scripts.
- Assist with implementing standard operating procedures facilitate review sessions with functional owners and end-user representatives and leverage technical knowledge and expertise to drive improvements.
- Defining designing and documenting reference architecture and leading the implementation of BI and analytical solutions.
- Follow agile development methodologies to deliver solutions and product features by following DevOps practices.
Experience Knowledge Skills Abilities
- 4-year degree with major in computer science engineering (or equivalent) from an accredited university (preferred) will substitute for minimum 4-5 years professional IT experience
- Experience in ETL (Data extraction data transformation and data load processes)
- 6 years working experience in data integration and pipeline development.
- Proven track record using Databricks and Apache Spark.
- Data lake and Delta lake and Unity Catalog experience with AWS Glue and Athena.
- Demonstrated ability with AWS Cloud on data integration with Apache Spark Glue Kafka Elastic Search Lambda S3 Redshift RDS MongoDB/DynamoDB ecosystems.
- Strong real-life experience in Python development especially in pySpark in AWS Cloud environment
- Design develop test deploy maintain and improve data integration pipeline.
- Strong analytical experience with database in writing complex queries query optimization debugging user defined functions views indexes etc.
- Solid experience with source control systems including Git and Jenkins build and continuous integration tools.
- Understanding of development methodology and actual experience writing functional and technical design specifications.
- Must be willing to learn Generative AI.
Thermo Fisher Scientific is an EEO/Affirmative Action Employer and does not discriminate on the basis of race color religion sex sexual orientation gender identity national origin protected veteran status disability or any other legally protected status.
We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process to perform essential job functions and to receive other benefits and privileges of employment. Please contact us to request accommodation.
View more
View less