Title: Python Data Engineer
Location: Hartford, CT / NC (F2F Interview)
Duration: Long-term Contract
Key Responsibilities:
- Data Pipeline Design and Implementation: Design, develop, and maintain scalable, reliable data pipelines using Python and Azure services such as Azure Data Factory and Azure Databricks.
- Data Storage Solutions: Design and implement data storage solutions on Azure, including Azure Data Lake Storage, Azure SQL Database, and Azure Synapse Analytics, integrating Python scripts where necessary.
- Data Integration and ETL: Develop and implement ETL (Extract, Transform, Load) processes using Python and Azure services to integrate data from various sources.
- Data Quality and Validation: Ensure data quality and accuracy through testing and validation processes using Python.
- Collaboration and Communication: Collaborate with data analysts, data scientists, and other stakeholders to understand their needs and deliver data solutions, with a focus on Python-based implementations.
- Performance Monitoring and Optimization: Monitor the performance of data pipelines and infrastructure, and identify and resolve performance issues using Python-based solutions.
- Data Security and Compliance: Ensure data security and compliance with industry standards and regulations; implement encryption and security protocols using Python.
- Documentation: Maintain up-to-date documentation for data processes and pipelines, including detailed Python code documentation.
Skills and Experience:
- Programming Languages: Proficiency in Python and SQL.
- Azure Services: Experience with Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Azure Data Lake Storage, and other relevant Azure services, with a focus on integrating Python.
- Data Pipeline Tools: Experience with data pipeline tools and frameworks, especially those that support Python.
- Data Modeling and Schema Design: Experience in designing and implementing data models and schemas using Python.
- ETL Processes: Experience in developing and implementing ETL processes using Python.
- Data Quality and Validation: Knowledge of data quality and validation techniques, especially with Python.
- Collaboration and Communication: Strong communication and collaboration skills, with an emphasis on Python-based solutions.
- Problem-solving and Troubleshooting: Ability to identify and resolve technical issues using Python.
- Cloud Computing: Understanding of cloud computing concepts and technologies, with a focus on Python implementations.