The Emerging Devices organization is looking for a Data Engineer that has a deep understanding of the full lifecycle from data generation to the end-user application. Were looking for a leader to design implement and maintain analytical and ML Ops infrastructure to support business-critical analytics products such as operational reporting forecasts causal analysis and operational health monitoring.
You will directly influence the success of our organization by working on critical data engineering problems building high-quality accurate and architecturally sound data pipelines that align with our business needs. You will work across diverse science/engineering/business teams acting as the business-facing subject matter expert for data storage feature instrumentation and data privacy with the responsibility of managing end-to-end execution and delivery across projects.
Key job responsibilities Design implement and support an analytical data infrastructure (S3 / Redshift) Interface with other technology teams to extract transform and load data from a wide variety of data sources using SQL and Apache Spark variants Build and maintain orchestration services for ML operations focused projects Maintain a high bar for data compliance and protecting customer privacy Collaborate with Product Manager Analysts and Business Intelligence Engineers (BIEs) to recognize and help adopt best practices in reporting and analysis: data integrity test design analysis validation and documentation. Help continually improve ongoing reporting and analysis processes automating or simplifying self-service support for customers.
About the team You will be working on the product analytics team supporting the Product and Marketing stakeholders for the Consumer Robotics organization. You will own the data engineering function for the team and partner with science and business intelligence team members to ensure they have the data infrastructure required to deliver insights and reporting to our organization.
- 5 years of data engineering experience - Experience with data modeling warehousing and building ETL pipelines - Experience with SQL - Experience in at least one modern scripting or programming language such as Python Java Scala or NodeJS - Experience mentoring team members on best practices - Knowledge of distributed systems as it pertains to data storage and computing - Experience with writing managing and optimizing spark pipelines
- Masters degree in computer science engineering analytics mathematics statistics IT or equivalent - Knowledge of professional software engineering & best practices for full software development life cycle including coding standards software architectures code reviews source control management continuous deployments testing and operational excellence - Experience with orchestration tooling (Airflow dagster AWS step functions)
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status disability or other legally protected status.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.