Amazon Appstore is responsible for providing delightful customer experiences across Amazon devices (Fire TV, Tablets) with a vast selection of relevant apps, games, and services. The Appstore team is seeking an experienced Data Engineer to join our central Data Engineering and Analytics team. In this role, you will partner closely with Software Development Engineers and Business Intelligence Engineers to build high-quality data pipelines and manage the Appstore-wide central data lake. The Appstore Data Lake powers critical external customer-facing (3P developer) reporting as well as self-service analytics for internal stakeholders. This is an exciting opportunity to work on very large datasets and influence products that impact tens of millions of customers on a daily basis.
Key job responsibilities
In this role you will:
- Manage and administer the data platform built on AWS services such as EC2, RDS, Redshift, Kinesis, EMR, and Lambda.
- Design and implement end-to-end data pipelines (ETL) to ensure efficient data collection, cleansing, transformation, and storage, supporting both real-time and offline analytics needs.
- Help continually improve ongoing reporting and analysis processes, simplifying self-service support for customers.
- Contribute to Data Governance strategy for mitigating disparate data sources where applicable.
- Collaborate with cross-functional teams (e.g., Product, Operations, Engineering) to align data logic, integrate multi-source data (e.g., user behavior, transaction logs, AI outputs), and build a unified data layer.
Qualifications
- 2 years of data engineering experience
- Experience with data modeling, warehousing, and building ETL pipelines
- Experience with one or more scripting languages (e.g., Python, KornShell)
- Experience with big data processing technology (e.g., Hadoop or Apache Spark), data warehouse technical architecture, infrastructure components, ETL, and reporting/analytic tools and environments
- Bachelor's degree or above in computer science, computer engineering, or a related field
- Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
- Experience with data visualization software (e.g., AWS QuickSight or Tableau) or open-source project
- Knowledge of software engineering best practices across the development life cycle, including agile methodologies, coding standards, code reviews, source management, build processes, testing, and operations
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit
for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.