At Amazon, we believe that every day is still day one. We are working to be the most customer-centric company on earth. To get there, we need exceptionally talented, bright, and driven people.
As a Data Engineer, you will work in one of the world's largest and most complex data warehouse environments. You should be passionate about working with huge data sets and be someone who loves to bring datasets together to answer business questions. You should have deep expertise in the creation and management of datasets. You will build data analytics solutions that address increasingly complex business questions.
You should be an expert at implementing and operating stable, scalable data flow solutions from production systems into end-user-facing applications and reports. These solutions will be fault-tolerant, self-healing, and adaptive. You will develop solutions that address some of the unique challenges of space, size, and speed. You will implement data analytics using cutting-edge analytics patterns and technologies, including but not limited to various AWS offerings such as EMR, Lambda, Kinesis, and Spectrum. You will extract huge volumes of structured and unstructured data from various sources (relational, non-relational, and NoSQL databases) and message streams, and construct complex analyses. You will write scalable code and tune performance when running over billions of rows of data. You will implement data flow solutions that process data on Spark and Redshift and store it in both Redshift and file-based storage (S3) for reporting and ad hoc analysis.
You should be detail-oriented and have an aptitude for solving unstructured problems. You should be able to work in a self-directed environment, own tasks, and drive them to completion.
You should have excellent business and communication skills so that you can work with business owners to develop and define key business questions and build data sets that answer those questions. You will own the customer relationship around data and carry out the tasks that follow from that ownership, such as ensuring high data availability and low latency, documenting data details and transformations, and handling user notifications and training.
Key job responsibilities
Design and develop the pipelines required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL, Python, and AWS big data technologies.
Oversee and continually improve production operations, including optimizing data delivery, redesigning infrastructure for greater scalability, code deployments, bug fixes, and overall release management and coordination.
Establish and maintain best practices for the design, development, and support of data integration solutions, including documentation.
Work closely with Product teams, the Travel team, and Business Intelligence Engineers to explore new data sources and deliver the data.
Read, write, and debug data processing and orchestration code written in Python, Scala, etc., following coding best practices (e.g., version-controlled, code-reviewed).
3 years of data engineering experience
Experience with data modeling, warehousing, and building ETL pipelines
Experience building/operating highly available, distributed systems for data extraction, ingestion, and processing of large data sets
Experience as a Data Engineer or in a similar role
Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
Experience working on and delivering end-to-end projects independently
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit
for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $118,900/year in our lowest geographic market up to $205,600/year in our highest geographic market. Pay is based on a number of factors, including market location, and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit This position will remain posted until filled. Applicants should apply via our internal or external career site.