Amazon Music is awash in data! To help make sense of it all, the Data Authority and Warehouse team enables repeatable, easy, in depth analysis of music customer behaviors. We reduce the cost in time and effort of analysis, data set building, model building, and user segmentation. Our goal is to empower all teams at Amazon to make data driven decisions and effectively measure their results by providing high quality, highly available data, and democratizing data access through self-service tools.
If you love the challenges that come with big data then this role is for you. We collect billions of events a day, manage hundreds of terabytes of data on Redshift, Oracle, and S3, and develop data pipelines using SQL based ETL, Java/Scala Spark-EMR, and Java services. And we are just getting started!
You are a talented, enthusiastic, and detail-oriented Data Engineer who wants to take on big challenges in an agile way. Duties include data modeling and information design, ETL on Redshift using SQL, ETL using python/scala/java MR jobs on Spark/EMR, fact and dimension development and maintenance, and Tableau setup, administration, and development. With the launch of Amazon Music Unlimited and the continued growth of Amazon Prime Music, providing high quality data to our internal customers has a direct impact on the quality of our customer experience.
We focus our development on core platform features our internal customers need and deal in AWS technologies like Redshift, S3, EMR, EC2, DynamoDB, and Kinesis Firehose for the rest. In 2017 this team will help design and deliver the next generation big data platform, and roll out the Data Authority service for master data management. If these sound like the right kind of challenges for you, please apply today!