Amazon

Data Engineer-Core Machine Learning

Job ID: 525170
Location: US-WA-Seattle
Posted Date: 10/5/2017
Company: Amazon Web Services, Inc.
Position Category: Business Intelligence
Recruiting Team: ..

Job Description

Machine Learning is changing how Amazon operates, from Marketing and Advertising to Supply Chains and Data Centers. The CoreML team is a unique center of excellence that solves challenging ML problems across Amazon. Are you passionate about Big Data (at Amazon scale), Machine Learning, and Artificial Intelligence?

Big Data Processing
We produce, process, and analyze terabytes of customer-, ASIN-, or sensor-level data. We collect and process data coming from various channels such as onsite, free search, paid search, social, paid social, email, and associates. We make heavy use of AWS services such as AWS Flow, S3, EC2, and EMR (Spark).

Machine Learning
We build Machine Learning solutions that learn and improve over time as new data and validation methodologies are added. We work with both supervised and unsupervised machine learning approaches, including but not limited to regression, classification, and clustering.

Our products solve customer problems across Amazon, and adoption across businesses is accelerating. We have services that provide predictions from our models to influence multiple facets of the Amazon customer experience. Looking forward, however, our rate of innovation depends on the quality and breadth of the data we feed into these models.

We are looking for an outstanding individual who combines superb technical, communication, and analytical capabilities with a demonstrated ability to get the right things done quickly and effectively. This person must be comfortable working with a team of software development engineers to raise the bar for the data pipelines we build and maintain. Given the cross-Amazon nature of our products, the individual should be highly self-directed, with strong cross-team collaboration skills.

The ideal candidate for our team is a thinker and a doer: someone who loves algorithms and mathematical precision, but at the same time enjoys implementing real systems, and is motivated by the prospect of spectacular business returns.

Basic Qualifications

* Demonstrated ability in data modeling, ETL development, and data warehousing.
* A desire to work in a collaborative, intellectually curious environment.
* Degree in Computer Science, Engineering, Mathematics, Physics, or a related field and at least 2 years of work experience.
* Industry experience as a Data Engineer or related specialty (e.g., Software Engineer, Business Intelligence Engineer, Data Scientist) with a track record of manipulating, processing, and extracting value from large datasets.
* Experience with Oracle, Redshift, Teradata, etc.
* Experience with Big Data technologies (especially Spark).
* Coding proficiency in at least one modern programming language (e.g., Python, Java).

Preferred Qualifications

* Experience building and operating systems for data extraction, ingestion, and processing of large data sets.
* Experience building data products incrementally and integrating and managing datasets from multiple sources.
* Experience leading large-scale data warehousing and analytics projects, including the use of AWS technologies (Redshift, S3, EC2, Data Pipeline) and other big data technologies.
* Experience using machine learning and statistical tools such as Python/Pandas and R.
* Experience with tools to process large data sets.