Amazon's selection and catalog systems team is looking for talented and creative engineers for a newly created catalog enrichment project to develop algorithms and build systems to automatically solve a variety of Information Extraction and Data Mining problems related to the Amazon Product Catalog, one of the company's biggest assets.
Our goal is to enhance the customer experience by automatically detecting product features from free-text in Amazon's product catalog, and enriching the catalog with structured semantic information. We use machine learning to develop models that can extract missing data or fix inaccurate data automatically and drive efficient workflows allowing humans to apply their judgment when necessary. On day to day basis we solve problems due to sheer scale (billions of products in the catalog), diversity (all kinds of products ranging from electronics to cosmetics to music in multiple languages), hundreds of sources of data (millions of sellers selling hundreds of products each, reviews and behavioral feedback from customers visiting the Amazon website).
As a software developer you will be handling the scale of the Catalog (several billion products), building machine understanding of text (NLP/Information extraction), doing advanced feature engineering, building robust scalable and maintainable machine learning models, and ensuring extreme high quality of information we provide to Amazon customers. You will be taking pride in building complete end-to-end solutions; researching a problem, implementing the solution, building a scalable and reliable service and ensuring the service is there when you need it.
Our team is in its nascent stage and offers a creative, fast-paced, entrepreneurial work environment where you’ll be at the center of Amazon innovation.
- Push the leading edge in the area of Information Extraction (IE) by designing and testing new algorithms and techniques.
- Analyze large amounts of data to discover patterns and build models to extract valuable information from various sources (e.g. product catalog and customer reviews)
- Establish scalable, efficient, automated machine learning platforms including building tools to collect training and evaluation data.
- Owning and improving customer-facing features derived from scalable & automated IE systems.