Machine Learning Scientist, Search Experience @ Search Experience strives to understand the intent of Amazon customers and present results that are relevant to making informed purchase decisions.
Tools & Technologies: Machine Learning, Natural Language Processing, Python, Theano, AWS(EC2, S3), Hadoop, Pig, Word2Vec, Vowpal Wabbit
Data: Amazon Search Datasets
Unannounced Projects From April 2015 to Present (9 months) Greater Seattle AreaSoftware Development Engineer, X-Ray for Books @ X-Ray lets your explore the "Bones of the Book". See important passages in the book, See all the passages across a book that mention relevant entities(ideas, fictional characters, historical figures, places, or topics of interest).
Tools & Technologies:Java, AWS(S3, DynamoDB, SNS, SQS, Flow, EC2), YARN, MapReduce, Spark, Mallet, StanfordCoreNLP, OpenNLP, Word2Vec, Object Oriented Software Development, Machine Learning, Natural Language Processing, Distributed Computing.
Data: Kindle Book Content, Amazon Customer Reviews, Kindle Annotations, Goodreads datasets.
Built EC2 based YARN cluster. This cluster is capable of running MapReduce, Spark & Hive.
Shipped Notable Clips. This feature lets readers easily browse all notable passages in a book.
Shipped the key value store for X-Ray entities.
Shipped the X-ray Character merge and naming algorithms.
Founding member of the team. Played important role in shipping critical system components. From June 2011 to March 2015 (3 years 10 months) Greater Seattle AreaSoftware Development Engineering Intern, X-Ray for Books @ X-Ray lets your explore the "Bones of the Book". See important passages in the book, See all the passages across a book that mention relevant entities(ideas, fictional characters, historical figures, places, or topics of interest).
Tools & Technologies:Java, AWS(S3, DynamoDB, SNS, SQS, Flow, EC2), StanfordCoreNLP, OpenNLP, Object Oriented Software Development, Machine Learning, Natural Language Processing, Distributed Computing.
Data: Kindle Book Content
Founding member of the team. Played important role in defining the data model upon which future system components were based. From January 2011 to April 2011 (4 months) Greater Seattle AreaStudent Research Programmer @ ISI conducts basic and applied research across an exceptionally wide range of advanced information processing, computer and communications technologies.
Tools & Technologies: Java, Mallet, PDF Extraction, Topic modeling, Classification.
Lead developer on LAPDF-Text. https://github.com/BMKEG/lapdftextProject
Shipped topic modeling based tools for corpus analysis. From February 2010 to January 2011 (1 year) Greater Los Angeles AreaSoftware Engineer @ From June 2007 to June 2009 (2 years 1 month)
Master's, CS Specialization in Human Language technology @ University of Southern California From 2009 to 2011 BE, computer science @ Pune Institute of Computer Technology From 2003 to 2007 Abhishek Patnia is skilled in: Machine Learning, Natural Language Processing, Information Extraction, Distributed Systems, Scalability, Hadoop, Apache Hue, Apache Spark, Software Development, Java, Computer Science, Algorithms, Git, Shell Scripting, Spring