Senior Software Engineer at Uber
San Francisco, California
Interests: Data Mining, Data Integration, Big Data Analytics, Real-Time data Processing, Distributed Systems. Machine LearningBackend Software Engineer @ Built and improved event organizer reporting tools. Built and maintaining sqoop infrastructure to copy tables from MySQL to Hive via Oozie. Assisted the Data Warehousing team in Hadoop cluster upgrade and migration of old data. Replaced unix cron based...
Interests: Data Mining, Data Integration, Big Data Analytics, Real-Time data Processing, Distributed Systems. Machine LearningBackend Software Engineer @ Built and improved event organizer reporting tools. Built and maintaining sqoop infrastructure to copy tables from MySQL to Hive via Oozie. Assisted the Data Warehousing team in Hadoop cluster upgrade and migration of old data. Replaced unix cron based job scheduling with Oozie hadoop based job scheduling for third party data collection. Designed and implemented strategies to improve Oozie scheduler and hdfs data storage. Built and maintained ETL jobs to clean and aggregate data for the analytics team using Hive, Impala. Automated the collection of third party data through APIs. From February 2014 to Present (1 year 11 months) Graduate Research Assistant @ Integrated graph based query rewriting with the BIRN mediator (developed at ISI). Developed and integrated Global-Local-as-View(GLAV) model with the BIRN mediator. Developed and implemented a virtual data integration system which extends the BIRN Mediator to include OWL 2 ontologies where these ontologies are used as domain schema. From August 2012 to December 2013 (1 year 5 months) Mobile Platform Backend Software Development Intern @ Designed and developed a real-time distributed big data processing tool using Apache Kafka and Twitter Storm. Data gathered as a result helps in monitoring, alerting and trending of useful metrics for eBay Marketplaces. From May 2013 to August 2013 (4 months) Visiting Research Scholar @ Worked in research and development of a data integration tool with Prof. Michael Stonebraker. Developed and Implemented name resolution using approximate string matching through trigram phrase matching on the data set provided by Goby.com. Developed and implemented Silny, unsupervised algorithm for entity resolution on sparse relational data using tf/idf metric. This project resulted in a startup, Tamr based in Cambridge, MA. http://www.tamr.com/ Tamr recently raised $16M from Google Ventures and NEA. From May 2011 to December 2011 (8 months) Visiting Research Scholar @ Compared RTLinux with Vanilla Linux and other commercial real time operating systems. Learned about advanced time management in the Linux kernel. From May 2010 to July 2010 (3 months) Bachelor of Technology (B.Tech.), Computer Science and Engineering @ Indian Institute of Technology, Guwahati From 2008 to 2012 Master's Degree, Computer Science @ University of Southern California From 2012 to 2013 Dhruv Sharma is skilled in: Algorithms, Machine Learning, Artificial Intelligence, Java, Data Mining, Computer Science, Hadoop, Python, Software Development, Data Integration, Apache Kafka, Storm, Big Data, PostgreSQL, C++
Eventbrite
Backend Software Engineer
February 2014 to Present
Information Sciences Institute
Graduate Research Assistant
August 2012 to December 2013
eBay Inc
Mobile Platform Backend Software Development Intern
May 2013 to August 2013
MIT
Visiting Research Scholar
May 2011 to December 2011
Boston University
Visiting Research Scholar
May 2010 to July 2010
Built and improved event organizer reporting tools. Built and maintaining sqoop infrastructure to copy tables from MySQL to Hive via Oozie. Assisted the Data Warehousing team in Hadoop cluster upgrade and migration of old data. Replaced unix cron based job scheduling with Oozie hadoop based job scheduling for third party data collection. Designed and implemented strategies to... Built and improved event organizer reporting tools. Built and maintaining sqoop infrastructure to copy tables from MySQL to Hive via Oozie. Assisted the Data Warehousing team in Hadoop cluster upgrade and migration of old data. Replaced unix cron based job scheduling with Oozie hadoop based job scheduling for third party data collection. Designed and implemented strategies to improve Oozie scheduler and hdfs data storage. Built and maintained ETL jobs to clean and aggregate data for the analytics team using Hive, Impala. Automated the collection of third party data through APIs.
What company does Dhruv Sharma work for?
Dhruv Sharma works for Eventbrite
What is Dhruv Sharma's role at Eventbrite?
Dhruv Sharma is Backend Software Engineer
What industry does Dhruv Sharma work in?
Dhruv Sharma works in the Computer Software industry.
Who are Dhruv Sharma's colleagues?
Dhruv Sharma's colleagues are Tim Kennedy, Pavel Shpilev, Alex Calder, Antonio Fazari, Mark Calleija, Simon Perry, Ankur Ghosh, Joel C., Rahul Chhabra, and Simon Robb
Enjoy unlimited access and discover candidates outside of LinkedIn
One billion email addresses and counting
Everything you need to engage with more prospects.
ContactOut is used by
76% of Fortune 500 companies