Image of Dmitriy Ryaboy

Dmitriy Ryaboy

Senior Engineering Manager, Analytics Infrastructure

VP Software Engineering at Zymergen, Inc.

San Francisco, California

Section title

Dmitriy Ryaboy's Email Addresses & Phone Numbers

Dmitriy Ryaboy's Work Experience


Senior Engineering Manager, Analytics Infrastructure

February 2012 to Present

The Apache Software Foundation

Apache Pig committer

January 2010 to 2014


Senior Software Engineer

January 2010 to February 2012

Dmitriy Ryaboy's Education

Carnegie Mellon University

MS Computer Science

2008 to 2009

University of California, Berkeley

BSc Electrical Engineering and Computer Science

1998 to 2002

About Dmitriy Ryaboy's Current Company


Analytics Infrastructure, Data Platform, Product Instrumentation and Experimentation.

About Dmitriy Ryaboy

📖 Summary

Specialties: Building and leading strong engineering teams. Databases, Hadoop. Experience with large (petabyte-sized) clusters, hand-coded ETL systems, distributed systems. As a manager, I ensure the team has a clear objective, instill a culture of taking pride in quality and workmanship, and having done that, focus on getting roadblocks and distractions out of their way.Senior Engineering Manager, Analytics Infrastructure @ Analytics Infrastructure, Data Platform, Product Instrumentation and Experimentation. From February 2012 to Present (3 years 11 months) Apache Pig committer @ Token non-Yahoo in the early days. Pig was a learning experience -- working in open source, working with Hadoop (starting with Hadoop 0.18), working with a large existing codebase. Some of the more interesting things I contributed to: PIG-1427: Turns out many UDFs normally run in milliseconds, but given bad enough input will process a single call for minutes. On very large data sets this happens with depressing regularity, a lot of parsing code has these corners. Pig-1427 allowed UDFs to cut off execution after a configured amount of time. PIG-1205, PIG-1680, PIG-1782 and others Essentially rewrote the HBaseStorage, turned it from a simple loader into a storer as well, got it to take advantage of a number of HBase features, and added a number of things someone actually using HBase might want to have access to, like row keys. Together with Bill Graham we got it to the point that's mostly unchanged today, though others have added features since. PIG-1755 and PIG-2403 Deleted a couple thousand lines of code. Drop in the bucket for code duplication in Pig, sadly, but fighting the good fight... PIG-2151 Add annotation to specify output schema in Java UDFs. If you've ever tried to write the outputSchema function for a Pig UDF and retained your sanity, you will know why this is useful. PIG-2525: Support pluggable PigProcessNotifcationListeners on the command line This is what made awesome things like Twitter's Ambrose and Netflix's Lipstick possible. PIG-2888 + others: Improve performance of POPartialAgg A complete rewrite of the in-memory map-side aggregation to work in non-optimal use cases. Turned one 14-minute job into a 30-second job. Removed hard-coded large memory buffers. I also helped write the first version of Pig-on-Spark, helped with Pig's auto-local mode, and advised on things like pluggable reducer estimation and code-genned classes for tuples with known schemas. From January 2010 to 2014 (4 years) Senior Software Engineer @ Responsible for all aspects of the offline analytics infrastructure. Development of ETL and data processing tools, data lifecycle enforcement, batch workflow libraries, major contributions to Apache Pig in collaboration with the open-source community surrounding that project. Served as the main reference for all things Hadoop-related. Grew the team, recruiting infrastructure engineers, data scientists, and information visualization specialists. From January 2010 to February 2012 (2 years 2 months) Software Engineering Intern @ While in graduate school, I had the good fortune to intern at Cloudera, then just a few months old. During my internship, I surveyed Hadoop data processing and ETL solutions employed by Cloudera friends and customers, contributed improvements to Apache Pig, and created a visual Pig script builder prototype. From May 2009 to August 2009 (4 months) Senior Software Engineer @ From August 2008 to May 2009 (10 months) Software Engineer @ As a Software Engineer with the Data Engineering group at, my responsibilities include ETL, summarization, and analysis of large volumes of clickstream data. Some highlights: * Responsible for all software for identifying and flagging non-human traffic to the website. Developed and tuned heuristics for both post-factum and real-time analysis of the clickstream data. * Created system for distribution of non-standard reports via the web, ftp, or email. * SQL tuning, ad hoc data analysis, automations, requirements gathering, and more. From September 2005 to August 2008 (3 years) Software Developer @ * Created a tool for automated alignment of related microbial dna * Developed of Vista Genome Browser 2.0, a Java 2 applet/servlet combination that provides a dynamic visual front end to a database of multiple species whole-genome alignments. Responsible for all server-side computation, communication with MySQL databases, XML generation, and XML parsing on the client side. Load balancing and query optimization techniques were used to optimize browser performance and prevent resource drain. * Developed and maintained Text Browser, a Perl and MySQL based interface that allows web users to browse whole genome alignments. * Created an easily adaptable system for highly controlled data access over the Internet. The Perl-based server featured user/group permission capability, transparent data access, file locking and usage tracking. From 1999 to September 2005 (6 years) MS, Computer Science @ Carnegie Mellon University From 2008 to 2009 BSc, Electrical Engineering and Computer Science @ University of California, Berkeley From 1998 to 2002 Dmitriy Ryaboy is skilled in: Pig, Databases, Hadoop, MapReduce, Distributed Systems, Scalability, HBase, Big Data, Information Retrieval, Perl, Java, Apache Pig, Data Mining, Software Engineering, MySQL, Maven, Cascading

Dmitriy Ryaboy’s Personal Email Address, Business Email, and Phone Number

are curated by ContactOut on this page.

Frequently Asked Questions about Dmitriy Ryaboy

What company does Dmitriy Ryaboy work for?

Dmitriy Ryaboy works for Twitter

What is Dmitriy Ryaboy's role at Twitter?

Dmitriy Ryaboy is Senior Engineering Manager, Analytics Infrastructure

What is Dmitriy Ryaboy's personal email address?

Dmitriy Ryaboy's personal email address is d****[email protected]

What is Dmitriy Ryaboy's business email address?

Dmitriy Ryaboy's business email addresses are not available

What is Dmitriy Ryaboy's Phone Number?

Dmitriy Ryaboy's phone (213) ***-*311

What industry does Dmitriy Ryaboy work in?

Dmitriy Ryaboy works in the Computer Software industry.

Who are Dmitriy Ryaboy's colleagues?

Dmitriy Ryaboy's colleagues are Jonas Neubert, Walter Tamulis, Matt Chamberlin, Joshua Coats, Elise Reus, Tessa Alexanian, Andrew Jesaitis, Jed Estep, Zachary Palchick, and Kiyan Ahmadizadeh

10x your recruitment & sales conversations

Contact over 200M professionals
instantly by email or phone. Reveal
personal & work email addresses, as
well as phone numbers accurately with
our ContactOut Chrome extension.

In a nutshell

Dmitriy Ryaboy's Personality Type

Extraversion (E), Intuition (N), Feeling (F), Judging (J)

Average Tenure

2 year(s), 10 month(s)

Dmitriy Ryaboy's Willingness to Change Jobs



Open to opportunity?

There's 92% chance that Dmitriy Ryaboy is seeking for new opportunities

Engage candidates 10x faster

Enjoy unlimited access and discover candidates outside of LinkedIn

one billion email addresses

One billion email addresses and counting

Everything you need to engage with more prospects.

2x More emails
vs. competitors
99% Accuracy
40+ Integrations

ContactOut is used by

76% of Fortune 500 companies

Microsoft Nestle PWC Merck Rackspace
Try ContactOut
for free today
  • 50 contacts/month
  • Works on standard LinkedIn only
  • Work emails, personal emails, mobile numbers
* 1 user per company limit
Try ContactOut for Free