Summary of Qualifications
• 4 years of experience in application development, database management, and Apache Hadoop administration and development.
• 3.5+ years of experience in big data technologies: Hadoop HDFS, MapReduce, Pig, Hive, Oozie, Flume, HCatalog, Sqoop, and ZooKeeper; NoSQL: Cassandra and HBase.
Apache Hadoop Development Experience
• Experience in solving complex analytical problems and performing joins with MapReduce.
• Strong knowledge of Apache Hive administration and development.
• Experience in writing UDFs, UDAFs, and UDTFs in Java for Hive.
• Solid experience in Pig administration and development.
• Experience in writing Pig UDFs (Eval, Filter, Load and Store) and macros.
• Familiar with embedding Hive and Pig in Java.
• Experience in using HCatalog for Hive, Pig and HBase.
• Developed ETL processes to load data from multiple data sources into HDFS using Flume and Sqoop, perform structural modifications using MapReduce and Hive, and analyse data using visualization/reporting tools.
• Familiar with writing Oozie workflows and Job Controllers for job automation.
• Familiar with importing and exporting data using Sqoop.
• Experience in using Flume to stream data into HDFS.
• Familiar with administering and developing in Cassandra and HBase.
• Experience in writing MRUnit and PigUnit test cases.
Application Development Experience
• Solid background in Object-Oriented analysis and design.
• In-depth understanding of data structures and algorithms.
• Good understanding of distributed systems and parallel processing architecture.
• Experience in developing multithreaded applications.
• Experience in deploying multi-node Hadoop clusters with different Hadoop components (Hive, Pig, Sqoop, Oozie, Flume, HCatalog, HBase, ZooKeeper) using Cloudera Manager.
• Strong knowledge of configuring NameNode high availability and NameNode federation.
• Experience in deploying Hadoop 2.0 (YARN).
Sr. Big Data Hadoop Platform Engineer @
• Designed, developed, and supported fast, deterministic, and scalable frameworks for all levels of Groupon's data pipeline software stack.
• Determined when components met acceptable quality criteria and standards.
• Monitored, investigated and identified problems with production components, processes and data; helped ETL engineers diagnose and test fixes.
• Contributed Groupon's data framework products, such as ZombieRunner and Megatron, to the open source community.
• Developed frameworks and automation tools using object-oriented Python and the Hadoop stack.
• Worked extensively with distributed databases and query languages such as SQL, HQL, and HBase.
• Worked extensively within the Hadoop/Spark ecosystem.
• Worked on storage, replication, and indexing.
• Developed many data automation tools using scripting languages.
• Worked extensively with Unix/Linux systems.
• Experienced in building frameworks, platforms and APIs.
• Worked with, and familiar with, design patterns.
• Built and supported Hadoop environments, including design, capacity planning, cluster setup, performance tuning, and monitoring.
• Strong understanding of the Hadoop ecosystem, including HDFS, MapReduce, HBase, Pig, Hadoop Streaming, Sqoop, Spark/Shark, and Hive.
• Experience installing, administering, and supporting Linux operating systems and hardware in an enterprise environment.
• Applied typical system administration and programming skills to storage capacity management and performance tuning.
• Managed and monitored Hadoop clusters and platform infrastructure.
• Responsible for cluster availability.
• Supported development and production deployments.
From December 2014 to Present (11 months) San Francisco Bay Area
Master of Science (MS), Management Information Systems @ Oklahoma State University From 2012 to 2014
Bachelor of Technology (BTech), Computer Science & Engineering @ Shanmugha Arts, Science, Technology and Research Academy From 2007 to 2011
Ajay Guyyala is skilled in: Hadoop, Cassandra, SQL, Data Mining, Unix, SAS, PL/SQL, SAS programming, Data Analysis, C, Predictive Modeling, Software Project..., CPP, SAS Certified Base..., Oracle SQL, C++, Statistical Modeling, Oracle, Statistics, Tableau, Java, SAS E-Miner, VBA, IBM SPSS Modeler, RapidMiner, JMP, Text Mining, Oracle 9i, MicroStrategy, Apache Pig, Hive, Impala, Hadoop ecosystem, MapReduce, Oozie, Sqoop, Spark, HDFS, ZooKeeper, Mahout, Flume, HBase, YARN, Databases, Shell Scripting, Microsoft SQL Server, Analytics, MySQL, Agile Methodologies, Microsoft Office