Big Data Lead at Accenture , London, UK
London, Greater London, United Kingdom
➢ Substantial experience in Big Data projects using Hadoop, Hive, HBase, Cassandra, working with large data sets for AWS Configuration management and deployment to Spark+Kafka cluster (real-time scalable distributed engine) on Cloudera premises. Production implementation of Lambda architecture with Spark Streaming, Kafka, Cassandra, Akka, Scala ➢ Data engineer focused on the immediate benefits for the business using... ➢ Substantial experience in Big Data projects using Hadoop, Hive, HBase, Cassandra, working with large data sets for AWS Configuration management and deployment to Spark+Kafka cluster (real-time scalable distributed engine) on Cloudera premises. Production implementation of Lambda architecture with Spark Streaming, Kafka, Cassandra, Akka, Scala ➢ Data engineer focused on the immediate benefits for the business using the Big Data tools (HDFS and MapReduce paradigm, Spark, Hive, Sqoop, Hbase, Cassandra, SAP HANA, MongoDB) with advanced analytical and visualization APIs (graph DB – Titan, Neo4J, Tinkerpop, software development – Scala, Akka, R, WEKA, Gremlin) ➢ Proven history of building large-scale data processing systems and serving as an expert in data warehousing solutions while working with a variety of database technologies, practical experience in architecting highly scalable, distributed systems using different open source tools as well as designing and optimizing large, multi-terabyte data warehouses (data lakes). ➢ Played a lead role in determination of overall solution architectures and designs consistent with architecture to support strategic Big Data initiatives across domains ➢ Designed a large data lake and distributed data framework using lambda architecture as Big Data analytics platform for processing customer transactions (sales orders and payments) using Sqoop, Scala, Hadoop, Hive and Pig to facilitate fraud and outlier detection. ➢ Installed and configured Big Data ecosystem - Apache Hadoop, Spark cluster, Kafka+Spark Streaming, Hive and HBase on the prototype and production environment. Configured SQL database to store Hive metadata. Loaded unstructured data into Hadoop File System (HDFS). ➢ Created ETL jobs to load business transactions (MySQL, S3, Redis) data and server data into Hive, HBase, Cassandra to provide fast access to historical data on the fly for predictive modeling with real time data from the stream
📖 Summary
Senior Data Engineer @ ➢ Substantial experience in Big Data projects using Hadoop, Hive, HBase, Cassandra, working with large data sets for AWS Configuration management and deployment to Spark+Kafka cluster (real-time scalable distributed engine) on Cloudera premises. Production implementation of Lambda architecture with Spark Streaming, Kafka, Cassandra, Akka, Scala ➢ Data engineer focused on the immediate benefits for the business using the Big Data tools (HDFS and MapReduce paradigm, Spark, Hive, Sqoop, Hbase, Cassandra, SAP HANA, MongoDB) with advanced analytical and visualization APIs (graph DB – Titan, Neo4J, Tinkerpop, software development – Scala, Akka, R, WEKA, Gremlin) ➢ Proven history of building large-scale data processing systems and serving as an expert in data warehousing solutions while working with a variety of database technologies, practical experience in architecting highly scalable, distributed systems using different open source tools as well as designing and optimizing large, multi-terabyte data warehouses (data lakes). ➢ Played a lead role in determination of overall solution architectures and designs consistent with architecture to support strategic Big Data initiatives across domains ➢ Designed a large data lake and distributed data framework using lambda architecture as Big Data analytics platform for processing customer transactions (sales orders and payments) using Sqoop, Scala, Hadoop, Hive and Pig to facilitate fraud and outlier detection. ➢ Installed and configured Big Data ecosystem - Apache Hadoop, Spark cluster, Kafka+Spark Streaming, Hive and HBase on the prototype and production environment. Configured SQL database to store Hive metadata. Loaded unstructured data into Hadoop File System (HDFS). ➢ Created ETL jobs to load business transactions (MySQL, S3, Redis) data and server data into Hive, HBase, Cassandra to provide fast access to historical data on the fly for predictive modeling with real time data from the stream From December 2014 to Present (11 months) TorntoSenior BI Develper / Project Manager @ ➢ Lead in the implementation of Business Applications and/or IT Infrastructure projects (project teams of 10+ people) starting with requirements gathering to project deployment ad post go-live support. ➢ Business Intelligence thought leader with new emphasis on support, services, standards, and best practices (MicroStrategy, Tableau Software+R, Alteryx, Hadoop +R) ➢ Operational intelligence implementation - real-time business analytics from Big Data (Hadoop, Spark, Kafka, NoSQL, Splunk) ➢ Adept at concentrating on business events rather than known reporting requirements so as to model whole business process areas (Business Event Analysis and Modeling technology) ➢ Strong practical foundation in project management Data warehousing , resource management , BI reporting tools, process standardization ➢ Experience working with large data sets, experience working with distributed computing tools a plus (C#+Map/Reduce, HDInsight (Hadoop) , Hive) ➢ Proven skills in translating results of complex analytic problems into actionable recommendations or strategies , practical competence in data analysis and data science in mining and equipment reliability domains From July 2008 to March 2015 (6 years 9 months) Toronto Oleg Baydakov is skilled in: Business Process, ERP, Business Analysis, JD Edwards, Business Process..., Mining, Vendor Management, Integration, SharePoint, Financial Reporting, IT Management, MS Project, Business Intelligence, SAP, Change Management
What company does Oleg Baydakov work for?
Oleg Baydakov works for Paytm Labs
What is Oleg Baydakov's role at Paytm Labs?
Oleg Baydakov is Senior Data Engineer
What industry does Oleg Baydakov work in?
Oleg Baydakov works in the Information Technology and Services industry.
Who are Oleg Baydakov's colleagues?
Oleg Baydakov's colleagues are Johann Els, Sachin Singh, Roxanne Ward, Javed Iqbal, Kayes Ahsan, Oliver Seeley, Sunny Singh, Selda Dinc, Aaron King, and Arti Khetarpal
Extraversion (E), Intuition (N), Feeling (F), Judging (J)
3 year(s), 10 month(s)
Unlikely
Likely
There's 85% chance that Oleg Baydakov is seeking for new opportunities
Enjoy unlimited access and discover candidates outside of LinkedIn
One billion email addresses and counting
Everything you need to engage with more prospects.
ContactOut is used by
76% of Fortune 500 companies