• Full-stack Data & Software Engineer with a solid programming background in Scala, Java & Python
• Expert in Data/Hadoop development and the Hadoop ecosystem (HDFS, MapReduce, Spark, Flume, Impala/Hive, Pig and ZooKeeper)
• Expert in backend development in Scala, Java and Python, with a solid background in web development and server maintenance
• Polyglot Programmer
• Learns very fast and has a hunger for new knowledge.
• Writes clean and concise code and is willing to adapt code style to suit the needs of a team.
• Definitely a team player with a great sense of humor.
• Interested in: Data, Machine Learning, Scala, Hadoop Ecosystem, new languages, diving into new algorithms, Container technologies (Docker et al.)
• Excited by new technologies, and inspires others to be excited about them too!
• Loves writing about stuff that matters
• Gets the brainstorms going and the nerd-juices flowing.
• Favorite playthings: Hadoop, Spark, Scala, Scalding, Docker, Python and the Python data stack
• Open Source Addict
• Strong believer in integrating with existing SaaS solutions where possible
• Collects Metrics everywhere
• Trained and experienced musician
Software Engineer @
From December 2015 to Present (1 month) Amsterdam Area, Netherlands
Data Engineer/Scala Engineer @ I was brought in at The New Motion to design and realise their (Big) Data strategy. I identified all data spread throughout the company and brought it together in a central Hadoop-based hub, built on the Cloudera Hadoop distribution. Using this data warehouse and the technologies from the Hadoop ecosystem, I enabled data-based decision making in the daily jobs of the people at TNM.
To support a data-driven company culture and empower people to ask the right questions of the data, I created various custom tools that make tasks like querying the cluster and creating dashboards much easier. Furthermore, I introduced tools like Tableau to enable a true self-service data environment.
In a team of data engineers, we then focused on:
- Ingestion of data from external sources (Salesforce, Navision) using Avro, the Kite SDK and more.
- Enabling more up-to-date data ingestion for our relational data sources using Sqoop
- Speeding up ETL processes using Spark
- (Near) real-time ingestion and aggregation of application & system logs, click streams and other fast-moving metrics using Flume, Kafka and Spark Streaming
- Integrating existing infrastructure/applications with the Hadoop systems in ways that make sense.
- Predictive analytics and machine learning using Spark MLlib
Because of my broad interests I also entertained myself with:
- Backend development in Scala
- Building an infrastructure based around Amazon Web Services & Docker
From June 2014 to November 2015 (1 year 6 months) Amsterdam Area, Netherlands
Software Engineer @ At Info.nl I was responsible for developing Java EE web applications for large clients in and outside of The Netherlands, such as Greenwheels and Allianz, using technologies like Java, Java EE 7, PostgreSQL, OpenMQ, GlassFish 4, Scala, Gatling, Spring, Stripes and the Magnolia CMS.
I was also responsible for architecting and developing Big Data applications using Hadoop, Pig, Cascading, Hive and Impala, both for analysing application logs for the whole company and for analysing the customer and behavioural data that our clients’ applications generate. For this purpose I developed some extensions to Flume that help gather logs in custom formats into HDFS using the Avro format (see: https://github.com/DandyDev/flume-plugins).
From November 2012 to May 2014 (1 year 7 months) Amsterdam Area, Netherlands
Lecturer Datastructures & Algorithms @ I taught data structures and algorithms to graduate students majoring in ICT. The course was co-developed with a fellow lecturer and covers most basic data structures, searching and sorting algorithms, time complexity, the Java Collections Framework and generics.
From October 2013 to March 2014 (6 months) Rotterdam Area, Netherlands
Lecturer Cryptography/Security @ I taught an existing course in Cryptography and Security to graduate students majoring in ICT. The course covers the history of cryptography and the practical workings of the most used cryptosystems, and applies a pragmatic combination of hands-on labs and theoretical lectures.
From September 2013 to December 2013 (4 months) Rotterdam Area, Netherlands
Intern Hadoop/Marketing Intelligence @ My bachelor thesis revolved around the question of how Big Data technologies, and Hadoop in particular, could be applied in a marketing context for the processing and analysis of customer and response data. This particular internship caught my attention because it's less common to see Big Data problems identified and tackled in the realm of marketing. 2organize was looking for ways to both increase the efficiency of processing data for their clients and to glean more insights and extract more value from their customers' data that was used in marketing automation processes.
I formulated ways in which marketing (automation) processes could benefit from using technologies like Hadoop, Pig and Hive. In the end, I worked together with the marketing intelligence analysts to deliver a Hadoop-based prototype that was used to gain insights into the response data from a series of large-scale email campaigns for a big client.
From July 2012 to October 2012 (4 months) Rotterdam Area, Netherlands
Software Engineer @ Developed small to medium-sized actionable websites for high-profile clients in Ruby on Rails. Most websites were part of multichannel marketing campaigns that typically had a lifecycle of a couple of months to half a year. Because of the relatively short lifespan of those websites and the changing requirements, development was done under strict deadlines and with an agile mindset.
Clients include Vodafone, KPN, NS (Dutch railways) and others.
From July 2012 to October 2012 (4 months) Rotterdam Area, Netherlands
Lecturer Cryptography 101 @ I developed an extracurricular course for college students covering the history of cryptography and detailing the most common cryptosystems, applying a pragmatic combination of hands-on labs and theoretical lectures.
From January 2012 to July 2012 (7 months) Rotterdam Area, Netherlands
Lead Engineer @ Led a team of 5 to 10 developers and designers in bringing the MOvE Online CMS for Artists to the next level. Responsible for transitioning the company to Scrum. Systems I designed include a new authentication system, an API, the functional design for a future mobile app, and a host of new features in the CMS that led to customer growth of 50% in 18 months.
From September 2011 to June 2012 (10 months) Rotterdam Area, Netherlands
Software Engineer @ Responsible for building new features and modules for a hosted CMS solution geared towards Artists and Musicians. Extensive use of: PHP5/Symfony/Doctrine/MySQL/HTML5/CSS3
Also responsible for server installation and maintenance (Ubuntu Server, LAMP stack)
From June 2010 to September 2011 (1 year 4 months) Rotterdam Area, Netherlands
Owner & Pianist/Composer/Arranger @ I (co-)developed and participated in many musical projects as pianist/composer/arranger/bandleader and producer. I toured extensively, including a tour to Singapore and other appearances abroad.
Some of the highlights:
• Composed, recorded and co-produced album for Dutch singer Renee van Bavel
• Composer, arranger, musical director and pianist of 4 major theatrical tours with Renee van Bavel
• Composer, arranger, musical director and pianist of 3 festival productions for the large Boulevard Festival.
• Arrangements and orchestrations of 3 opera shows for Theatre Company Hofplein
• Various other appearances as recording artist and live pianist
From September 2005 to June 2011 (5 years 10 months)
Bachelor, Computer Science @ Rotterdam University of Applied Science From 2009 to 2012
BMus with Honours, Music & Performance @ From 2001 to 2005
Music @ From 1999 to 2001
Daan Debie is skilled in: Java, Python, Hadoop, Big Data, Apache Spark, Machine Learning, Scala, Software Design, Java Enterprise Edition, PostgreSQL, Flume, Pig, Hive, Cascading, Data Analysis
Websites:
http://www.dandydev.net