Data scientist with a PhD in particle physics followed by a CERN Fellowship at the LHC. Expert developer of scalable and production-ready software architectures that are heavy on statistical analysis, forecasting, recommendation, automated information extraction or classification tasks. Proven team-and talent-management skills. A main author of numerous and well-cited scientific articles, including a prominent role in the first Higgs-boson discovery paper and subsequent landmark Higgs-properties publications. Co-founder of x.ai, a 50+ company building an automated personal assistant. Leading the data science team (15)
What I like: hard problems (I mean really hard), preferably never solved before. Not scared of time pressure or high levels of uncertainty. Need to work in teams with good communication flow. In other words, I am attracted to innovation.
-----------------------------
SKILL SET
-----------------------------
Machine Learning : At ease with common data science tasks: data modeling, analysis, forecasting, recommendation engines and unsupervised pattern recognition. Strong focus on natural language processing (NLP) tasks, including information extraction from unstructured data, text classification and calendar/event pattern recognition. Some of my favorite algos: Support Vector Machines, Neural Nets, Random Forests, Conditional Random Fields, Latent Dirichlet Distributions.
Favorite tools : love Scala. Lots of Python and C++. Some R, Root, and Octave
Data structures : MongoDB, ElasticSearch, Hadoop, Spark, SQL and other less known..
Dev-ops : serious about unit testing and GitFlow discipline, type safety, error handling, logging, etc..
Lead Data Scientist and Co-Founder @ Data Science tasks. Focus on Machine Learning applied to natural language processing :
-> Information extraction and semantic analysis of unstructured text (such as emails or calendars)
-> algos: svm's, random forests, CRF's, deep learning techniques applied to NLP
-> tools : Scala, Spark, Python, AWS, mongoDB... and whatever is needed.
-> coolest job ever.. From June 2014 to Present (1 year 5 months) NYCResearch Faculty / Data Scientist @ -> Led team of researchers in the design of an adaptive traffic analysis system which analyses and responds to real-time public transit/parking/and private vehicle positions. Part of a $3M FHWA Grant to develop a state-of-the-art traffic analysis system.
-> Lecture college students in physics and mathematics at FIU's Honors College. The program is intended to help advanced students dive deeper into specific research topics From August 2013 to July 2014 (1 year) Miami, FloridaData Scientist @ -> developed algorithms for adaptive advertising with under-200 millisecond response time
-> correlate user behaviour/profile data to likelihood of buying various commercial products
-> Tools: Scala, Python, ElasticSearch, MongoDB From October 2013 to February 2014 (5 months) West Palm Beach, Florida AreaDESY Research Fellowship @ Worked with a research team at the Large Hadron Collider at CERN. I was co-ordinator of a 15-institute research group within the Higgs-hunting group at ATLAS.
-> Developed large-scale data analysis tools/techniques in the search for evidence of a new fundamental particle/force of nature (the Higgs boson)
-> Developed analytical methods to differentiate signal from background and test the performance of such methods.
-> Many presentations with heavy data visualisation in conferences/workshops, etc...
-> Led effort of reducing the uncertainty in the Higgs signal strength measurement. The success of this 18-month project resulted in a number of novel data-analysis methods and an outstanding 80% reduction in this uncertainty. It also gave the team and the PhD students within my team a prominent role in the landmark Higgs discovery analysis, the results of which enjoyed ample world-wide press coverage in the summer of 2012.
-> Tools : C++, Root, Python, Monte Carlo techniques, farm/cluster interaction From May 2011 to July 2013 (2 years 3 months) Hamburg Area, GermanyCERN Fellowship @ -> Higgs discovery team : identical description to DESY Fellowship (see above) From October 2008 to January 2011 (2 years 4 months) Geneva Area, SwitzerlandPhD in high-energy particle physics @ -> Using billions of high-energy proton-electron collision data, extracted a measurement of alpha_s, one of the fundamental parameters of Quantum Chromodynamics, the theory that describes the sub-structure of protons.
-> The measurement resulted in 4 publications and led to a significant reduction in the uncertainty of this parameter, whose practical use is crucial in particle physics
-> Many presentations with heavy data visualisation in conferences/workshops, etc...
-> Tools: C++, Awk, Root, PAW, Fortran, cluster/farm interaccion From June 2004 to September 2008 (4 years 4 months) Hamburg
PhD, Elementary Particle Physics, Cum Laude @ Universidad Autonoma de Madrid From 2003 to 2008 Bachelor of Arts (B.A.), Physics @ UC Berkeley From 2000 to 2003 Bachelor in Applied Mathematics, Applied Mathematics @ UC Berkeley From 2000 to 2003 Florida Interna Marcos Belenguer is skilled in: Mathematical Modeling, Physics, Data Analysis, Algorithms, Monte Carlo Simulation, Particle Physics, Foreign Languages, Statistics, Python, Applied Mathematics, Teaching, Machine Learning, C++, Simulations, Linux, Numerical Analysis, Scientific Computing, Data Mining, Science, Experimentation, Mathematica, Hardware Diagnostics, Creative Strategy, Experimental Physics, Theory, R, Programming La, Presentations, Seminars, Hardware, Problem Solving, ROOT, Big Data, Programming, Matlab, Research, Mathematics, Natural Language..., Data Science, Project Management, Distributed Systems, Databases, Analysis, Computer Science, Data Visualization, Signal Processing, High Performance..., Scientific Writing, Scala