I am a recent graduate with a B.S. in statistics from Cal Poly. I fell in love with the computing aspect of statistics, specifically the use of R in big data mining. After being exposed to a simple decision tree, I became fascinated with machine learning and predictive analytics, and I am hoping to enter the workforce in that field. My goal is to use predictive models that help companies make actionable decisions.
Data Analyst @
Using R and Python to push analysis results into a Semantic Data Lake containing billions of triples. The analysis covers PCA, cluster analysis, and various relationship measures such as Cramér's V, among others. This lets anyone obtain interesting analytical results on patient data through the Semantic Data Lake.
Also taking part in a large-scale project aimed at improving the outcomes of patients at high risk of developing acute respiratory failure inside or outside the ICU. This involves taking part in the creation of a real-time R model (random forest) that can be accessed through the Semantic Data Lake and that produces scores indicating whether or not to intervene.
From October 2015 to Present (3 months)

Junior Data Scientist Intern @
At Franz Inc. I used SPARQL to query data from a semantic database containing patient data from Montefiore Hospital. With this data, I made use of R and Python for extensive data manipulation, and applied various supervised and unsupervised machine-learning algorithms (Naïve Bayes, Random Forests, K-means, etc.) to diagnostic data containing millions of observations. Used the R package SuperLearner to create an ensemble of classification algorithms to predict readmission within 30 days, created a fuzzy cluster to group patients together based on their medical history, and made use of several other machine learning algorithms to answer pressing healthcare analytics problems.
From July 2015 to Present (6 months)

Web Analyst Intern @
I was part of a web analytics team that performs analysis in order to gain more insight into the company's website. I use Adobe Omniture to obtain large amounts of data about the website. Based on this data, I create reports in R and Excel that aid decisions for the creative, marketing, and engineering divisions. I created predictive machine learning models to estimate the probability that a member of the website will cancel their membership.
Also took part in the online advertising aspect of the company, and managed ad groups and keywords in Google AdWords.
From February 2015 to Present (11 months) San Luis Obispo, California Area

Cal Poly Soccer Team Analyst @
Worked with three other statisticians to carefully follow and quantify the soccer games of one of the top college soccer teams in the nation. Taking detailed observations for every game, we tracked certain key areas the head coach was interested in, and used them to provide valuable information that could ultimately better the team. Using R, Excel, and other software, we created models using logistic regression, descriptive statistics, and other techniques, and gave frequent presentations to the coach and assistant coaches.
From February 2014 to December 2014 (11 months)

Statistical Analyst Intern @
Was responsible for the statistics of a group of PhD professors working to publish a long-term film analytics project. Worked primarily with text data. Used R and text packages like "tm" to create corpus objects and look for associations between words and terms. Gained experience with big data extraction, primarily using R to create and format tidy data sets and to perform several types of analysis. Another key challenge was presenting my findings in understandable formats for people not literate in statistics.
From July 2014 to August 2014 (2 months) Groningen, Netherlands

Bachelor of Science (BS), Statistics @ California Polytechnic State University-San Luis Obispo
From 2011 to 2015

Boudewijn Aasman is skilled in: R, Statistics, Predictive Analytics, Statistical Consulting, Data Mining, Survey Research, Online Advertising, Text Mining, Statistical Computing, Web Analytics, Microsoft Excel, Presentations