New statistical learning course on openclassroooms

My new course on statistical learning is now available on openclassrooms.

Design Effective Statistical Models to Understand Your Data

Explore linear, logistic and polynomial regression with hands on exercises, real-world use-cases and non trivial datasets.

Bridging the gap between statistical modeling and machine learning.

Teaching data science at UM6P

In 2018, the renowned École Polytechnique, Mohammed VI Polytechnic University and the Foundation of École Polytechnique launched a Chair in “Data Science and Industrial Processes” in Morocco which I had the chance of inaugurating in the amazing UM6P campus of the Emines school of industrial management located in Benguerir Morocco.

Read more: Teaching Data Science at UM6P

Workshops: Topic modeling in R

I recently conducted 2 workshops on Topic Modeling with the R STM package for 2 very different student populations:

Universite Paris Est Marne la Vallee In French, for the Master in ETUDES NUMERIQUES ET INNOVATION at the Université Paris Marne La Vallée. Slides, Github

Berklee Valencia The Berklee College of Music Digital Studies MBA in Valencia

Data Science at General Assembly

GA In the summer 2016, I had the pleasure of teaching a full Data Science Curriculum at General Assembly. Over 20 sessions and 60 hours, we covered a lot of ground, a lot of topics and some of the final students projects were amazing. The slides, code and datasets are all available on github at

The course covers:

  • Statistical inference
  • Bias and variance, Learning curves and overfitting.
  • Visualization with matplotlib and plotly
  • Supervised: Regression and classification
  • Unsupervised: Clustering
  • Time series: ARMA models
  • Tree based models: Random forests and Boosted trees
  • Support Vector Machines
  • NLP: sentiment analysis, Topic Modeling, POS tagging, wordnet, …
  • Logistic regression

It was an intense curriculum!