January 19, 2026 · 9 min read
How to work with AI
A set of best pratices on how to work and collaborate with AI to co-own your work

Latest Articles
Thoughts on AI, Machine Learning, Data Science, and MLOps.
January 19, 2026 · 9 min read
A set of best pratices on how to work and collaborate with AI to co-own your work

January 18, 2026 · 4 min read
Mimesis is a web-based interview training platform. It allows master-level students and researchers to conduct realistic interviews with adaptive AI personas grounded in real interview data. The project combines two strong axes: rapid delivery using agentic AI coding practices, and deep conceptual work on persona design as a proxy for human social behavior.
Mimesis
August 1, 2025 · 5 min read
Un projet expérimental mêlant intelligence artificielle et humanités numériques pour traduire les œuvres de Molière en français moderne, vers par vers. Conçu comme un laboratoire de travail avec les LLM, il explore le prompt engineering, la génération contrôlée de texte et les enjeux d’accessibilité, à travers une application web et mobile pensée pour un usage pédagogique réel.
Moliere.love
August 1, 2024 · 3 min read
Building something that works is no longer the hard part. Knowing whether it should exist is. This piece looks at two AI-heavy projects that never shipped, not because the technology failed, but because business reality intervened. From toxic content detection on TikTok to a PMF validation platform for solo founders, both experiments used AI to compress time-to-insight, test assumptions early, and expose strategic constraints fast. The focus is on how AI reshapes early-stage decision-making by making it cheaper to be wrong — and clearer when it’s time to move on.

June 19, 2024 · 5 min read
Analyse multimodal des videos TikTok grace au deep learning et machine learning classique
December 10, 2023 · 16 min read
Score a RAG pipeline with a generated Q&A baseline. Evaluate the performance RAG components: chunking, embedding and generative models as well as prompts.
December 7, 2023 · 3 min read
Specify Non-English, French stopwords in a weaviate collection with the Python V4 API
December 1, 2023 · 4 min read
Retrieval Augmented Generation is a simple technique to explore and question a proprietary corpus based on chunking, embeddings and LLMs. Chunking will make or break your RAG!
November 10, 2023 · 3 min read
The enduring appeal of Molière's plays lies in their ability to serve as mirrors reflecting universal human flaws and shortcomings. The humor in Molière's characters, often perceived as mockery, actually prompts self-reflection and self-awareness, offering a form of absolution for common human imperfections.


August 1, 2023 · 3 min read
At Karhoo, I led pricing and forecasting for a global taxi marketplace operating in real time across multiple continents. Facing accuracy ceilings and scaling limits with bespoke scikit-learn random forest models, we benchmarked and migrated to Vertex AI AutoML on GKE, with feature pipelines in BigQuery and automated model promotion. The transition delivered significantly higher prediction performance, lower infrastructure and operational costs, built-in drift handling, and greater flexibility for global rollout under strict latency constraints.

February 2, 2021 · 2 min read
The course covers basic BOW to static embeddings, glove style, with NLTK, Spacy and Gensim.
February 15, 2020 · 5 min read
Here are a few tips to make things easier and just a little bit pro when sharing your data analysis Jupyter notebook
October 21, 2019 · 2 min read
I am very excited to announce that my new course on statistical learning is now available on openclassrooms.

February 15, 2019 · 3 min read
I had the chance to inaugurate the new Data Science and Industrial Processes curriculum launched in 2018 at the Emines school of industrial management

February 21, 2018 · 11 min read
Deep Learning can be expensive. Use startup scripts to reduce virtual instance usage and lower costs. Reshape your workflow to maximize GPU based instances

February 6, 2018 · 6 min read
My iPhone addiction was ruining my life. So I decided to fight back. Here are the 5 simple and easy tricks I've implemented to regain control.

January 1, 2018 · 13 min read
Learn the best Google Cloud Storage features with these gsutil commands.

December 4, 2017 · 14 min read
You can now build super efficient predictive analytics pipelines on AWS without a single line of code!

November 2, 2017 · 2 min read
A recap of the most frequent gsutil commands for Google Cloud Storage.
October 24, 2017 · 1 min read
A talk I gave on how to use AWS ML to build powerful predictive analytics pipelines. Data Science without coding.
May 17, 2017 · 2 min read
Le topic modeling est une technique d'analyse non supervisée de texte qui fait surgir les thèmes principaux exprimés dans un corpus.
February 19, 2017 · 3 min read
Writing a book is hard work. Here's how I did it for my book on the Amazon Machine Learning service.

March 16, 2016 · 7 min read
Slides for the Boston Data Science Meetup on batch processing with scikit-learn

February 8, 2016 · 5 min read
Topic Modeling of Twitter Followers

February 5, 2016 · 12 min read
Amazon Machine Learning is a predictive analytics service with binary/multiclass classification and linear regression features. The service offers a simple workflow but lacks model selection features and has slow execution times. Predictive performances are satisfying.

December 5, 2015 · 1 min read
Jupyter notebooks are the top of the iceberg when it comes to online collaborative tools for data science. Learn about the other ones.
November 19, 2015 · 6 min read
Timemaps are a simple way to visualize the frequency of events and can be used in NLP and to detect bots

November 12, 2015 · 1 min read
Summarization, sentiment analysis, topic modeling, all relevant NLP methods to analyze political debates.

October 30, 2015 · 2 min read
Scikit learn implements out of core or batch processing in several algorithms. High speed boost when working on Large datasets.
September 16, 2015 · 12 min read
Comparing LSA and LDA for topic modeling of a corpus of twitter followers.

September 4, 2015 · 10 min read
Topic modeling of twitter followers with Gensim and Latent Dirichlet Allocation with python 3 code.
