“Data versioning is like flossing. Everyone agrees it’s a good thing to do, but few do it.” ~ Chip Huyen, Designing Machine Learning Systems Unlike code versioning, it is a lot more difficult to implement data versioning in data science / machine learning projects. It is because of the following reasons: ➡️ Data is often… Continue reading Data versioning
Author: Chee Yee Lim
Concept drift vs data drift vs covariate drift
Do you always get confused among concept drift vs data drift vs covariate drift like me? The diagram (from a research paper, https://arxiv.org/abs/1511.03816) provides a clear illustration of the different terms. In summary, concept drift in data refers to changes in environmental conditions that differ from the original environmental conditions under which a model is… Continue reading Concept drift vs data drift vs covariate drift
MLU-Explain : Visual explanation of ML concepts
A very cool website from Amazon that explains various machine learning concepts using interactive and visual essays. https://mlu-explain.github.io/ Using simple and interesting examples, the website really brings to life many core concepts in machine learning and makes them accessible to more people. This reminds me of how I learned physics during my high school era.… Continue reading MLU-Explain : Visual explanation of ML concepts
ML system design – problem definition & consulting
I recently started reading the excellent book called Designing Machine Learning Systems by Chip Huyen. In the first few chapters, the book illustrated very clearly the differing stakeholder expectations of an ML system, by using a restaurant recommendation app as an example. Data scientists / ML engineers ➡️ Want a model that recommends the best… Continue reading ML system design – problem definition & consulting
The happiness equation
(Image source: https://twitter.com/aurelien_gohier/status/1062248485154705408) We always tell ourselves that happiness will follow after we have worked hard and achieved great success. ”I will be very happy if I achieve the next milestone, be it a job promotion or buying a house or earning my first bucket of gold.” However often after we achieved those successes, happiness… Continue reading The happiness equation
Midpoint review of M6 competition – results
As a quick follow-up to my last post on the midpoint review of M6 competition, I have looked into the actual performance statistics of my entries in the first half of the competition. The results are suprising in a few aspects : The results exhibit huge fluctuations from month to month. (Perhaps partially reflecting the… Continue reading Midpoint review of M6 competition – results
Midpoint review of M6 competition
With the ending of June, it is now the halfway point of M6 competition. It may be a good time to do a quick review of my progress and learnings from the M6 competition so far. (And also to get me into the habits of regularly writing blogs!) Progress in M6 competition For a brief… Continue reading Midpoint review of M6 competition
A beginner guide to the folder structure generated by cookiecutter-pypackage
cookiecutter-pypackage offers a very well equipped standard project template to create a Python package. However for many first time users, the automatically generated folder structure can be quite intimidating. The typical folder structure generated by cookiecutter-pypackage (https://github.com/cheeyeelim/cookiecutter-pypackage) (v1.1.2) looks like this : Note that all mentions of {{cookiecutter.project_slug}} and {{cookiecutter.pkg_name}} will be replaced by the… Continue reading A beginner guide to the folder structure generated by cookiecutter-pypackage
Trying to use cookiecutter-pypackage and ended up contributing to it
cookiecutter-pypackage provides an amazing framework to develop a Python package with. The original version (https://github.com/audreyfeldroy/cookiecutter-pypackage) has been further extended by many contributors to include useful tools such as poetry and pre-commit. I decided to start using one of the newer versions of cookiecutter-pypackage (https://github.com/waynerv/cookiecutter-pypackage) (v1.1.1) before running into an error in one of the GitHub… Continue reading Trying to use cookiecutter-pypackage and ended up contributing to it
Loading child CSS style in WordPress
If you are a WordPress newbie and wanted to add your own CSS styles onto an existing WordPress theme using child template like me, you may also be following the instruction on WordPress. As I was working with the twenty-twenty-one theme, which uses get_template functions to load CSS styles, I added the following given code… Continue reading Loading child CSS style in WordPress