Illustrated stable diffusion from Jay Alammar

If you are interested in knowing how Stable Diffusion generates amazing AI arts, but are put off by the steep technical details, This illustrated guide from Jay Alammar should help you, https://jalammar.github.io/illustrated-stable-diffusion/. The guide helpfully breaks down the model into components and substitutes complex equations with simple flowcharts. P/S: I also highly recommend his illustrated… Continue reading Illustrated stable diffusion from Jay Alammar

Short review of Designing Machine Learning System

I have finally finished “Designing Machine Learning Systems” after a few weekends of focus reading. It is one of the rare technical books that I finished in its entirety, and I thoroughly enjoyed it. Just to offer a quick book review below. The book is amazing in the following aspects: ✅Provides a high-level overview of… Continue reading Short review of Designing Machine Learning System

Automatic speech recognition – Whisper OpenAI

Whisper is a recently released transformer-based automatic speech recognition (ASR) model from OpenAI. It can be used for: 🗣Language identification 🗣Voice activity detection 🗣Multi-lingual speech recognition 🗣Multi-lingual speech translation When evaluated on the ESB datasets (including LibriSpeech, Common Voice), Whisper outperformed Conformer RNN-T from NVidia and Wav2Vec2 from Meta. Link to blog: https://openai.com/blog/whisper/Link to repo:… Continue reading Automatic speech recognition – Whisper OpenAI

Data versioning

“Data versioning is like flossing. Everyone agrees it’s a good thing to do, but few do it.” ~ Chip Huyen, Designing Machine Learning Systems Unlike code versioning, it is a lot more difficult to implement data versioning in data science / machine learning projects. It is because of the following reasons: ➡️ Data is often… Continue reading Data versioning

Concept drift vs data drift vs covariate drift

Do you always get confused among concept drift vs data drift vs covariate drift like me? The diagram (from a research paper, https://arxiv.org/abs/1511.03816) provides a clear illustration of the different terms. In summary, concept drift in data refers to changes in environmental conditions that differ from the original environmental conditions under which a model is… Continue reading Concept drift vs data drift vs covariate drift

MLU-Explain : Visual explanation of ML concepts

A very cool website from Amazon that explains various machine learning concepts using interactive and visual essays. https://mlu-explain.github.io/ Using simple and interesting examples, the website really brings to life many core concepts in machine learning and makes them accessible to more people. This reminds me of how I learned physics during my high school era.… Continue reading MLU-Explain : Visual explanation of ML concepts

ML system design – problem definition & consulting

I recently started reading the excellent book called Designing Machine Learning Systems by Chip Huyen. In the first few chapters, the book illustrated very clearly the differing stakeholder expectations of an ML system, by using a restaurant recommendation app as an example. Data scientists / ML engineers ➡️ Want a model that recommends the best… Continue reading ML system design – problem definition & consulting

The happiness equation

(Image source: https://twitter.com/aurelien_gohier/status/1062248485154705408) We always tell ourselves that happiness will follow after we have worked hard and achieved great success. ”I will be very happy if I achieve the next milestone, be it a job promotion or buying a house or earning my first bucket of gold.” However often after we achieved those successes, happiness… Continue reading The happiness equation

Midpoint review of M6 competition – results

As a quick follow-up to my last post on the midpoint review of M6 competition, I have looked into the actual performance statistics of my entries in the first half of the competition. The results are suprising in a few aspects : The results exhibit huge fluctuations from month to month. (Perhaps partially reflecting the… Continue reading Midpoint review of M6 competition – results