Lukas Valatka

https://valatka.dev (RSS)
visit blog
Wheel - a surprisingly good model package format
26 Jan 2025 | original ↗

I’ll give it a few years until MLflow dominates the model package format space, with alternatives like SageMaker models fading away, and sharing pure weights becoming an arcane art. But until that dominance is absolute, I’ve been thinking that there’s another quite obvious way to package models: just store them as wheels. Packaging == persisting...

Uv has a killer feature you should know about
12 Jan 2025 | original ↗

In my view, neither performance nor trying to be Python-aligned is what sets uv apart. Don’t get me wrong — try switching from uv to Poetry, and you’ll quickly notice how sluggish it (poetry) feels. uv goes extra miles to adhere to PEPs, and IMHO it’s the go-to package manager for Python these days. But these aren’t the features that surprised me...

Async I/O Is Not Enough
28 Dec 2024 | original ↗

For the past few months, I’ve been exploring Go. Having done quite a bit of grueling work shaving off milliseconds from Python web apps, I’ve found Go to be incredible. You can schedule dirt cheap concurrent operations — simply by adding go in front of a function call — and achieve true parallelism across cores.

Let’s build a Bloom filter. Part 1 - A Space-Efficient Hash Set
19 Dec 2023 | original ↗

I have always been intrigued by Bloom filters. They are very similar to hash sets but somehow consume much less memory. Sometimes, they can yield false positives, creating a peculiar tradeoff.

A Feature Store, or Just a Warehouse Table?
1 Aug 2023 | original ↗

Recently a data platform vendor introduced us to their latest offering: a feature store module for machine learning projects. They highlighted the usual selling points, like reducing train-serve skew, feature backfilling, and enhancing feature documentation. However, one of our experienced senior data engineers remained unconvinced. The question...

Walrus Operator
28 Nov 2021 | original ↗

The Walrus operator, formally the assignment expressions, has been for some time around now in Python (since 3.8). I’ve grown fond of it :).

No Time To Plan
21 Nov 2021 | original ↗

The other day I had a chat with a local Karate dojo owner. I am not exactly sure how it all began, but we drifted towards discussing my sensei. He is a busy bee.

↑ These items are from RSS. Visit the blog itself at https://valatka.dev to find everything else and to appreciate author's digital home.