Data Engineering
Posts about Data Engineering
I Turned on the Python 3.14 JIT in Production (Well, Staging). Here’s the Truth.
Well, I have to admit, I was a bit skeptical about this whole Python JIT thing at first. In my experience, “free performance” usually comes …
TF 2.18 & Keras: Real-World Performance Review
I finally bit the bullet last week. After ignoring the notification icons for two months, I upgraded our main training pipeline Learn about Keras updates.
Taipy: Why I Finally Ditched Streamlit for Production Apps
Taipy news: Well, I have a confession to make. For the last five years, I’ve been utterly hooked on “script-to-web” tools. You know the drill — write a …
Ray and PyTorch are finally under one roof. Good riddance to the anxiety.
Actually, I should clarify – I’ve been writing distributed training scripts for the better part of five years, and let me tell Learn about PyTorch news.
LlamaCloud’s Multimodal RAG: Finally, No More Glue Code
LlamaIndex news: Well, that’s not entirely accurate — I’ve actually been playing around with LlamaCloud for a while now. You know the drill.
Mojo in 2026: Is It Finally Time to Ditch Pure Python?
Actually, I still remember the noise when Mojo first dropped. It was mid-2023, and the promise was wild: Python syntax, C++ Learn about Mojo language.
Python’s Reference Counting Has Changed (And You Probably Missed It)
Well, that’s not entirely accurate — I actually spent most of last Tuesday staring at a flame graph that absolutely refused Learn about CPython internals.
Distributed Training Finally Stopped Making Me Cry (Mostly)
I still remember the first time I tried to shard a 70B parameter model across a cluster of GPUs. It was 2 AM, I was three Learn about PyTorch news.
Stop Rewriting Your Pandas Code for Spark. Seriously.
I looked at my terminal yesterday and saw the one error message that has haunted my entire career in data engineering. Learn about NumPy news.
NASA Just Paid to Fix NumPy’s Messy Parts. About Time.
I was staring at a flame graph at 11 p. m. last Tuesday, wondering why my seemingly simple data pipeline was eating RAM like Learn about NumPy news.
