Skip to content
Python News | Practical Python Engineering

Python News | Practical Python Engineering

Python News covers applied Python development, libraries, and real-world engineering patterns.

site mode button
Subscribe

Data Engineering

Posts about Data Engineering

Inside Polars’ Streaming Engine: How Spillable Sinks Handle Larger-Than-RAM Joins
19 mins read

Inside Polars’ Streaming Engine: How Spillable Sinks Handle Larger-Than-RAM Joins

  • Data Engineering
April 24, 2026May 19, 2026 Riko IshikawaTagged CPython internals, DuckDB python, Free threading, GIL removal, Ibis framework, Marimo notebooks, Mojo language, Polars dataframe, Python JIT, Rust Python

If you search polars streaming engine spill today, the top result is a benchmark that OOM’d on Polars 1.6.0 in September 2024.

Read More
Inside Polars’ Lazy Query Engine: How Predicate Pushdown Beats Pandas
15 mins read

Inside Polars’ Lazy Query Engine: How Predicate Pushdown Beats Pandas

  • Data Engineering
April 23, 2026May 27, 2026 Priya SharmaTagged CPython internals, DuckDB python, Free threading, GIL removal, Ibis framework, Marimo notebooks, Mojo language, Polars dataframe, Python JIT, Rust Python

The pola.rs blog has a headline number that every Polars tutorial copies: the optimized lazy plan runs roughly 4x faster than the naive one on the NYC.

Read More
Step-by-Step Tutorial: Writing Python Extensions in Rust With PyO3
15 mins read

Step-by-Step Tutorial: Writing Python Extensions in Rust With PyO3

  • CPython Internals
April 6, 2026 Mateo VargasTagged CPython internals

I hit a massive performance wall last Tuesday. I was tasked with parsing a 50GB dataset of nested JSON logs for a cybersecurity client doing malware.

Read More
Marimo vs Jupyter Notebook: Which Python Environment is Best?
16 mins read

Marimo vs Jupyter Notebook: Which Python Environment is Best?

  • Data Engineering
April 6, 2026 Nia OkoroTagged PyArrow updates

I just spent three hours debugging a machine learning pipeline, only to realize I had executed cell 14 before cell 12.

Read More
How to Process Massive CSV Files With DuckDB and Python Fast
16 mins read

How to Process Massive CSV Files With DuckDB and Python Fast

  • Algorithmic Trading
April 4, 2026 Silas MontgomeryTagged Algo trading

A 60GB CSV file lands in your AWS S3 bucket. Your data pipeline triggers, spins up a standard EC2 instance, and attempts to run pandas.read_csv() .

Read More
I Dropped Pandas for Polars. Here’s What Broke.
6 mins read

I Dropped Pandas for Polars. Here’s What Broke.

  • Data Engineering
April 3, 2026 Riko IshikawaTagged Polars dataframe

It was 11 PM on a Thursday last month. My data pipeline running on a t3.xlarge AWS instance crashed for the fourth time that week. The culprit?

Read More
Mastering Gil Removal: Advanced Techniques and Best Practices for Modern Developers
15 mins read

Mastering Gil Removal: Advanced Techniques and Best Practices for Modern Developers

  • Data Engineering
April 1, 2026April 19, 2026 Riko IshikawaTagged Ibis framework

Introduction to Gil Removal In today’s rapidly evolving technological landscape, GIL removal has emerged as a critical skill for developers seeking to.

Read More
6 mins read

FastAPI on the Edge: Running Local LLMs on a Pi

  • AI/ML
March 17, 2026April 19, 2026 python_news_comTagged FastAPI news

I’m officially sick of renting $3/hour cloud GPUs just to parse text. For the last few weeks, I’ve been moving my background Learn about FastAPI news.

Read More
I Turned on the Python 3.14 JIT in Production (Well, Staging). Here’s the Truth.
6 mins read

I Turned on the Python 3.14 JIT in Production (Well, Staging). Here’s the Truth.

  • Data Engineering
February 25, 2026April 19, 2026 Silas MontgomeryTagged Python JIT

Well, I have to admit, I was a bit skeptical about this whole Python JIT thing at first. In my experience, “free performance” usually comes …

Read More
TF 2.18 & Keras: Real-World Performance Review
7 mins read

TF 2.18 & Keras: Real-World Performance Review

  • AI/ML
February 16, 2026April 19, 2026 Riko IshikawaTagged Keras updates

I finally bit the bullet last week. After ignoring the notification icons for two months, I upgraded our main training pipeline to TensorFlow 2.18.

Read More

Posts pagination

1 2 … 6 Next

Recent Posts

  • Django Async ORM Migration: What Breaks and When to Stay Sync
  • The Case for MicroPython Over C on Edge AI Devices
  • How Ibis Lowers One Expression to Three SQL Dialects
  • How MicroPython’s Garbage Collector Survives on 256KB
  • Inside Pydantic v2’s Core: How pydantic-core Compiles Schemas to

Recent Comments

No comments to show.

Archives

  • May 2026
  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025

Categories

  • Agent Development
  • AI Agents
  • AI Testing
  • AI/ML
  • Algorithmic Trading
  • Ansible
  • Api
  • API Development
  • Apple Ecosystem
  • Application Scripting
  • Architecture
  • Async
  • Async Programming
  • Asynchronous Programming
  • Asyncio
  • Automation
  • Backend Development
  • Bioinformatics
  • Blockchain Development
  • CircuitPython
  • Climate Modeling
  • Cloud Computing
  • Code Quality
  • Code-CAD
  • Community
  • Concurrency
  • Core Development
  • CPython Internals
  • Cryptography
  • Data
  • Data Engineering
  • Data Science
  • Data Validation
  • Data Visualization
  • Database
  • Deep Learning
  • Dependencies
  • Dependency Management
  • Deployment
  • Developer Community
  • Developer Tools
  • Development Tools
  • DevOps
  • Distributed Computing
  • Django
  • Edge AI
  • Edge Computing
  • Educational Technology
  • Embedded Systems
  • Encryption
  • FARM Stack
  • FastAPI
  • FinTech
  • Flask
  • Functional Programming
  • Game Development
  • Generative AI
  • Hardware Engineering
  • Hardware Simulation
  • Internet of Things
  • IoT
  • JIT Compilation
  • Knowledge Graphs
  • LangChain
  • Language Features
  • Language Internals
  • LlamaIndex
  • LLM Development
  • Machine Learning
  • Malware Analysis
  • Mathematical Modeling
  • Matplotlib
  • Memory Management
  • Microservices
  • MLOps
  • Mojo Language
  • Network Security
  • Open Source
  • Open Source Governance
  • Operating Systems
  • Optimization
  • Package Management
  • Packaging
  • Pandas
  • Parallel Processing
  • Performance
  • Performance Optimization
  • Pipeline
  • Poetry
  • Probabilistic Forecasting
  • Probabilistic Programming
  • Profiling
  • Programming
  • Pytest
  • Python
  • Python Asyncio
  • Python Development
  • Python Ecosystem
  • Python Encryption
  • Python Internals
  • Python News
  • Python Packaging
  • Python Profiling
  • Python Programming
  • Python Security
  • Python Tooling
  • Quantitative Finance
  • Quantum Computing
  • Red Teaming
  • Rest
  • Robotics
  • Scientific Computing
  • Security
  • Smart Contracts
  • Software Design Patterns
  • Software Development
  • Software Governance
  • Static Type Checking
  • Structured Concurrency
  • Supply Chain Security
  • Systems Programming
  • Tdd
  • Tech Ethics
  • Tensorflow
  • Test
  • Testing
  • Time Series Forecasting
  • UI Development
  • Web Development
  • Web Scraping
  • Web3 Integration
  • Web3.Py
  • WebAssembly
  • About
  • Contact
  • Django Async Orm Migration Effort
  • Ibis Deferred Expression Backends
  • Micropython Edge AI Embedded Deployment
  • Numpy Ufunc Dispatch Simd
  • Pdm Hatch Build
  • Privacy Policy
  • Pyarrow Table Pandas Dataframe
  • Terms of Service
News Express © 2026. All Rights Reserved.