July 31, 2024
Full Text Search over Postgres: Elasticsearch vs. Alternatives
July 30, 2024
Faster backups with sharding
Delightful, production-grade replication for Postgres
This is an external post of mine. Click here if you are not redirected.
July 29, 2024
Building data pipelines with Vitess
Import Postgres tables into Tinybird with the PostgreSQL Table Function
A Deep Dive into German Strings
A Deep Dive into German Strings
“Strings are Everywhere”! At least according to a 2018 DBTest Paper from the Hyper team at Tableau. In fact, strings make up nearly half of the data processed at Tableau. This high prevalence undoubtedly applies to many other companies as well, as the paper’s dataset consists of data analyzed by Tableau’s users. The string-heavy nature of the data makes string processing one of the most important tasks of a database system.
Building Data Pipelines With Vitess
Import Postgres tables into Tinybird with the PostgreSQL Table Function
July 25, 2024
The Hidden Cost of Data Movement
The Hidden Cost of Data Movement
Recently, Mark Raasveldt of DuckDB wrote an excellent post about why memory management is crucial for efficient data processing. In his post, he focuses on the cost of having data on disk and moving it to memory. After all, everyone knows that having data in memory is what you want. As Jim Gray famously said in 2006:
Tape is Dead, Disk is Tape, Flash is Disk, RAM Locality is King