Tag: Hadoop

Intro to Apache Spark (slides)

Published by Arnon Rotem-Gal-Oz on December 16, 2020

I gave a general overview of Apache Spark to our R&D teams. You can find the slides below.

Big data isn’t – well, almost

Published by Arnon Rotem-Gal-Oz on March 23, 2019

Back in ancient history (2004), Google’s Jeff Dean & Sanjay Ghemawat presented their innovative idea for dealing with huge data sets – a novel idea…

Hadoop and the OpenDataPlatform

Published by Arnon Rotem-Gal-Oz on February 17, 2015

Pivotal, IBM and Hortonworks announced today the “Open Data Platform” (ODP) – an attempt to standardize Hadoop. This move seems to be backed up by…

Random thoughts on big data

Published by Arnon Rotem-Gal-Oz on February 10, 2015

I began blogging in 2005; back then I managed to post something new almost every day. Now, 10 years later, I hardly post anything. I was…

Apache Spark, ETL and Parquet

Published by Arnon Rotem-Gal-Oz on September 14, 2014

(Edit 10/8/2015: A lot has changed in the last few months – you may want to check out my new post on Spark, Parquet…
