Quit Emailing Yourself

Bloom Filters by Example

Bloom filters are efficient probabilistic data structures used to quickly determine if an element is part of a set, allowing for rapid membership queries with a trade-off for false positives. They utilize a bit vector and multiple hash functions, where the choice of hash functions and the size of the filter can be optimized based on the expected number of elements and acceptable false positive rates. The article also discusses various implementations and use cases of Bloom filters across different technologies.

Saved by tldr-importer · Last saved October 29, 2025 · 4 min read

+ bloom-filters + data-structures hashing ✓ + algorithms optimization ✓

Steinar H. Gunderson

Steinar H. Gunderson discusses modern perfect hashing techniques for mapping a predefined set of strings to integers, focusing on optimizing performance for small sets. He critiques existing methods, particularly the use of PEXT instructions, and shares a solution inspired by the chess community's approach to avoid collisions in string hashing. The article includes code examples demonstrating his methods for handling specific string lengths efficiently.

Saved by hn_user_5 · 2 others saved this · Last saved October 28, 2025 · 4 min read

hashing ✓ optimization ✓ + coding + algorithms

Links

Bloom Filters by Example

Steinar H. Gunderson