bloomfilter.js/README.md

49 lines
1.5 KiB
Markdown
Raw Permalink Normal View History

2011-09-04 11:37:42 +02:00
Bloom Filter
============
This JavaScript bloom filter implementation uses the non-cryptographic
2011-09-04 19:30:37 +02:00
[FowlerNollVo hash function][1] for speed.
2011-12-08 00:09:26 +01:00
Usage
-----
var bloom = new BloomFilter(
32 * 256, // number of bits to allocate.
16 // number of hash functions.
);
// Add some elements to the filter.
bloom.add("foo");
bloom.add("bar");
// Test if an item is in our filter.
// Returns true if an item is probably in the set,
// or false if an item is definitely not in the set.
bloom.test("foo");
bloom.test("bar");
bloom.test("blah");
// Serialisation. Note that bloom.buckets may be a typed array,
// so we convert to a normal array first.
var array = [].slice.call(bloom.buckets),
json = JSON.stringify(array);
// Deserialisation. Note that the any array-like object is supported, but
// this will be used directly, so you may wish to use a typed array for
// performance.
var bloom = new BloomFilter(array, 3);
2011-12-08 00:09:26 +01:00
Implementation
--------------
Although the bloom filter requires *k* hash functions, we can simulate this
2011-09-04 19:30:37 +02:00
using only *two* hash functions. In fact, we cheat and get the second hash
function almost for free by iterating once more on the first hash using the FNV
hash algorithm.
Thanks to Will Fitzgerald for his [help and inspiration][2] with the hashing
optimisation.
2011-09-04 11:37:42 +02:00
[1]: http://isthe.com/chongo/tech/comp/fnv/
[2]: http://willwhim.wordpress.com/2011/09/03/producing-n-hash-functions-by-hashing-only-once/