Sunday, November 30, 2008

I have started to work with the HBase project on the implementation of "Bloom Filters". I feel this technology will be very useful to HEART.

http://pages.cs.wisc.edu/~cao/papers/summary-cache/node8.html

Currently, the Hbase implementation has problems, but there is some talk of a HFS implementation that may make the HBase code a moot point. Any comments?

Bruce Williams

5 comments:

Edward J. Yoon said...

AFAIK, Hadoop is incorporating the bloom filter code that is in HBase to make a
BloomFilterMapFile.

Andrew Purtell said...

HBase will fix bloom filters for release 0.20.0 (the next release is 0.19.0). See https://issues.apache.org/jira/browse/HBASE-553 . Bloom filter augmented map files have already been taken up by Hadoop for their 0.20.0 release. See https://issues.apache.org/jira/browse/HADOOP-3063 .

Ronald said...
This comment has been removed by the author.
Ronald said...

Sorry, converted the link,

this may be helpful at least for those unfamiliar with bloom filters:

Ryan said...

I'm currently working on bloom filters in HBase, if you have any contributions or other aspects I would suggest you join us on #hbase and have a conversation.