Thanks very much krub. Most of the time, bloom-filter is used in system that can (more or less) control the expect entries of data krub. In use case of my project, we limit maximum data entries in a chunk of data (1 chunk for one bloom instance). So we let library calculate the size of bloom e.g newBloom(maxNumEntries, false-positive-rate) . So we don't really monitor if false-positive rate is increase or not, we just control the max-entries inserted to the bloom instead.
Data ที่ผมใช้มากจาก Kaggle : Goodreads Book Reviews Data Set - www.kaggle.com/datasets/pypiahmad/goodreads-book-reviews1?select=goodreads_books.json
Thx a lot nice video krub, i have a question, How do we know if the bloomfilter space initially set is no longer enough?
Thanks very much krub. Most of the time, bloom-filter is used in system that can (more or less) control the expect entries of data krub. In use case of my project, we limit maximum data entries in a chunk of data (1 chunk for one bloom instance). So we let library calculate the size of bloom e.g newBloom(maxNumEntries, false-positive-rate) . So we don't really monitor if false-positive rate is increase or not, we just control the max-entries inserted to the bloom instead.
@@BBConfig Thank for your answer krub 🙏