Currently we do a text search to see whether the URL exists in our table. Most of the time this lookup is wasted, because the requested URL isn't listed in the database. We should use a bloom filter (https://benwendt.ca/articles/a-bloom-filter-in-c/) to determine whether a URL could be in the database, and only then run the SQL request. This way we won't waste processing power/memory on requests that we shouldn't do anything with.
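To make the proposal concrete, here is a minimal sketch of the idea in Python (not the project's own code). The names `BloomFilter`, `find_redirect`, and `query_database` are hypothetical, and the sizing uses the standard bloom filter formulas for a target false-positive rate. A "no" from the filter is definitive, while a "maybe" still has to be confirmed by the SQL lookup.

```python
import hashlib
import math


class BloomFilter:
    """Fixed-size bloom filter: no false negatives, some false positives."""

    def __init__(self, expected_items: int, false_positive_rate: float = 0.01):
        n = max(1, expected_items)
        # Standard sizing: m = -n * ln(p) / (ln 2)^2 bits, k = (m / n) * ln 2 hashes.
        self.size = max(8, math.ceil(-n * math.log(false_positive_rate) / math.log(2) ** 2))
        self.hash_count = max(1, round(self.size / n * math.log(2)))
        self.bits = bytearray((self.size + 7) // 8)

    def _positions(self, url: str):
        # Double hashing: derive all k positions from two halves of one digest.
        digest = hashlib.sha256(url.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        for i in range(self.hash_count):
            yield (h1 + i * h2) % self.size

    def add(self, url: str) -> None:
        for pos in self._positions(url):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, url: str) -> bool:
        return all(self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(url))


def find_redirect(url, bloom, query_database):
    """Only hit the database when the filter says the URL might be stored.

    `query_database` is a hypothetical callable standing in for the SQL lookup.
    """
    if not bloom.might_contain(url):
        return None  # definitely not a redirect, skip SQL entirely
    return query_database(url)  # possible match, SQL confirms or rejects
```

With the default 1% false-positive rate this needs roughly 9.6 bits per stored URL, which gives a starting point for the memory check.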
Things to take into account:
What to do with deletions. Do we just pull all redirects and recreate our bloom filter? Or do we use a more extensive bloom filter that keeps a count per position instead of a single bit (a counting bloom filter)?
The same applies to any changes to the URLs.
When do we initialize the bloom filter, and is it really worth it compared with going straight to SQL? (check memory)
How does this work with regex lookups? I assume we cannot do those with the bloom filter, so we should check whether a bloom filter is any use at all if we still do a regex lookup on every request.
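For the deletion and URL-change questions, one option is a counting bloom filter, which stores a small counter per position instead of a single bit so that entries can be removed again. The sketch below is in Python and only illustrates the technique; `CountingBloomFilter`, `rename`, and the default `size` and `hash_count` values are assumptions, not existing code.

```python
import hashlib


class CountingBloomFilter:
    """Bloom filter variant with per-position counters, so items can be removed."""

    def __init__(self, size: int = 10_000, hash_count: int = 7):
        self.size = size
        self.hash_count = hash_count
        # A plain list keeps the sketch simple; a real implementation would
        # pack small counters (for example 4 bits each) to save memory.
        self.counters = [0] * size

    def _positions(self, url: str):
        digest = hashlib.sha256(url.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        # De-duplicate so add() and remove() touch each counter exactly once.
        return sorted({(h1 + i * h2) % self.size for i in range(self.hash_count)})

    def add(self, url: str) -> None:
        for pos in self._positions(url):
            self.counters[pos] += 1

    def remove(self, url: str) -> bool:
        """Remove a previously added URL; returns False if it cannot be present."""
        positions = self._positions(url)
        if any(self.counters[pos] == 0 for pos in positions):
            return False
        for pos in positions:
            self.counters[pos] -= 1
        return True

    def might_contain(self, url: str) -> bool:
        return all(self.counters[pos] > 0 for pos in self._positions(url))

    def rename(self, old_url: str, new_url: str) -> None:
        # A changed URL is a removal of the old value plus an insert of the new one.
        if self.remove(old_url):
            self.add(new_url)
```

The trade-off is memory: each position costs several bits instead of one. Removing a URL that was never added can corrupt the counters, so `remove` should only be called for redirects known to be stored. A bloom filter of either kind only answers exact-membership questions, so regex rules cannot be checked through it and would still need their own lookup.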