idf – Solr.pl

Distributed IDF

Rafał Kuć Solr 20 May 201914 November 2020distributed, idf, relevancy, solr 0 Comment

When Lucene and Solr searches through the data, each document is assigned a score that is calculated on the basis of query terms statistics. When using SolrCloud and our data inside the collection is distributed among multiple shards we are hit by a problem of not exact inverse document frequency calculation. The problem can be defined in the following way – each shard stores the term statistics locally and doesn’t share that with other shards during query execution. Can we do something about it to have more precise IDF calculation? Let’s see what we can do about it.

We use cookies to personalise content and ads, to provide social media features and to analyse our traffic. We also share information about your use of our site with our social media, advertising and analytics partners. /home/aludstro/domains/solr.pl/public_html/wp-includes/link-template.php on line 409
https://solr.pl/en/distributed-idf/">View more

Cookies settings

Privacy & Cookie policy

Privacy & Cookies policy

Cookie name	Active
wp-wpml_current_language

Cookies settings