The question I asked myself recently what seems to be one of those for which the response should be quick and painless. So, when to send the commit command to Solr (or Lucene)? Despite the simplicity of the questions, the answer is not clear, at least in my opinion.
Solr 3.1: FastVectorHighlighting
One of the many new features that Lucene and Solr 3.1 brings is FastVectorHighlighting – as the change notes say nothing less than the improved functionality of highlighting. Currently the highlighting mechanism is not too fast, sometimes it could kill your Solr instance when dealing with a large amount of data, or very long text fields. I thought that it is worthwhile to test the performance of the new functionality.
Lucene and Solr 3.2
On 03 June 2011 Lucene and Solr commiters published a new, stable version of Lucene library and Solr search engine – both numbered 3.2. You should not expect revolution in terms of new functionalities, but there are few changes worth the look.
Quick look: frange
In Solr 1.4 there were a new type of queries presented the frange queries. This new type of queries let you search for a range of values. According to the Solr developers this queries should be much faster from normal range queries. I thought that I should make a simple test to see how much faster, the new range queries can be expected to be.
“Car sale application” – SpellCheckComponent – did you really mean that ? (part 5)
The time has come to add another important functionality to our car sale application. It will be the spell checking mechanism with the ability to construct a new query from the suggestions. It has become the main functionality of every search engine so we will also make use of it.
Solr filters: PatternReplaceCharFilter
Continuing the overview of the filters included in Solr today we look at the PatternReplaceCharFilter.
As you might guess the task of the filter is to change the matching input stream parts that match the given regular expression.
Solr filters: KeepWordFilter
This time I decided to look at one of the unusual filters available in the standard distribution of Solr. The first one in my hands is a filter called KeepWordFilter.
Solr 3.1 Cookbook
Solr 3.1: JSON Update Handler
After the release of Solr 3.1 I decided to look into the extended list of formats through which we can update the indexes. Until now we had a choice of three kinds of formats with which we were able to provide data – XML, CSV, and so. called JavaBin. The release of Solr 3.1 introduces the fourth format – JSON.
“Car sale application” – Unicode Collation, sorting text in a language-sensitive way (part 4)
In the third part of our ”Car sale” application related posts we added some location data and the information about the city that is related to every car. Shortly afterwards we added the possibility to sort using the city field by simply modifying the schema:
