The time has come to add another important feature to our car sale application: a spell checking mechanism that can construct a new query from its suggestions. Spell checking has become a standard feature of search engines, so we will make use of it as well.
Solr filters: PatternReplaceCharFilter
Continuing the overview of the filters included in Solr, today we look at the PatternReplaceCharFilter.
As you might guess, the task of this filter is to replace the parts of the input stream that match a given regular expression.
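The idea behind such a character filter can be sketched in a few lines of Python. This is only an illustration of regex replacement applied before tokenization, not Solr's implementation; the function names pattern_replace and tokenize and the sample input are hypothetical.

```python
import re
from typing import List


def pattern_replace(text: str, pattern: str, replacement: str) -> str:
    """Replace every match of `pattern` in the raw text (hypothetical helper)."""
    return re.sub(pattern, replacement, text)


def tokenize(text: str) -> List[str]:
    """Very simple whitespace tokenizer, standing in for a real tokenizer."""
    return text.split()


# The replacement runs on the character stream first, the tokenizer second.
raw = "model: A-4, year: 2009"
cleaned = pattern_replace(raw, r"[-:,]", " ")
tokens = tokenize(cleaned)
print(tokens)
```

Because the replacement happens before tokenization, the tokenizer never sees the punctuation that the pattern removed.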
Solr filters: KeepWordFilter
This time I decided to look at one of the less common filters available in the standard distribution of Solr. The first one I picked up is a filter called KeepWordFilter.
Solr 3.1 Cookbook
Solr 3.1: JSON Update Handler
After the release of Solr 3.1 I decided to look into the extended list of formats through which we can update the indexes. Until now we had a choice of three formats for providing data: XML, CSV, and the so-called JavaBin. Solr 3.1 introduces a fourth format: JSON.
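As a rough sketch, a JSON update message that adds one document and commits could be built as shown here. The field names (id, make, city) are hypothetical and would have to exist in your schema. The structure with "add" and "commit" keys reflects my understanding of the Solr 3.1 JSON update format, so check it against the documentation for your version.

```python
import json

# Hypothetical document; the field names must match the Solr schema in use.
document = {"id": "1", "make": "Audi", "city": "Warsaw"}

# One "add" command followed by a "commit" command (assumed message layout).
update_message = {
    "add": {"doc": document},
    "commit": {},
}

payload = json.dumps(update_message)
print(payload)
```

The resulting string would be sent in the body of an HTTP POST to the JSON update handler with the Content-type header set to application/json.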
“Car sale application” – Unicode Collation, sorting text in a language-sensitive way (part 4)
In the third part of our “Car sale” application series we added some location data and information about the city related to every car. Shortly afterwards we added the possibility to sort using the city field by simply modifying the schema:
Indexing files like doc, pdf – Solr and Tika integration
In the previous article we gave basic information about how to enable the indexing of binary files, i.e. MS Word files, PDF files or LibreOffice files. Today we will do the same thing using the Data Import Handler. Since a new version of the Solr server (3.1) was released a few days ago, the following guidelines are based on this version. For the purpose of the article I used the “example” application, and all of the changes relate to this application.
Lucene and Solr 3.1
A few minutes ago Lucene and Solr committers published a new, stable version of the Lucene library and the Solr search engine, both numbered 3.1. There are numerous changes, but I’ll just mention some of them (following Grant Ingersoll’s announcement).
Solr 1.4: Local Params
Several months ago, during one of our projects, I tried to construct a query with optimal faceting. The problem was that we needed filters (fq) in the query, but at the same time we needed faceting that was not filtered. For a long time this was not possible in Solr and you had to make two queries. Now you can do it with a single query. Let’s meet LocalParams.
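A minimal sketch of such a request, assuming a hypothetical city field: the filter is tagged with the tag local param, and the facet excludes that tag with ex, so the facet counts are computed as if the filter were not applied. The code only builds the query string and does not contact a server.

```python
from urllib.parse import parse_qs, urlencode

# Request parameters; "city" and the value "Warsaw" are hypothetical.
params = [
    ("q", "*:*"),
    ("fq", "{!tag=city}city:Warsaw"),   # filter query, tagged as "city"
    ("facet", "true"),
    ("facet.field", "{!ex=city}city"),  # facet ignores filters tagged "city"
]

# URL-encode the parameters so the braces and colons are transmitted safely.
query_string = urlencode(params)
print(query_string)
```

The encoded string can be appended to the select handler URL of your Solr instance.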
Solr and Tika integration (part 1 – basics)
Indexing so-called “rich documents”, i.e. files like pdf, doc, rtf and so on (or binary files), has always required some additional work on the developer side, at least to extract the contents of the file and prepare it in a format understood by the search engine, in this case Solr. To minimize this work I decided to look at Apache Tika and the integration of this library with Solr.
