filter – Solr.pl

Developing Your Own Solr Filter part 2

Rafał Kuć Lucene, Solr 4 February 201312 November 2020filter 0 Comment

In the previous entry “Developing Your Own Solr Filter” we’ve shown how to implement a simple filter and how to use it in Apache Solr. Recently, one of our readers asked if we can extend the topic and show how to write more than a single token into the token stream. We decided to go for it and extend the previous blog entry about filter implementation.

Developing Your Own Solr Filter

Rafał Kuć Solr 14 May 201211 November 2020develop, filter, solr 0 Comment

Sometimes Lucene and Solr out of the box functionality is not enough. When such time comes, we need to extend what Lucene and Solr gives us and create our own plugin. In todays post I’ll try to show how to develop a custom filter and use it in Solr.

Use of cache=false and cost parameters

Rafał Kuć Solr 5 March 201211 November 2020cache, cache=false, cost, filter, fq 0 Comment

From the day Solr 3.4 was released its users got a nice feature which allows if the results of a filter query or query should be placed in cache. In addition to that we got the possibility to set filter query cost. Let’s see how to use those features.

Do I have to look for maxBooleanClauses when using filters ?

Rafał Kuć Solr 19 December 201111 November 2020bool, boolean, clause, filter, lucene, maxBooleanClauses, query, solr 0 Comment

One of the configuration variables we can find in the solrconfig.xml file is maxBooleanClauses, which specifies the maximum number of boolean clauses that can be combined in a single query. The question is, do I have to worry about it when using filters in Solr ? Let’s try to answer that question without getting into Lucene and Solr source code.

Optimization – document cache

Rafał Kuć Solr 29 August 201111 November 2020cache, document, document cache, documentCache, filter, solr 0 Comment

A few months ago (here) we looked at filterCache. I’ve decided to update the optimization topic and take a look at the documentCache.

Solr filters: PatternReplaceCharFilter

Rafał Kuć Solr 9 May 201111 November 2020analysis, configuration, filter, filtering, solr 0 Comment

Continuing the overview of the filters included in Solr today we look at the PatternReplaceCharFilter.

As you might guess the task of the filter is to change the matching input stream parts that match the given regular expression.

Solr filters: KeepWordFilter

Rafał Kuć Solr 2 May 201111 November 2020filter, keep, keepwordfilter, solr, word 0 Comment

This time I decided to look at one of the unusual filters available in the standard distribution of Solr. The first one in my hands is a filter called KeepWordFilter.

Optimization – filter cache

Rafał Kuć Solr 7 February 201111 November 2020cache, caching, filter, filter cache, filterCache, filtering, solr 0 Comment

Today’s entry is dedicated to one type of cache in the Solr – filter cache. I will try to explain what it does, how to configure it and how to use it in an efficient way.

Faceting, filters elimination and how to use it ?

Rafał Kuć Solr 6 December 201010 November 2020filter, fq, local, local params, params, solr 0 Comment

During my everyday work, I have seen many repeated queries, to Solr, with only one filter difference. When I asked why – I got anserws that it was necessary to get the faceting results for various filters. If you are using Solr version 1.4 or later, my suggestion is to use local params – what is it and how to use – this post will attempt to answer both questions.

What is schema.xml?

Rafał Kuć Solr 16 August 201010 November 2020analysis, field, filter, schema, schema.xml, solr, token, tokenizer, type 0 Comment

One of the configuration files that describe each implementation Solr is schema.xml file. It describes one of the most important things of the implementation – the structure of the data index. The information contained in this file allow you to control how Solr behaves when indexing the data, or when making queries. Schema.xml is not only the very structure of the index, is also detailed information about data types that have a large influence on the behavior Solr, and usually are treated with neglect. This entry will try to bring some insight about schema.xml.