Rafał Kuć – Page 27

Quick look – Solritas

While observing Solr mailing lists we can spot a functionality called Solritas. Sounds strange ? What kind of functionality is it ? How we can use it ? To see the answers to these questions, I invite you to read the rest of the entry.

Quick look – FieldCollapsing

Rafał Kuć Solr 20 September 201010 November 20204.0, collapsing, field, fieldcollapsing, grouping, lucene, lucene 4.0, solr, solr 4.0 0 Comment

FieldCollapsing, or in other words grouping of search results has just been commited to the svn repository. I decided to take a look at this functionality and see how it works.

6 sins of solrconfig.xml modifications

Rafał Kuć Solr 13 September 201010 November 2020configuration, proper configuration, solr, solrconfig, solrconfig.xml 0 Comment

Solrconfig.xml file is another file that defines the behavior Solr. Unlike a file that describes the structure of the index file solrconfig.xml determines the functionality available in Solr. Just like in the case schema.xml file we can distinguish a number of standard mistakes made by those who implement Solr, and I’m not talking only about people who have little experience with Solr. In order to learn some of those mistakes I invite you to read the following entry.

5 sins of schema.xml modifications

Rafał Kuć Solr 30 August 201010 November 2020attribute, attributes, error, index, index structure, indexing, mistake, schema, schema.xml, solr, structure 0 Comment

I made a promise and here it is – the entry on the most common mistakes when designing Solr index, which is when You create or modify the schema.xml file for Your system implementation. Feel free to read on 😉

The scope of Solr faceting

Rafał Kuć Solr 23 August 201010 November 2020date faceting, facet, facet method, facet parameter, facet query, faceting, local, local params, params, query, range faceting, solr 0 Comment

Faceting is one of the ways to categorize the content found in the process of information retrieval. In case of Solr this is the division of set of documents on the basis of certain criteria: content of individual fields, queries or on the basis of compartments or dates. In today’s entry I will try to some scope on the possibility of using the faceting mechanism, both currently available in Solr 1.4.1, as well as what will be available in the future.

English rss feed available

Rafał Kuć General 19 August 201010 November 2020 0 Comment

Just a quick information. Recently many people asked about English rss feed. We patched one of our plug-ins and here it is – rss feed in English. If You want to subscribe, to English rss feed, go to the following URL: http://solr.pl/feed?lang=en.

What is schema.xml?

Rafał Kuć Solr 16 August 201010 November 2020analysis, field, filter, schema, schema.xml, solr, token, tokenizer, type 0 Comment

One of the configuration files that describe each implementation Solr is schema.xml file. It describes one of the most important things of the implementation – the structure of the data index. The information contained in this file allow you to control how Solr behaves when indexing the data, or when making queries. Schema.xml is not only the very structure of the index, is also detailed information about data types that have a large influence on the behavior Solr, and usually are treated with neglect. This entry will try to bring some insight about schema.xml.

6 deadly sins in the context of query

Rafał Kuć Solr 11 August 201010 November 2020facet, facet.limit, facet.offset, faceting, how to query, query, solr query 0 Comment

In my work related to Lucene and Solr I have seen various queries. While in the case of Lucene, developer usually knows what he/she wants to achieve and use more or less optimal solution, but when it comes to Solr it is not always like this. Solr is a product which could theoretically be used by everyone, both the person who knows Java, one that does not have a broad and specialized technical knowledge, as well as programmer. Precisely because of that Solr is a product which is easy to run and use it, at least when it comes to simple functionalities. I suppose, that is why not many people are worried about reading Solr wiki or at least review the mailing list. As a result, sooner or later people tend to make mistakes. Those errors arise from various shortcomings – lack of knowledge about Solr, lack of skills, lack of experience or simply a lack of time and tight deadlines. Today I would like to show some major mistakes when submitting queries to Solr and how to avoid those mistakes.

CSVResponseWriter

Rafał Kuć Solr 3 August 201010 November 2020csv, CSVResponseWriter, response, responsewriter, solr, writer 0 Comment

Solr recently received another small, but worth mentioning functionality – another response format available in standard distribution – CSV response format. I decided to write a short note about it.

Solr and PhraseQuery – phrase bonus in query stage

Rafał Kuć Solr 14 July 201010 November 2020boosting, dismax, edismax, lucene, phrase, phrase query, query, solr, standard 0 Comment

In the majority of system implementations I dealt with, sooner or later, there was a problem – search results tunning. One of the simplest ways to improve the search results quality was phrase boosting. Having the three most popular query parsers in Solr and the variety of parameters to control them I though it will be a good idea to check how they behave and how they affect performance.

Solr.pl

Author: Rafał Kuć