Solr.pl – Page 29 – All things to be found – Blog related to Apache Solr & Lucene projects

Data Import Handler – How to import data from SQL databases (part 1)

Marek Rogoziński | Solr | 11 October 2010 | 10 November 2020

In the article on how to import data (http://solr.pl/2010/09/06/solr-data-indexing-for-fun-and-profit/?lang=en) I mentioned the Data Import Handler (DIH). The main advantages of this import method are that it needs no additional software development and that a data source can be integrated quickly. The second advantage, however, requires skill and practice. In this entry I’ll show you the basics of integrating DIH with an SQL data source.

Quick look – IndexSorter

Rafał Kuć | Solr | 4 October 2010 | 10 November 2020 | index, index sorter, index sorting, indexsorter, lucene, solr, sorting

At the Apache Lucene EuroCon 2010 conference, which took place in May this year, Andrzej Białecki talked in his presentation about how to obtain satisfactory search results when using early termination search techniques. Unfortunately, the tool he mentioned was not available in Solr at the time, but that has changed.

Quick look – Solritas

Rafał Kuć | Solr | 27 September 2010 | 10 November 2020 | response, response writer, solr, solritas, template, velocity, velocity response writer, writer

While following the Solr mailing lists you may spot a feature called Solritas. Sounds strange? What kind of feature is it? How can we use it? To find the answers to these questions, I invite you to read the rest of the entry.

Quick look – FieldCollapsing

Rafał Kuć | Solr | 20 September 2010 | 10 November 2020 | 4.0, collapsing, field, fieldcollapsing, grouping, lucene, lucene 4.0, solr, solr 4.0

FieldCollapsing, or in other words grouping of search results, has just been committed to the SVN repository. I decided to take a look at this functionality and see how it works.
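To make the idea of grouping search results concrete, here is a minimal sketch in plain Python. It is not Solr's implementation and uses no Solr API: the `collapse` function, the `site` field and the sample documents are all hypothetical, chosen only to show how a ranked result list can be collapsed so that each group keeps its top few documents.

```python
def collapse(results, field, limit=1):
    """Group a ranked result list by `field`, keeping at most `limit` docs per group.

    `results` is assumed to be ordered by relevance already, so the first
    documents seen for each group value are the best-ranked ones.
    """
    groups = {}  # dicts preserve insertion order, so groups stay in rank order
    for doc in results:
        bucket = groups.setdefault(doc.get(field), [])
        if len(bucket) < limit:
            bucket.append(doc)
    return groups


# Hypothetical ranked results: two hits from site "a", one from site "b".
ranked = [
    {"id": 1, "site": "a"},
    {"id": 2, "site": "b"},
    {"id": 3, "site": "a"},
]

for value, docs in collapse(ranked, "site").items():
    print(value, [doc["id"] for doc in docs])
```

With `limit=1` each group value is represented by its single best-ranked document; raising `limit` keeps more documents per group.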

6 sins of solrconfig.xml modifications

Rafał Kuć | Solr | 13 September 2010 | 10 November 2020 | configuration, proper configuration, solr, solrconfig, solrconfig.xml

The solrconfig.xml file is another file that defines the behavior of Solr. Unlike schema.xml, which describes the structure of the index, solrconfig.xml determines the functionality available in Solr. Just as with schema.xml, we can distinguish a number of standard mistakes made by those who implement Solr, and I’m not talking only about people who have little experience with it. To learn about some of those mistakes, I invite you to read the following entry.

Solr: data indexing for fun and profit

Marek Rogoziński | Solr | 6 September 2010 | 10 November 2020 | acf, cell, import, indexing, lcf, tika

Solr is not very friendly to novice users. Preparing a good schema file requires some experience. Assuming that we have prepared the configuration files, what remains is to send our data to the search server and make sure it can be kept up to date.

5 sins of schema.xml modifications

Rafał Kuć | Solr | 30 August 2010 | 10 November 2020 | attribute, attributes, error, index, index structure, indexing, mistake, schema, schema.xml, solr, structure

I made a promise and here it is: the entry on the most common mistakes made when designing a Solr index, that is, when you create or modify the schema.xml file for your deployment. Feel free to read on 😉

The scope of Solr faceting

Rafał Kuć | Solr | 23 August 2010 | 10 November 2020 | date faceting, facet, facet method, facet parameter, facet query, faceting, local, local params, params, query, range faceting, solr

Faceting is one of the ways to categorize the content found in the process of information retrieval. In the case of Solr, it means dividing a set of documents on the basis of certain criteria: the content of individual fields, queries, or ranges of values or dates. In today’s entry I will try to give an overview of what the faceting mechanism can do, both what is currently available in Solr 1.4.1 and what will be available in the future.
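The three kinds of criteria mentioned above (field values, queries and ranges) can be illustrated with a small, self-contained Python sketch. It only mimics the counting that faceting performs and does not use Solr itself; the function names, the `category` and `price` fields and the sample documents are assumptions made for the illustration.

```python
from collections import Counter

# Hypothetical in-memory "index": each document is a dict of field values.
docs = [
    {"id": 1, "category": "book", "price": 12.0},
    {"id": 2, "category": "book", "price": 35.0},
    {"id": 3, "category": "music", "price": 9.5},
    {"id": 4, "category": "film", "price": 22.0},
]


def field_facet(documents, field):
    """Count documents per distinct value of `field`."""
    return Counter(doc[field] for doc in documents if field in doc)


def query_facet(documents, predicate):
    """Count documents matching an arbitrary condition (a stand-in for a query)."""
    return sum(1 for doc in documents if predicate(doc))


def range_facet(documents, field, start, end, gap):
    """Count documents per [lower, upper) bucket of width `gap` from start to end."""
    counts = {}
    lower = start
    while lower < end:
        upper = min(lower + gap, end)
        counts[(lower, upper)] = sum(
            1 for doc in documents if field in doc and lower <= doc[field] < upper
        )
        lower = upper
    return counts


print(field_facet(docs, "category"))
print(query_facet(docs, lambda doc: doc["price"] < 10))
print(range_facet(docs, "price", 0, 40, 20))
```

Each function returns counts only, which is the essence of a facet: the matching documents are summarized per bucket rather than returned.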

English rss feed available

Rafał Kuć | General | 19 August 2010 | 10 November 2020

Just a quick note. Recently many people have asked about an English RSS feed. We patched one of our plug-ins and here it is: an RSS feed in English. If you want to subscribe to the English RSS feed, go to the following URL: http://solr.pl/feed?lang=en.

What is schema.xml?

Rafał Kuć | Solr | 16 August 2010 | 10 November 2020 | analysis, field, filter, schema, schema.xml, solr, token, tokenizer, type

One of the configuration files that describe each Solr implementation is the schema.xml file. It describes one of the most important aspects of the implementation: the structure of the data index. The information contained in this file allows you to control how Solr behaves when indexing data or executing queries. Schema.xml is not only the structure of the index itself; it also holds detailed information about data types, which have a large influence on Solr's behavior and are usually neglected. This entry tries to give some insight into schema.xml.