So far, in previous articles, we looked at the import data from SQL databases. Today it’s time to import from XML files.
Author: Marek Rogoziński
Indexing files like doc, pdf – Solr and Tika integration
In the previous article we have given basic information about how to enable the indexing of binary files, ie MS Word files, PDF files or LibreOffice files. Today we will do the same thing, using the Data Import Handler. Since a few days ago a new version of the Solr server (3.1) have been released, the following guidelines are based on this version. For the purpose of the article I used the “example” application – all of the changes relate to this application.
Data Import Handler & XML – nested entities
Data Import Handler is a very nice and powerful tool. The following entry is a description of the problem (and solutions) which I met recently.
Data Import Handler – sharding
Our reader (greetings!) reported us a problem with the cooperation of DIH and sharding mechanism. The Solr project wiki, in my opinion, discuss the solution to this issue, but makes it a little around and on the occasion.
Data Import Handler – How to import data from SQL databases (part 3)
The perfect search engine in three easy steps
If you own the site, or you are the person responsible for any service, with should have search engine, this entry tells you how to easily and quickly find the perfect solution, operating in every situation.
Data Import Handler – How to import data from SQL databases (part 2)
In the first part we were able to index the information contained in the database. In the second part we will try to extend the functionality by adding incremental imports.
Data Import Handler – How to import data from SQL databases (part 1)
In the article on how to import data (http://solr.pl/2010/09/06/solr-data-indexing-for-fun-and-profit/?lang=en) I mentioned the Data Import Handler (DIH). The main advantage of this method of data importing is no need for additional software development and the rapid integration of the data source. This second advantage, however, requires skill and practice. In this entry I’ll show you the basics of DIH integration with SQL data source.
Solr: data indexing for fun and profit
Solr is not very friendly to novice users. Preparing good schema file requires some experience. Assuming that we have prepared the configuration files, what remains for us is to share our data with the search server and take care of update ability.
Search Process
We hereby inaugurate part of solr.pl which is not related to a particular search engine but rather to the development and functioning of websites related to search.
Do you ever wondered what causes the search engine, on the site, to be considered good ? To answer this question we should consider how a typical process of finding desired information by the customer look like, and is there such a thing as a typical process.