dih – Solr.pl

Data Import Handler – import from Solr XML files

Marek Rogoziński General 16 August 201111 November 2020apache solr, data import handler, dih, import 0 Comment

So far, in previous articles, we looked at the import data from SQL databases. Today it’s time to import from XML files.

Indexing files like doc, pdf – Solr and Tika integration

Marek Rogoziński Solr 4 April 201111 November 2020data import handler, dih 0 Comment

In the previous article we have given basic information about how to enable the indexing of binary files, ie MS Word files, PDF files or LibreOffice files. Today we will do the same thing, using the Data Import Handler. Since a few days ago a new version of the Solr server (3.1) have been released, the following guidelines are based on this version. For the purpose of the article I used the “example” application – all of the changes relate to this application.

Data Import Handler – removing data from index

Rafał Kuć Solr 3 January 201111 November 2020data import handler, databse, dih, integration 0 Comment

Deleting data from an index using DIH incremental indexing, on Solr wiki, is residually treated as something that works similarly to update the records. Similarly, in a previous article, I used this shortcut, the more that I have given an example of indexing wikipedia data that does not need to delete data.

Having at hand a sample data of the albums and performers, I decided to show my way of dealing with such cases. For simplicity and clarity, I assume that after the first import, the data can only decrease.

Data Import Handler – sharding

Marek Rogoziński Solr 27 December 201011 November 2020data import handler, database, dih, import, sharding 0 Comment

Our reader (greetings!) reported us a problem with the cooperation of DIH and sharding mechanism. The Solr project wiki, in my opinion, discuss the solution to this issue, but makes it a little around and on the occasion.

Data Import Handler – How to import data from SQL databases (part 3)

Marek Rogoziński Solr 22 November 201010 November 2020data import handler, dih, import, integration 0 Comment

In previous episodes (part 1 i part 2) we were able to import data from a database in a both wyas full and incremental. Today is the time for a short summary.