<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>General &#8211; Solr.pl</title>
	<atom:link href="https://solr.pl/en/category/general/feed/" rel="self" type="application/rss+xml" />
	<link>https://solr.pl/en/</link>
	<description>All things to be found - Blog related to Apache Solr &#38; Lucene projects - https://solr.apache.org</description>
	<lastBuildDate>Thu, 23 Jan 2025 19:33:38 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9</generator>
	<item>
		<title>Apache Solr 9.8.0</title>
		<link>https://solr.pl/en/2025/01/23/apache-solr-9-8-0/</link>
					<comments>https://solr.pl/en/2025/01/23/apache-solr-9-8-0/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Thu, 23 Jan 2025 19:33:38 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[release]]></category>
		<category><![CDATA[solr]]></category>
		<guid isPermaLink="false">https://solr.pl/?p=1398</guid>

					<description><![CDATA[It is a pleasure to inform you that the new version of the Solr search server has been released. It is the next release from the 9.x branch and it is numbered 9.8. Some of the changes introduced in Solr]]></description>
										<content:encoded><![CDATA[
<p>It is a pleasure to inform you that the new version of the Solr search server has been released. It is the next release from the 9.x branch and it is numbered <strong>9.8</strong>.</p>



<span id="more-1398"></span>



<p>Some of the changes introduced in Solr <strong>9.8</strong>:</p>



<ul class="wp-block-list">
<li>The Solr cross data center feature has graduated into a main Solr feature!</li>



<li>A given request can now be limited in the amount of memory it may use by means of the <em>memAllowed</em> parameter.</li>



<li>The <em>lib</em> tags in <em>solrconfig.xml</em> are now silently ignored unless the <em>SOLR_CONFIG_LIB_ENABLED</em> environment variable is set to <em>true</em>.</li>



<li>A new query parser called <em>knn_text_to_vector</em> was added, which allows text embeddings to be calculated using external LLMs.</li>
</ul>
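
<p>As a rough illustration of two of the changes above, the following Python sketch builds query URLs that use <em>memAllowed</em> and <em>knn_text_to_vector</em>. The host, the collection name <em>products</em>, the field name <em>vector</em>, the model name <em>a-model</em> and the limit value are all assumptions made for the example (we also assume that the limit is expressed in megabytes), so please verify them against the Solr Reference Guide before use:</p>

<pre class="brush:python">from urllib.parse import urlencode

# Hypothetical base URL and collection name, used only for illustration.
SOLR_SELECT = "http://localhost:8983/solr/products/select"


def mem_limited_query(query, mem_allowed):
    """Build a select URL that asks Solr to cap the memory used by the request.

    Assumption: mem_allowed is a number of megabytes.
    """
    params = {"q": query, "memAllowed": mem_allowed}
    return SOLR_SELECT + "?" + urlencode(params)


def text_to_vector_query(text, model, field, top_k):
    """Build a select URL that uses the knn_text_to_vector query parser."""
    local_params = "{!knn_text_to_vector model=%s f=%s topK=%d}" % (model, field, top_k)
    return SOLR_SELECT + "?" + urlencode({"q": local_params + text})


print(mem_limited_query("*:*", 50))
print(text_to_vector_query("hello world", "a-model", "vector", 10))</pre>

<p>Both functions only build the URL; sending the request and configuring the model on the Solr side are left out of this sketch.</p>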



<p>We encourage you to read the whole list of changes at: <a href="https://solr.apache.org/docs/9_8_0/changes/Changes.html">https://solr.apache.org/docs/9_8_0/changes/Changes.html</a>.</p>



<p>Apache Solr <strong>9.8</strong> can be downloaded from <a href="https://dlcdn.apache.org/solr/">https://dlcdn.apache.org/solr/</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2025/01/23/apache-solr-9-8-0/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Solr Cookbook Third Edition for less than 6 euro</title>
		<link>https://solr.pl/en/2015/12/18/solr-cookbook-third-edition-for-less-than-6-euro/</link>
					<comments>https://solr.pl/en/2015/12/18/solr-cookbook-third-edition-for-less-than-6-euro/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Fri, 18 Dec 2015 13:56:36 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Solr]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=877</guid>

					<description><![CDATA[A quick note for those of you who are interested in my latest Solr book, the Solr Cookbook Third Edition. Packt Publishing is offering the book for less than 6 euros from 17th December until the end of the]]></description>
										<content:encoded><![CDATA[<p>A quick note for those of you who are interested in my latest Solr book, the Solr Cookbook Third Edition. Packt Publishing is offering the book for less than 6 euros from 17th December until the end of the year. If you are interested, you can buy the book at the discounted price on the <a href="https://www.packtpub.com/big-data-and-business-intelligence/solr-cookbook-third-edition/?utm_source=RPRafalkuc&amp;utm_medium=referral&amp;utm_campaign=OME5D2015">Solr Cookbook Third Edition Sale</a> website.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2015/12/18/solr-cookbook-third-edition-for-less-than-6-euro/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Solr Cookbook, Third Edition</title>
		<link>https://solr.pl/en/2015/01/23/solr-cookbook-third-edition/</link>
					<comments>https://solr.pl/en/2015/01/23/solr-cookbook-third-edition/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Fri, 23 Jan 2015 13:49:20 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[book]]></category>
		<category><![CDATA[solr]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=851</guid>

					<description><![CDATA[As usual, when we don&#8217;t update solr.pl for a long time, it doesn&#8217;t mean that we are not doing anything. As during the previous period of silence, we were writing. This time, two years after the publication of the Apache Solr]]></description>
										<content:encoded><![CDATA[<p><img fetchpriority="high" decoding="async" class="alignleft wp-image-3496 size-medium" style="margin-left: 0px; margin-right: 20px; margin-bottom: 5px;" src="http://solr.pl/wp-content/uploads/2015/01/cookbook-third-248x300.png" alt="cookbook third" width="248" height="300">As usual, when we don&#8217;t update solr.pl for a long time, it doesn&#8217;t mean that we are not doing anything. As during the previous period of silence, we were writing. This time, two years after the publication of the Apache Solr 4 Cookbook, we are proud to announce that this Monday &#8211; 26.01.2015 &#8211; <strong>Solr Cookbook Third Edition</strong> will be published.</p>
<p><span id="more-851"></span></p>
<p>As with the previous edition of the cookbook, we took the time to rebuild the book: all recipes were updated, half of the previous content was thrown away and new content was added. The most important thing, in our minds, is that Solr Cookbook Third Edition covers both Solr 4.x (based on the newest 4.10.3 version of Solr) and Solr 5.0, which should be released very soon.</p>
<p>The book targets beginners and intermediate users working with Apache Solr. You&#8217;ll find recipes that should make your life easier when you take your first steps with Solr and when you encounter the common problems that intermediate users tend to struggle with. However, I don&#8217;t recommend the book to those of you who know everything about Solr &#8211; you may find parts of the book interesting, but this book is not aimed at you.</p>
<p>The list of chapters from the book is as follows:</p>
<ol>
<li><em>Apache Solr Configuration</em></li>
<li><em>Indexing Your Data</em></li>
<li><em>Analyzing Your Text Data</em></li>
<li><em>Querying Solr</em></li>
<li><em>Faceting</em></li>
<li><em>Improving Solr Performance</em></li>
<li><em>In the Cloud</em></li>
<li><em>Using Additional Solr Functionalities</em></li>
<li><em>Dealing with Problems</em></li>
<li><em>Real-life Situations</em></li>
</ol>
<p>More information about the book, along with a free chapter (which will be available after the official publication of the book), can be found on the Packt Publishing web page dedicated to the book &#8211; <a href="https://www.packtpub.com/big-data-and-business-intelligence/solr-cookbook-third-edition">https://www.packtpub.com/big-data-and-business-intelligence/solr-cookbook-third-edition</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2015/01/23/solr-cookbook-third-edition/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Win Elasticsearch Server second edition e-book</title>
		<link>https://solr.pl/en/2014/06/08/win-elasticsearch-server-second-edition-e-book/</link>
					<comments>https://solr.pl/en/2014/06/08/win-elasticsearch-server-second-edition-e-book/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Sun, 08 Jun 2014 13:20:11 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=636</guid>

					<description><![CDATA[Together with Packt Publishing we are giving away copies of our latest book &#8211; &#8220;Elasticsearch Server 2nd Edition&#8221;. Although it is not about Solr, it gives you a chance to learn about Elasticsearch and compare it to Solr. For the readers]]></description>
										<content:encoded><![CDATA[<p>Together with Packt Publishing we are giving away copies of our latest book &#8211; &#8220;<strong>Elasticsearch Server 2nd Edition</strong>&#8221;. Although it is not about Solr, it gives you a chance to learn about Elasticsearch and compare it to Solr. For the readers of solr.pl we will modify the competition question &#8211; &#8220;What unusual use case have you implemented using Apache Solr?&#8221;. You can also give an example of a functionality that can&#8217;t be achieved with Elasticsearch.</p>
<p><span id="more-636"></span></p>
<p>All the information about the competition can be found at <a href="http://elasticsearchserverbook.com/win-elasticsearch-server-second-editon-ebook/">http://elasticsearchserverbook.com/win-elasticsearch-server-second-editon-ebook/</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2014/06/08/win-elasticsearch-server-second-edition-e-book/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>SolrCloud &#8211; What happens when ZooKeeper fails?</title>
		<link>https://solr.pl/en/2013/12/02/solrcloud-what-happens-when-zookeeper-fails-2/</link>
					<comments>https://solr.pl/en/2013/12/02/solrcloud-what-happens-when-zookeeper-fails-2/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Mon, 02 Dec 2013 14:13:35 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<category><![CDATA[solr]]></category>
		<category><![CDATA[solrcloud]]></category>
		<category><![CDATA[zookeeper]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=620</guid>

					<description><![CDATA[One of the questions I tend to get is what happens to a SolrCloud cluster when ZooKeeper fails. Of course we are not talking about the failure of a single ZooKeeper instance, but about the whole ensemble being inaccessible and the quorum]]></description>
										<content:encoded><![CDATA[<p>One of the questions I tend to get is what happens to a SolrCloud cluster when ZooKeeper fails. Of course we are not talking about the failure of a single ZooKeeper instance, but about the whole <em>ensemble</em> being inaccessible and the <em>quorum</em> therefore not being present. Because the answer to this question is very easy to verify, I decided to write a simple blog post to show what happens when ZooKeeper fails.</p>
<p><span id="more-620"></span></p>
<h3>Test environment</h3>
<p>The test environment was very simple:</p>
<ul>
<li>A single virtual machine running the Linux operating system</li>
<li>A single instance of ZooKeeper (which will be suitable for our test)</li>
<li>Two Solr instances with a single collection deployed</li>
<li>Solr <a title="Apache Lucene and Solr 4.6" href="http://solr.pl/en/2013/11/24/apache-lucene-and-solr-4-6/">4.6</a></li>
</ul>
<p>In order to create our test collection I&#8217;ve uploaded the configuration to ZooKeeper and used the following command:
</p>
<pre class="brush:bash">curl 'http://localhost:8983/solr/admin/collections?action=CREATE&amp;name=collection1&amp;numShards=2&amp;replicationFactor=1'</pre>
<p>The cloud view of the example cluster was as follows:
</p>
<p style="text-align: center;"><a href="http://solr.pl/wp-content/uploads/2013/12/cloud_view.png"><img decoding="async" class="aligncenter  wp-image-3330" alt="cloud_view" src="http://solr.pl/wp-content/uploads/2013/12/cloud_view.png" width="495" height="44"></a></p>
<h3>Test data indexing</h3>
<p>The next step in our test will be indexing. We will index a few example documents that are provided with Solr in the <em>exampledocs</em> directory. The following commands were used to index the data:
</p>
<pre class="brush:bash">curl 'localhost:8983/solr/collection1/update?commit=true' --data-binary @mem.xml -H 'Content-type:application/xml'
curl 'localhost:8983/solr/collection1/update?commit=true' --data-binary @monitor.xml -H 'Content-type:application/xml'
curl 'localhost:8983/solr/collection1/update?commit=true' --data-binary @monitor2.xml -H 'Content-type:application/xml'</pre>
<p>After executing the above commands we get the following number of documents:</p>
<ul>
<li>The whole collection holds <strong>5</strong> documents</li>
<li>The shard located on the Solr instance running on port <strong>8983</strong> has <strong>1</strong> document</li>
<li>The shard located on the Solr instance running on port <strong>7983</strong> has <strong>4</strong> documents</li>
</ul>
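<p>One way to obtain such per-node counts is to ask each node only for its local results. The Python sketch below builds such requests; it assumes that both nodes run on <em>localhost</em> and relies on the <em>distrib=false</em> parameter to keep the query from being distributed (the exact commands used during the test are not shown in this post):</p>
<pre class="brush:python">from urllib.parse import urlencode

# Ports of the two Solr instances from the test environment.
NODE_PORTS = [8983, 7983]


def local_count_url(port, collection="collection1"):
    """Build a URL that counts the documents held by the node on the given port."""
    params = {
        "q": "*:*",
        "rows": 0,           # we only need numFound, not the documents
        "distrib": "false",  # do not fan the query out to other shards
    }
    return "http://localhost:%d/solr/%s/select?%s" % (port, collection, urlencode(params))


for port in NODE_PORTS:
    print(local_count_url(port))</pre>
<p>The <em>numFound</em> attribute of each response then gives the number of documents stored on that node.</p>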
<h3>Querying with ZooKeeper not present</h3>
<p>Now we go to the next step &#8211; we shut down our ZooKeeper instance and try to run a simple query by sending the following command:
</p>
<pre class="brush:bash">curl 'localhost:8983/solr/collection1/select?q=*:*&amp;indent=true'</pre>
<p>As a result we get the following response:
</p>
<pre class="brush:xml">&lt;?xml version="1.0" encoding="UTF-8"?&gt;
&lt;response&gt;
 &lt;lst name="responseHeader"&gt;
  &lt;int name="status"&gt;0&lt;/int&gt;
  &lt;int name="QTime"&gt;16&lt;/int&gt;
  &lt;lst name="params"&gt;
   &lt;str name="indent"&gt;true&lt;/str&gt;
   &lt;str name="q"&gt;*:*&lt;/str&gt;
  &lt;/lst&gt;
 &lt;/lst&gt;
&lt;result name="response" numFound="5" start="0" maxScore="1.0"&gt;
&lt;doc&gt;
 &lt;str name="id"&gt;TWINX2048-3200PRO&lt;/str&gt; 
 &lt;str name="name"&gt;CORSAIR  XMS 2GB (2 x 1GB) 184-Pin DDR SDRAM Unbuffered DDR 400 (PC 3200) Dual Channel Kit System Memory - Retail&lt;/str&gt;
 &lt;str name="manu"&gt;Corsair Microsystems Inc.&lt;/str&gt;
 &lt;str name="manu_id_s"&gt;corsair&lt;/str&gt;
 &lt;arr name="cat"&gt;
  &lt;str&gt;electronics&lt;/str&gt;
  &lt;str&gt;memory&lt;/str&gt;
 &lt;/arr&gt;
 &lt;arr name="features"&gt;
  &lt;str&gt;CAS latency 2,    2-3-3-6 timing, 2.75v, unbuffered, heat-spreader&lt;/str&gt;
 &lt;/arr&gt;
 &lt;float name="price"&gt;185.0&lt;/float&gt;
 &lt;str name="price_c"&gt;185,USD&lt;/str&gt;
 &lt;int name="popularity"&gt;5&lt;/int&gt;
 &lt;bool name="inStock"&gt;true&lt;/bool&gt;
 &lt;str name="store"&gt;37.7752,-122.4232&lt;/str&gt;
 &lt;date name="manufacturedate_dt"&gt;2006-02-13T15:26:37Z&lt;/date&gt;
 &lt;str name="payloads"&gt;electronics|6.0 memory|3.0&lt;/str&gt;
 &lt;long name="_version_"&gt;1453219034197655552&lt;/long&gt;
&lt;/doc&gt;
&lt;doc&gt;
 &lt;str name="id"&gt;VS1GB400C3&lt;/str&gt;
 &lt;str name="name"&gt;CORSAIR ValueSelect 1GB 184-Pin DDR SDRAM Unbuffered DDR 400 (PC 3200) System Memory - Retail&lt;/str&gt;
 &lt;str name="manu"&gt;Corsair Microsystems Inc.&lt;/str&gt;
 &lt;str name="manu_id_s"&gt;corsair&lt;/str&gt;
 &lt;arr name="cat"&gt;
  &lt;str&gt;electronics&lt;/str&gt;
  &lt;str&gt;memory&lt;/str&gt;
 &lt;/arr&gt;
 &lt;float name="price"&gt;74.99&lt;/float&gt;
 &lt;str name="price_c"&gt;74.99,USD&lt;/str&gt;
 &lt;int name="popularity"&gt;7&lt;/int&gt;
 &lt;bool name="inStock"&gt;true&lt;/bool&gt;
 &lt;str name="store"&gt;37.7752,-100.0232&lt;/str&gt;
 &lt;date name="manufacturedate_dt"&gt;2006-02-13T15:26:37Z&lt;/date&gt;
 &lt;str name="payloads"&gt;electronics|4.0 memory|2.0&lt;/str&gt;
 &lt;long name="_version_"&gt;1453219034252181504&lt;/long&gt;
&lt;/doc&gt;
&lt;doc&gt;
 &lt;str name="id"&gt;VDBDB1A16&lt;/str&gt;
 &lt;str name="name"&gt;A-DATA V-Series 1GB 184-Pin DDR SDRAM Unbuffered DDR 400 (PC 3200) System Memory - OEM&lt;/str&gt;
 &lt;str name="manu"&gt;A-DATA Technology Inc.&lt;/str&gt;
 &lt;str name="manu_id_s"&gt;corsair&lt;/str&gt;
 &lt;arr name="cat"&gt;
  &lt;str&gt;electronics&lt;/str&gt;
  &lt;str&gt;memory&lt;/str&gt;
 &lt;/arr&gt;
 &lt;arr name="features"&gt;
  &lt;str&gt;CAS latency 3,     2.7v&lt;/str&gt;
 &lt;/arr&gt;
 &lt;int name="popularity"&gt;0&lt;/int&gt;
 &lt;bool name="inStock"&gt;true&lt;/bool&gt;
 &lt;str name="store"&gt;45.18414,-93.88141&lt;/str&gt;
 &lt;date name="manufacturedate_dt"&gt;2006-02-13T15:26:37Z&lt;/date&gt;
 &lt;str name="payloads"&gt;electronics|0.9 memory|0.1&lt;/str&gt;
 &lt;long name="_version_"&gt;1453219034255327232&lt;/long&gt;
&lt;/doc&gt;
&lt;doc&gt;
 &lt;str name="id"&gt;3007WFP&lt;/str&gt;
 &lt;str name="name"&gt;Dell Widescreen UltraSharp 3007WFP&lt;/str&gt;
 &lt;str name="manu"&gt;Dell, Inc.&lt;/str&gt;
 &lt;str name="manu_id_s"&gt;dell&lt;/str&gt;
 &lt;arr name="cat"&gt;
  &lt;str&gt;electronics&lt;/str&gt;
  &lt;str&gt;monitor&lt;/str&gt;
 &lt;/arr&gt;
 &lt;arr name="features"&gt;
  &lt;str&gt;30" TFT active matrix LCD, 2560 x 1600, .25mm dot pitch, 700:1 contrast&lt;/str&gt;
 &lt;/arr&gt;
 &lt;str name="includes"&gt;USB cable&lt;/str&gt;
 &lt;float name="weight"&gt;401.6&lt;/float&gt;
 &lt;float name="price"&gt;2199.0&lt;/float&gt;
 &lt;str name="price_c"&gt;2199,USD&lt;/str&gt;
 &lt;int name="popularity"&gt;6&lt;/int&gt;
 &lt;bool name="inStock"&gt;true&lt;/bool&gt;
 &lt;str name="store"&gt;43.17614,-90.57341&lt;/str&gt;
 &lt;long name="_version_"&gt;1453219041357332480&lt;/long&gt;
&lt;/doc&gt;
&lt;doc&gt;
 &lt;str name="id"&gt;VA902B&lt;/str&gt;
 &lt;str name="name"&gt;ViewSonic VA902B - flat panel display - TFT - 19"&lt;/str&gt;
 &lt;str name="manu"&gt;ViewSonic Corp.&lt;/str&gt;
 &lt;str name="manu_id_s"&gt;viewsonic&lt;/str&gt;
 &lt;arr name="cat"&gt;
  &lt;str&gt;electronics&lt;/str&gt;
  &lt;str&gt;monitor&lt;/str&gt;
 &lt;/arr&gt;
 &lt;arr name="features"&gt;
  &lt;str&gt;19" TFT active matrix LCD, 8ms response time, 1280 x 1024 native resolution&lt;/str&gt;
 &lt;/arr&gt;
 &lt;float name="weight"&gt;190.4&lt;/float&gt;
 &lt;float name="price"&gt;279.95&lt;/float&gt;
 &lt;str name="price_c"&gt;279.95,USD&lt;/str&gt;
 &lt;int name="popularity"&gt;6&lt;/int&gt;
 &lt;bool name="inStock"&gt;true&lt;/bool&gt;
 &lt;str name="store"&gt;45.18814,-93.88541&lt;/str&gt;
 &lt;long name="_version_"&gt;1453219045997281280&lt;/long&gt;&lt;/doc&gt;
&lt;/result&gt;
&lt;/response&gt;</pre>
<p>As we can see, Solr responded correctly. This is because Solr already has the <em>clusterstate.json</em> file cached. Searching doesn&#8217;t require updating that file, so search should work, and, as we could see, it does.</p>
<h3>Indexing with failed ZooKeeper</h3>
<p>With our ZooKeeper instance still turned off, we try to run the following command:
</p>
<pre class="brush:bash">curl 'localhost:8983/solr/collection1/update?commit=true' --data-binary @hd.xml -H 'Content-type:application/xml'</pre>
<p>The above command should result in indexing the contents of the <em>hd.xml</em> file. After a noticeably longer period of time Solr responds with the following information:
</p>
<pre class="brush:xml">&lt;?xml version="1.0" encoding="UTF-8"?&gt;
&lt;response&gt;
&lt;lst name="responseHeader"&gt;&lt;int name="status"&gt;503&lt;/int&gt;&lt;int name="QTime"&gt;15096&lt;/int&gt;&lt;/lst&gt;&lt;lst name="error"&gt;&lt;str name="msg"&gt;Cannot talk to ZooKeeper - Updates are disabled.&lt;/str&gt;&lt;int name="code"&gt;503&lt;/int&gt;&lt;/lst&gt;
&lt;/response&gt;</pre>
<p>So, as you can see, we are not able to index data without a working ZooKeeper <em>ensemble</em>.</p>
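<p>A client application can detect this situation by inspecting the response. The following Python sketch assumes the update was sent with <em>wt=json</em>, so that the error shown above arrives as JSON; the helper name is ours and is not part of any Solr client library:</p>
<pre class="brush:python">import json

# The error from above as it would look with wt=json (an assumption,
# the test in this post used the XML response format).
SAMPLE_ERROR = json.dumps({
    "responseHeader": {"status": 503, "QTime": 15096},
    "error": {"msg": "Cannot talk to ZooKeeper - Updates are disabled.", "code": 503},
})


def updates_disabled(body):
    """Return True when Solr reports that it cannot accept updates."""
    response = json.loads(body)
    status = response.get("responseHeader", {}).get("status", 0)
    message = response.get("error", {}).get("msg", "")
    return status == 503 and "Updates are disabled" in message


print(updates_disabled(SAMPLE_ERROR))</pre>
<p>Checking both the status and the message keeps the helper from treating other 503 errors as a ZooKeeper outage.</p>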
<h3>Starting ZooKeeper again</h3>
<p>So let&#8217;s see what happens when we start our ZooKeeper instance again without restarting the Solr nodes. After starting ZooKeeper we run the same indexing command once again:
</p>
<pre class="brush:bash">curl 'localhost:8983/solr/collection1/update?commit=true' --data-binary @hd.xml -H 'Content-type:application/xml'</pre>
<p>And this time the response is different:
</p>
<pre class="brush:xml">&lt;?xml version="1.0" encoding="UTF-8"?&gt;
&lt;response&gt;
&lt;lst name="responseHeader"&gt;&lt;int name="status"&gt;0&lt;/int&gt;&lt;int name="QTime"&gt;118&lt;/int&gt;&lt;/lst&gt;
&lt;/response&gt;</pre>
<p>As we can see, the indexing request was successful this time. This allows us to assume that Solr re-established its connection to ZooKeeper. This can also be seen in the Solr and ZooKeeper logs.</p>
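<p>Since Solr recovers on its own once ZooKeeper is back, an indexing client only needs to retry failed updates. A minimal Python sketch of such a retry loop follows; the <em>send_update</em> callable, the number of attempts and the delay are hypothetical and stand in for whatever HTTP client and retry policy your application uses:</p>
<pre class="brush:python">import time


def index_with_retry(send_update, attempts=5, delay_seconds=1.0, sleep=time.sleep):
    """Call send_update() until it returns status 0 or the attempts run out.

    send_update is a caller-supplied function returning the Solr response
    status (0 on success, 503 while ZooKeeper is unavailable).
    """
    status = None
    for attempt in range(attempts):
        status = send_update()
        if status == 0:
            return status
        if attempt != attempts - 1:
            sleep(delay_seconds)  # wait before the next try
    return status


# Simulated responses: two failures while ZooKeeper is down, then success.
responses = iter([503, 503, 0])
print(index_with_retry(lambda: next(responses), sleep=lambda seconds: None))</pre>
<p>The function returns the last status it saw, so the caller can decide what to do when ZooKeeper stays down for longer than the retry window.</p>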
<h3>Short summary</h3>
<p>As you can see, our short test showed what happens when our ZooKeeper <em>ensemble</em> fails and what we can expect from Solr in such rare cases. I hope this blog entry will clear up some doubts about SolrCloud and its usefulness.</p>
<p>Please also remember that during the test the cluster state did not change &#8211; all shards were accessible and working. In the next blog entry about SolrCloud we will see what happens when shards or replicas fail while ZooKeeper is down.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2013/12/02/solrcloud-what-happens-when-zookeeper-fails-2/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Random documents from result set (Giveaway results !)</title>
		<link>https://solr.pl/en/2013/04/02/random-documents-from-result-set-giveaway-results/</link>
					<comments>https://solr.pl/en/2013/04/02/random-documents-from-result-set-giveaway-results/#respond</comments>
		
		<dc:creator><![CDATA[Marek Rogoziński]]></dc:creator>
		<pubDate>Tue, 02 Apr 2013 11:56:24 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=546</guid>

					<description><![CDATA[And now two birds with a single stone &#8211; a new article and the Apache Solr 4 Cookbook giveaway results. In this article we would like to show you how to implement random ordering of documents in the result set using]]></description>
										<content:encoded><![CDATA[<p>And now two birds with a single stone &#8211; a new article and the <a href="http://solr.pl/en/solr-4-0-cookbook-2/">Apache Solr 4 Cookbook</a> <a href="http://solr.pl/en/2013/03/15/win-free-copies-of-packts-new-book-on-apache-solr/">giveaway</a> results. In this article we would like to show you how to implement random ordering of documents in the result set using Apache Solr. Our example is a real-life scenario &#8211; we&#8217;ve used it to draw two <a href="http://solr.pl/en/2013/03/15/win-free-copies-of-packts-new-book-on-apache-solr/">giveaway</a> participants. The two comment authors at the top of the result set will receive the&nbsp;ebook.</p>
<p><span id="more-546"></span></p>
<h2>Documents</h2>
<p>Our documents contain information about the participants of the competition &#8211; their <em>id</em>, name (as the <em>author</em> field) and <em>email</em>. For example, one record looks like this:
</p>
<pre class="brush:xml">&lt;doc&gt;
  &lt;field name="id"&gt;1&lt;/field&gt;
  &lt;field name="author"&gt;Solr.pl author&lt;/field&gt;
  &lt;field name="email"&gt;blog(at)solr.pl&lt;/field&gt;
&lt;/doc&gt;</pre>
<p>Our very big data set contains <strong>19</strong> records, so maybe we should have used map/reduce? :)</p>
<h2>Schema</h2>
<p>The <em>schema.xml</em> file describing the structure of the index is also very simple. In our case it contains the following fields:
</p>
<pre class="brush:xml">&lt;field name="id" type="string" indexed="true" stored="true" required="true" multiValued="false" /&gt;
&lt;field name="author" type="text_general" indexed="true" stored="true"/&gt;
&lt;field name="email" type="text_general" indexed="true" stored="true"/&gt;</pre>
<h2>Additional configuration</h2>
<p>Now we need to make sure that the <em>schema.xml</em> file contains the following field type and dynamic field definitions:
</p>
<pre class="brush:xml">&lt;fieldType name="random" class="solr.RandomSortField" indexed="true" /&gt;
&lt;dynamicField name="random_*" type="random" /&gt;</pre>
<p>In the example <em>schema.xml</em> file provided with the standard Solr distribution package this type and dynamic field are available by default. We will need them to randomize our result set.</p>
<h2>Running a query with random sorting</h2>
<p>Running a query with random sorting is a little bit tricky. We build the query as we usually do, except for the sorting. For the&nbsp;<em>sort</em> parameter we use the previously defined dynamic field with the <em>random_</em> prefix. For example:
</p>
<pre class="brush:bash">localhost:8983/solr/competition/select?q=*:*&amp;sort=random_12939291%20desc</pre>
<h2>How does it work?</h2>
<p>Solr calculates the ordering of the documents based on the name of the random field and the index version. This means that every time you use the same field name and the same index (unchanged between queries), you get results ordered in exactly the same way. This is a disadvantage of this method, but it can also be quite handy, for example when paging (we don&#8217;t want a different ordering of results for each page, right?). Because of this, the application that sends queries to Solr has to generate the field name.</p>
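<p>A minimal Python sketch of that idea: the application draws a seed once, keeps it for the whole paging session (which works as long as the index does not change) and draws a new one only when a fresh ordering is wanted. The function names, the seed range and the page size are our own choices for the example:</p>
<pre class="brush:python">import random
from urllib.parse import urlencode


def new_seed():
    """Draw a seed that becomes part of the random_* field name."""
    return random.randrange(1, 10 ** 9)


def random_page_params(seed, page, rows=10):
    """Build query parameters for one page of a randomly ordered result set."""
    return {
        "q": "*:*",
        "sort": "random_%d desc" % seed,  # same seed means same ordering
        "start": page * rows,
        "rows": rows,
    }


seed = new_seed()  # keep this value for the whole paging session
for page in range(2):
    print(urlencode(random_page_params(seed, page)))</pre>
<p>Both printed query strings use the same <em>random_*</em> field name, so the second page continues the ordering started by the first one.</p>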
<h2>And now &#8211; Giveaway results!</h2>
<p>We&#8217;ve used the above-mentioned query. The number used in the sort field is absolutely random; it was chosen by saying: &#8220;Dad, tell me some random numbers&#8221; :). So the whole query we&#8217;ve used was:
</p>
<pre class="brush:bash">localhost:8983/solr/collection1/select?q=*:*&amp;indent=true&amp;rows=2&amp;sort=random_3721117253841%20desc</pre>
<p>The above query gave the following results:
</p>
<pre class="brush:xml">&lt;result name="response" numFound="19" start="0"&gt;
  &lt;doc&gt;
    &lt;str name="id"&gt;9&lt;/str&gt;
    &lt;str name="author"&gt;Rajeev Srivastava&lt;/str&gt;
    &lt;str name="email"&gt;[CENSORED]&lt;/str&gt;
    &lt;long name="_version_"&gt;1431017731370516481&lt;/long&gt;&lt;/doc&gt;
  &lt;doc&gt;
    &lt;str name="id"&gt;8&lt;/str&gt;
    &lt;str name="author"&gt;Evgeny&lt;/str&gt;
    &lt;str name="email"&gt;[CENSORED]&lt;/str&gt;
    &lt;long name="_version_"&gt;1431017731370516480&lt;/long&gt;&lt;/doc&gt;
&lt;/result&gt;</pre>
<h2>And the winners are</h2>
<ul>
<li><strong>Rajeev</strong></li>
<li><strong>Evgeny</strong></li>
</ul>
<p>Congratulations! We will contact you in the very near future with further information about how to receive your prizes. Once again, congratulations! Also, to all the other participants: thank you for taking part and for your comments!</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2013/04/02/random-documents-from-result-set-giveaway-results/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Win Free Copies of Packt’s new book on Apache Solr (updated)</title>
		<link>https://solr.pl/en/2013/03/15/win-free-copies-of-packts-new-book-on-apache-solr-updated/</link>
					<comments>https://solr.pl/en/2013/03/15/win-free-copies-of-packts-new-book-on-apache-solr-updated/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Fri, 15 Mar 2013 12:54:11 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=542</guid>

					<description><![CDATA[Readers will be pleased to know that we have teamed up with Packt Publishing to organize a giveaway of the Apache Solr 4 Cookbook. Two lucky winners will each receive a copy of the book (in eBook format). Keep reading to]]></description>
										<content:encoded><![CDATA[<p>Readers will be pleased to know that we have teamed up with <a href="http://www.packtpub.com/" target="_blank" rel="noopener noreferrer">Packt Publishing</a> to organize a giveaway of the <a href="http://www.packtpub.com/apache-solr-4-cookbook/book" target="_blank" rel="noopener noreferrer">Apache Solr 4 Cookbook</a>. Two lucky winners will each receive a copy of the book (in eBook format). Keep reading to find out how you can be one of the lucky winners.</p>
<p><span id="more-542"></span></p>
<p><a href="http://www.packtpub.com/apache-solr-4-cookbook/book" target="_blank" rel="noopener noreferrer"><img decoding="async" class=" wp-image-2683 alignleft" style="margin: 5px 20px; border: 0px none;" alt="cookbook_4_cover" src="http://solr.pl/wp-content/uploads/2013/01/cookbook_4_cover.png" width="140" height="170"></a></p>
<p>Let&#8217;s start with a little reminder about the book:</p>
<ul>
<li>Learn how to make Apache Solr search faster, more complete, and comprehensively scalable</li>
<li>Solve performance, setup, configuration, analysis, and query problems in no time</li>
<li>Get to grips with, and master, the new exciting features of Apache Solr 4</li>
</ul>
<p>Read more about this book and download a free <a href="http://www.packtpub.com/apache-solr-4-cookbook/book" target="_blank" rel="noopener noreferrer">Sample Chapter</a>.</p>
<h3>How to Enter?</h3>
<p>All you need to do is head on over to the book page (<a href="http://www.packtpub.com/apache-solr-4-cookbook/book" target="_blank" rel="noopener noreferrer">Apache Solr 4 Cookbook</a>), look through the product description of the book, and drop a line via the <strong>comments below this post to let us know what interests you the most about this book</strong>. It’s that simple.</p>
<p>Product Description: <a href="http://www.packtpub.com/apache-solr-4-cookbook/book" target="_blank" rel="noopener noreferrer">http://www.packtpub.com/apache-solr-4-cookbook/book</a></p>
<h3>Deadline</h3>
<p>The contest will close on <strong>28.03.2013</strong>. Winners will be contacted by email, so be sure to use your real email address when you comment!</p>
<h3>Who Will Win?</h3>
<p>The winners will be chosen randomly by the Solr.pl team from among the readers who entered the competition with an on-topic comment.</p>
<p>If you want to increase your chances of winning, write a short review of the book, based on the sample chapter, on Amazon.com and also forward the same review to bhavins@packtpub.com.</p>
<h3>Book Format</h3>
<p>The free copies will be provided in eBook format.</p>
<h3>Update</h3>
<p>The contest is now officially closed. Thank you to all the participants. The winners will be announced in a dedicated blog post right after Easter, on <strong>Tuesday 2nd of April</strong>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2013/03/15/win-free-copies-of-packts-new-book-on-apache-solr-updated/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>New Book: ElasticSearch Server!</title>
		<link>https://solr.pl/en/2013/01/29/new-book-elasticsearch-server/</link>
					<comments>https://solr.pl/en/2013/01/29/new-book-elasticsearch-server/#respond</comments>
		
		<dc:creator><![CDATA[Marek Rogoziński]]></dc:creator>
		<pubDate>Tue, 29 Jan 2013 11:23:12 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=532</guid>

					<description><![CDATA[In the blog post dedicated to Solr 4.0 Cookbook we gave a small hint that the cookbook was not the only project occupying our free time. Today we can officially say that a few months of hard work are slowly]]></description>
										<content:encoded><![CDATA[<p><a href="http://solr.pl/wp-content/uploads/2013/01/es_cover.png"><img loading="lazy" decoding="async" class="size-full wp-image-2758 alignleft" style="margin: 10px;" alt="es_cover" src="http://solr.pl/wp-content/uploads/2013/01/es_cover.png" width="175" height="213"></a>In the blog <a href="http://solr.pl/en/2012/12/11/solr-4-0-cookbook/">post</a> dedicated to <a href="http://solr.pl/en/solr-4-0-cookbook-2/">Solr 4.0 Cookbook</a> we gave a small hint that the cookbook was not the only project occupying our free time. Today we can officially say that a few months of hard work are slowly coming to an end &#8211; we can announce a new book about one of the greatest pieces of open-source software &#8211; the <a href="http://solr.pl/en/elasticsearch-server/">ElasticSearch Server</a> book!</p>
<p><span id="more-532"></span></p>
<p>The ElasticSearch Server book describes the most important and commonly used features of ElasticSearch (at least from our perspective). Examples of the topics discussed:</p>
<ul>
<li>ElasticSearch installation and configuration</li>
<li>Static and dynamic index structure creation</li>
<li>Querying ElasticSearch with Query DSL explained</li>
<li>Using filters</li>
<li>Faceting</li>
<li>Routing</li>
<li>Indexing data that is not flat</li>
</ul>
<p>We also talk about:</p>
<ul>
<li>Autocomplete and how to implement it using ElasticSearch</li>
<li>Percolator &#8211; what it is and how to use it</li>
<li>ElasticSearch monitoring and being a fireman</li>
<li>And much, much more <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /></li>
</ul>
<p>Even though we work with ElasticSearch every day, we only realized how big it is after digging into all of its functionality and use cases. Because of that we were not able to describe all the features, but we hope we chose the ones that are the most interesting and most needed.</p>
<p>Do you think that we should start writing about a new enterprise search engine on solr.pl? <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2013/01/29/new-book-elasticsearch-server/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Solr 4.0 Cookbook</title>
		<link>https://solr.pl/en/2012/12/11/solr-4-0-cookbook/</link>
					<comments>https://solr.pl/en/2012/12/11/solr-4-0-cookbook/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Tue, 11 Dec 2012 11:19:09 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Solr]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=523</guid>

					<description><![CDATA[Because it has been rather quiet on solr.pl lately, we would like to show you one of the reasons for this. We are pleased to inform you that the updated version of the cookbook &#8211; &#8220;Solr 4.0]]></description>
										<content:encoded><![CDATA[<p><a href="http://solr.pl/wp-content/uploads/2012/12/Solr-4.0-Cookbook.jpg"><img loading="lazy" decoding="async" class="alignleft  wp-image-2615" style="margin-left: 20px; margin-right: 20px;" title="Solr 4.0 Cookbook" src="http://solr.pl/wp-content/uploads/2012/12/Solr-4.0-Cookbook-243x300.jpg" alt="" width="170" height="210"></a></p>
<p><span id="more-523"></span></p>
<p>Because it has been rather quiet on <a href="http://solr.pl">solr.pl</a> lately, we would like to show you one of the reasons for this.</p>
<p>We are pleased to inform you that the updated version of the cookbook &#8211; &#8220;<strong>Solr 4.0 Cookbook</strong>&#8221; &#8211; will be available in March 2013. The book focuses on the latest available version of the Solr server &#8211; 4.0. As before, the content is divided into ten thematic chapters and the book again follows the cookbook convention, which means that each recipe focuses on solving one particular problem.</p>
<p>From our point of view it&#8217;s worth noting that most of the book was rewritten. More than half of the old recipes were removed and replaced with new ones, for example ones describing Solr Cloud.</p>
<p>If you are interested, please refer to the Packt Publishing page: <a href="http://www.packtpub.com/apache-solr-4-cookbook/book" target="_blank" rel="noopener noreferrer">http://www.packtpub.com/apache-solr-4-cookbook/book</a>. More details soon.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2012/12/11/solr-4-0-cookbook/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Rich documents processing &#8211; on the search or application side?</title>
		<link>https://solr.pl/en/2012/06/11/rich-documents-processing-on-the-search-or-application-side/</link>
					<comments>https://solr.pl/en/2012/06/11/rich-documents-processing-on-the-search-or-application-side/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Mon, 11 Jun 2012 21:48:05 +0000</pubDate>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Solr]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=462</guid>

					<description><![CDATA[When indexing so-called &#8220;rich documents&#8221; we should sometimes think about where we want those documents to be processed &#8211; should we send them to Apache Solr (or another search engine, like ElasticSearch) and forget about them, or whether we]]></description>
										<content:encoded><![CDATA[<p>When indexing so-called &#8220;rich documents&#8221; we should sometimes think about where we want those documents to be processed &#8211; should we send them to Apache Solr (or another search engine, like ElasticSearch) and forget about them, or should we run Apache Tika ourselves before sending the document, and send the extracted content along with the other information for indexing?</p>
<p><span id="more-462"></span></p>
<h3>Options</h3>
<p>As I wrote a few lines above, we have two options. The first one is to send the binaries to the search engine and, in the case of Solr, use the ExtractingRequestHandler (information about integrating Solr with Apache Tika can be found <a href="http://solr.pl/en/2011/03/21/solr-and-tika-integration-part-1-basics/" target="_blank" rel="noopener noreferrer">here</a>), which will do all the work for us. The second option is to use the same (or almost the same) functionality to parse the binary documents and get their contents before sending them to Solr. Of course there is a third option, not possible in most cases &#8211; get the documents you want to index in a format understandable by Solr <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<h3>Processing on the Search Server Side</h3>
<p>The simplest approach is to process your &#8220;rich documents&#8221; on the search server side. Let&#8217;s assume it&#8217;s Apache Solr. We configure the ExtractingRequestHandler the way we want it to work and forget about everything else. But it&#8217;s not always the right approach. Imagine a situation where your indexing server is almost 100% utilized. If you added another source of load, you would probably suffer from performance problems. In such cases you will probably want to do it the other way.</p>
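<p>To make the server-side option more concrete, here is a minimal sketch of what the application has to do: it only wraps the raw bytes in an HTTP request and lets Solr do the extraction. The host, the handler path /update/extract, the document identifier and the payload are assumptions made for illustration and depend on your own Solr setup; the request is only built, not sent.</p>

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_extract_request(update_url, doc_id, payload, content_type):
    """Build (but do not send) a POST that hands a binary file to Solr."""
    # literal.id sets the id field directly; commit=true makes the document
    # visible right away (convenient for a demo, costly for bulk indexing).
    params = urlencode({"literal.id": doc_id, "commit": "true"})
    return Request(
        update_url + "?" + params,
        data=payload,
        headers={"Content-Type": content_type},
        method="POST",
    )


# Hypothetical local Solr instance and a made-up PDF payload.
request = build_extract_request(
    "http://localhost:8983/solr/update/extract",
    "doc1",
    b"%PDF-1.4 example bytes",
    "application/pdf",
)
print(request.full_url)
```

<p>All of the parsing cost lands on the Solr machine here, which is exactly why this option stops being attractive once the indexing server is already busy.</p>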
<h3>Processing Outside of the Search Server</h3>
<p>If the number of rich documents is huge or your indexing server is almost completely utilized, then it may be a good idea to process your binary files before sending them to your indexing server. Using Apache Tika, for example, we are able to build (quite easily) a good and reliable solution to process rich documents in our application. Of course such an approach requires a bit of knowledge about Java (or any other language you will use for content extraction). It can save us from a situation where our indexing server is overloaded and, because of the amount of data, we can&#8217;t do anything about it.</p>
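<p>A rough sketch of the application-side flow could look like the one below. It is not the Java and Tika library setup mentioned above: to keep the example short it assumes a separately running Tika Server reachable over HTTP, and it assumes that Solr accepts JSON documents on its update handler. The URLs, the content field name and the sample text are all hypothetical, and the requests are only built, not sent.</p>

```python
import json
from urllib.request import Request


def build_tika_request(tika_url, payload):
    """Build (but do not send) a PUT asking Tika Server for plain text."""
    return Request(
        tika_url,
        data=payload,
        headers={"Accept": "text/plain"},
        method="PUT",
    )


def build_index_request(update_url, doc_id, text):
    """Build (but do not send) a POST with the extracted text as JSON."""
    # "content" is an assumed field name; use whatever your schema defines.
    body = json.dumps([{"id": doc_id, "content": text}]).encode("utf-8")
    return Request(
        update_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Step 1: the application asks Tika for the text (hypothetical local server).
tika_request = build_tika_request("http://localhost:9998/tika", b"raw file bytes")

# Step 2: in a real run this string would be the body of Tika's response.
extracted_text = "Text extracted from the rich document"

# Step 3: only the plain text travels to the search server.
index_request = build_index_request(
    "http://localhost:8983/solr/update", "doc1", extracted_text
)
print(index_request.data.decode("utf-8"))
```

<p>The point of the split is that the expensive parsing happens on a machine you can scale independently, while the search server only receives small, already extracted text.</p>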
<h3>A Few Words at the End</h3>
<p>Once every few weeks we will be publishing posts that don&#8217;t cover a specific Apache Solr feature, but instead discuss a general search problem or describe the architecture of a system that has search as one of its parts. We hope that such posts will allow us and you to look at search topics a bit more widely than only from the Apache Solr point of view.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2012/06/11/rich-documents-processing-on-the-search-or-application-side/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
