<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Conferences &#8211; Solr.pl</title>
	<atom:link href="https://solr.pl/en/category/conferences/feed/" rel="self" type="application/rss+xml" />
	<link>https://solr.pl/en/</link>
	<description>All things to be found - Blog related to Apache Solr &#38; Lucene projects - https://solr.apache.org</description>
	<lastBuildDate>Wed, 11 Nov 2020 20:59:00 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9</generator>
	<item>
		<title>Explain 0.9.1 &#8211; New version</title>
		<link>https://solr.pl/en/2012/01/11/explain-0-9-1-new-version/</link>
					<comments>https://solr.pl/en/2012/01/11/explain-0-9-1-new-version/#respond</comments>
		
		<dc:creator><![CDATA[Marek Rogoziński]]></dc:creator>
		<pubDate>Wed, 11 Jan 2012 20:58:24 +0000</pubDate>
				<category><![CDATA[Conferences]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=394</guid>

					<description><![CDATA[After a few weeks (counting Christmas, when people don&#8217;t usually analyze theirs Solr queries ;)) we would like to share some thoughts about releasing&#160;explain.solr.pl. We noticed the following: Explain tool have goaded some attention, but most to the explains are]]></description>
										<content:encoded><![CDATA[<p>After a few weeks (counting Christmas, when people don&#8217;t usually analyze theirs Solr queries ;)) we would like to share some thoughts about releasing&nbsp;<a title="Explain.solr.pl" href="http://explain.solr.pl/">explain.solr.pl</a>.</p>
<p><span id="more-394"></span></p>
<p>We noticed the following:</p>
<ul>
<li>Explain tool have goaded some attention, but most to the explains are marked as private.</li>
<li>In relatively large amount of explain only query was submit instead of the whose response.</li>
<li>Often the results don&#8217;t have diagnostic information (<em>debugQuery=true</em>).</li>
<li>Unreleased <a href="http://solr.pl/en/tag/4.0/">Apache Solr 4.0</a> is quite popular <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /></li>
<li>Explain parser did a good job (of course without counting the obvious Solr 4.0 explain parsing problems).</li>
</ul>
<div>Our thoughts:</div>
<div>
<ul>
<li>We need to work on clarity of the messages.</li>
<li>We need to focus on enabling <a href="http://solr.pl/en/tag/4.0/">Solr 4.0</a> explain parsing.</li>
</ul>
</div>
<p>Today new version of&nbsp;<a title="Explain.solr.pl" href="http://explain.solr.pl/">explain.solr.pl</a>&nbsp;was deployed. The changes are as follows:</p>
<ul>
<li>Simple analysis of query performance was added.</li>
<li>Displayed messages are now more user friendly.</li>
<li>Explain parser was corrected according to text files generated by Solr 3.5 unit test cases.</li>
<li>Minor GUI enhancements.</li>
</ul>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2012/01/11/explain-0-9-1-new-version/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Explain.solr.pl beta version available</title>
		<link>https://solr.pl/en/2011/12/23/explain-solr-pl-beta-version-available/</link>
					<comments>https://solr.pl/en/2011/12/23/explain-solr-pl-beta-version-available/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Fri, 23 Dec 2011 20:57:02 +0000</pubDate>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[debug]]></category>
		<category><![CDATA[debugQuery]]></category>
		<category><![CDATA[explain]]></category>
		<category><![CDATA[explain.solr.pl]]></category>
		<category><![CDATA[score]]></category>
		<category><![CDATA[scoring]]></category>
		<category><![CDATA[solr]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=390</guid>

					<description><![CDATA[We are pleased to announce, that http://explain.solr.pl is now available for users. Please remember that this is a beta version and will be further developed. If you want to know more details please read the following blog post. Design Currently]]></description>
										<content:encoded><![CDATA[<p>We are pleased to announce, that <a href="http://explain.solr.pl" target="_blank" rel="noopener noreferrer">http://explain.solr.pl</a> is now available for users. Please remember that this is a beta version and will be further developed. If you want to know more details please read the following blog post.</p>
<p><span id="more-390"></span></p>
<h3>Design</h3>
<p>Currently explain.solr.pl can analyze the query of 3.x version of Solr and it&#8217;s getting ready to be able to handle 4.0 version of Solr.</p>
<h3>Explain visualization</h3>
<p>In the current version <a href="http://explain.solr.pl">explain.solr.pl</a> visualizes <em>score</em> values of the given documents.</p>
<p>In order to add a new explain visualization you need to run your query with an additional <em>debugQuery=on</em> parameter and next paste the result returned by Solr to <a href="http://explain.solr.pl">explain.solr.pl</a> (if you will use web browser to get the results please remember to paste the page source). After hitting the &#8220;<em>Create explain</em>&#8221; button your explain visualization will be created and you will be taken to that visualization web page. Currently <a href="http://explain.solr.pl">explain.solr.pl</a> supports only results in XML format, so if you use another format (like json) please ommit the <em>wt</em> parameter or set it to<em> wt=xml</em>.</p>
<p>It is also possible to verify hits for any document, even one that was not found in the search results (given to Solr with the use of <em>explainOther</em> parameter). It can be useful to check why the given document wasn&#8217;t included in the result list.</p>
<h3>Available information</h3>
<p>After creating the new explain visualization you have a series of information available. Information about the query is shown ones for the whole explain, while the rest of the information is shown for each document.</p>
<h4>Information about query</h4>
<p><a href="http://solr.pl/wp-content/uploads/2011/12/explain.solr_.pl_top1.png"><img fetchpriority="high" decoding="async" class="alignnone size-full wp-image-1938" title="explain.solr.pl_top" src="http://solr.pl/wp-content/uploads/2011/12/explain.solr_.pl_top1.png" alt="" width="515" height="185"></a></p>
<p>In this part of <a href="http://explain.solr.pl">explain.solr.p</a>l you can see:</p>
<ol>
<li>Unique identifier of your explain visualization.</li>
<li>Information about <em>debugOther</em> parameter not being used.<em><br />
</em></li>
<li>The actual query.</li>
<li>Information about explain visualization being available to public.</li>
</ol>
<h4>Fields and values</h4>
<p><a href="http://solr.pl/wp-content/uploads/2011/12/explain.solr_.pl_fields.png"><img decoding="async" class="alignnone size-full wp-image-1939" title="explain.solr.pl_fields" src="http://solr.pl/wp-content/uploads/2011/12/explain.solr_.pl_fields.png" alt="" width="552" height="152"></a></p>
<p>In this part of the <a href="http://explain.solr.pl">explain.solr.pl</a> you can see information about fields and theirs values that were available in the search results.</p>
<h4>Visualization of elements influencing total score value</h4>
<p><a href="http://solr.pl/wp-content/uploads/2011/12/explain.solr_.pl_explain.png"><img decoding="async" class="alignnone size-full wp-image-1942" title="explain.solr.pl_explain" src="http://solr.pl/wp-content/uploads/2011/12/explain.solr_.pl_explain.png" alt="" width="500" height="332"></a></p>
<p>The following information is visible in this part of the <a href="http://explain.solr.pl">explain.solr.pl</a>:</p>
<ol>
<li>Percentage influence of each value on the global <em>score</em> value of the document.</li>
<li>Pie chart visualizing the data visible in the left column.</li>
</ol>
<h3>Getting back to once visualized explain</h3>
<p>If you wish to get back to your once visualised explain please to the following URL address: <em>http://explain.solr.pl/explains/EXPLAIN_ID</em>, where <em>EXPLAIN_ID</em> is the unique identifier of your explain. If you don&#8217;t remember unique identifier of your visualized explain, you can find it out in the &#8220;<em>history</em>&#8221; page.</p>
<h3>Hiding your explain</h3>
<p>If you don&#8217;t want your explain information to be available to public, please uncheck the &#8220;<em>I want this explain to be visible on history page</em>&#8221; checkbox during explain creation. Please remember that in order to get back to private explain you will need to remember its unique identifier.</p>
<h3>Issues raising</h3>
<p>To raise a new issue You found in <a href="http://explain.solr.pl">explain.solr.pl</a>, please create a new issue on <a href="https://github.com/solrpl/explain/issues?sort=created&amp;direction=desc&amp;state=open" target="_blank" rel="noopener noreferrer">https://github.com/solrpl/explain</a> or send a mail to <strong>explain</strong>(at)<strong>solr</strong>(dot)<strong>pl</strong>.</p>
<h3>Request</h3>
<p>We also have a request to the community. If you have some notes, if you encounter an error or would like some additional functionality please write an mail to <strong>explain</strong>(at)<strong>solr</strong>(dot)<strong>pl</strong>. This will allow us to better understand Your needs and will make <a href="http://explain.solr.pl">explain.solr.pl</a> better suite Your needs.</p>
<h3>Something at the end</h3>
<p>At the very end we would like to show you how the process of developing <a href="http://explain.solr.pl">http://explain.solr.pl</a> looks like. We used a tool called <a href="http://code.google.com/p/gource/" target="_blank" rel="noopener noreferrer">gsource</a> <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<p><iframe loading="lazy" src="http://www.youtube.com/embed/NUENKPOf_6U" width="560" height="315" frameborder="0"></iframe></p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2011/12/23/explain-solr-pl-beta-version-available/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Lucene Eurocon 2011 &#8211; day two</title>
		<link>https://solr.pl/en/2011/11/14/lucene-eurocon-2011-day-two/</link>
					<comments>https://solr.pl/en/2011/11/14/lucene-eurocon-2011-day-two/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Mon, 14 Nov 2011 20:53:41 +0000</pubDate>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[day]]></category>
		<category><![CDATA[day two]]></category>
		<category><![CDATA[eurocon]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[lucene eurocon]]></category>
		<category><![CDATA[presentation]]></category>
		<category><![CDATA[solr]]></category>
		<category><![CDATA[two]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=380</guid>

					<description><![CDATA[In the previous entry, we tried to cover what we saw at the first day of Lucene Eurocon 2011 conference. However, from our point of view, the second day was the more important one, due to the fact that this]]></description>
										<content:encoded><![CDATA[<p>In the <a href="http://solr.pl/en/2011/11/07/lucene-eurocon-2011-day-one/" target="_blank" rel="noopener noreferrer">previous</a> entry, we tried to cover what we saw at the first day of Lucene Eurocon 2011 conference. However, from our point of view, the second day was the more important one, due to the fact that this was the day we had our talk. But let&#8217;s start from the beginning.</p>
<p><span id="more-380"></span></p>
<h3>Keynote (Realtime Search at Twitter, Michael Busch)</h3>
<p>Day two began with one of the better, in my opinion, talks during Lucene Eurocon 2011. Michael Busch, during his presentation titled &#8220;Realtime Search at Twitter&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Busch_twitter_realtime_search_eurocon_11.pdf" target="_blank" rel="noopener noreferrer">slides</a>, <a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/realtime-search-twitter" target="_blank" rel="noopener noreferrer">video</a>) showed us what the Twitter team to handle real-time search based on Lucene, so that the whole system was able to handle the daily traffic of Twitter. The presentation consisted of a very large amount of technical details, details that were explained in details. Conference participants, in addition to the information about Lucene, also got a brief lesson of Java multithreaded programming <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<h3>What will come (Lucene Today, Tomorrow &amp; Beyond, Simon Willnauer)</h3>
<p>The conference was divided into three parallel sessions, and we chose the presentation titled &#8220;Lucene Today, Tomorrow &amp; Beyond&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Willnauer_Eurocon_today_tomorrow_and_beyond_eurocon.pdf">slides</a>, <a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/lucene-today-tomorrow-beyond" target="_blank" rel="noopener noreferrer">video</a>) , led by Simon Willnauer. The presentation focused on what Lucene is not capable of at this time, and what it may be capable of in the future (for example, in version 5). Simon began by presenting the history of Lucene, the people involved in the development of the project and the companies they work for. Then we found out what users can expect from Lucene (in terms of functionalities), when version 4.0 is officially released. During the talk, Simon also talked about that the role of positions of tokens in the token stream, incremental updates and JIT. At the end we were shown a part of his vision of Lucene and Solr as a modular projects.</p>
<h3>A thing about explain in Solr (Understanding and Visualising Solr &#8216;explain&#8217; Information, Rafał Kuć)</h3>
<p><img loading="lazy" decoding="async" class="alignleft size-thumbnail wp-image-1783" style="border: 1px solid black; margin: 10px;" title="eurocon2011" src="http://solr.pl/wp-content/uploads/2011/11/eurocon2011-150x150.png" alt="" width="150" height="150">For personal reasons and because I wanted to prepare for the talk I desided to skip one of the talks. And after the lunch, my talk began. The presentation titled &#8220;Understanding and Visualising Solr &#8216;explain&#8217; Information&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Understanding%20and%20Visualizing%20Solr%20Explain%20information%20-%20Solr.pl%20-%20version%202.pdf" target="_blank" rel="noopener noreferrer">slides</a>)&nbsp;was divided into two parts &#8211; first, the theory regarding the information returned by Apache Solr and the second &#8211; what we have done for <a href="http://explain.solr.pl" target="_blank" rel="noopener noreferrer">http://explain.solr.pl</a>. Let me not to comment on my own presentation, and focus on what I felt after it. Unfortunately, I was not pleased after the talk, stress ate me alive &#8211; I didn&#8217;t say all I wanted to say, I rushed and not explained what I wanted to. But now, after having some experience I can promise one thing &#8211; next time will be better, much better. I am pleased with the positive reception of the talk and the topic itself.</p>
<h3></h3>
<h3></h3>
<h3></h3>
<h3>Randomized tests (Randomized Continous Testing: Solr &amp; Lucene Use Case, Dawid Weiss)</h3>
<p>After a short rest we went to listen as David Weiss talks about tests using a random factor during the talk titled &#8220;Randomized Continous Testing: Solr &amp; Lucene Use Case&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/dweiss-Eurocon_testing.pdf" target="_blank" rel="noopener noreferrer">slides</a>). If you are interested about the automated testing (and you should be) I definitely recommend the slides, as well as video as soon as it&#8217;ll be available. David started with the theory about where you can add a random factor to the tests (of course everything in the context of Lucene and Solr). Then we got a good deal of technical information on how to implement a randomized test (with examples of course). There were also information on how to test multi-threading.</p>
<h3>How to test your search engine (Better Search Engine Testing, Eric Pugh)</h3>
<p>Staying in the topic of testing, we decided to go to the last talk during Lucene Eurocon 2011 titled &#8220;Better Search Engine Testing&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Eric%20Pugh%20-%20Better%20Search%20Engine%20Testing%20Printout.pdf" target="_blank" rel="noopener noreferrer">slides</a>), led by Eric Pugh.</p>
<h3>Commiter panel</h3>
<p>The last attraction in the Lucene Eurocon 2011 was the &#8220;Commiter panel&#8221; &#8211; the meeting of conference participants and the people that develop Apache Lucene and Solr every day. After brief introductions, there was a series of questions and answers. If you are interested how the meeting went, you will have to wait for the video material. Another interesting presentation, at least from my point of view. Eric started from the fact that search, and particularly good search is increasingly important in today&#8217;s world. What&#8217;s more, data storage is cheap, but data management is by no means cheap, and therefore the role of search engines will increase. Eric then turned our attention to the problem of testing the search engines, what should be tested and how we can automatically test the search engines. It&#8217;s hard to write a summary of that talk, due to the large amount of information that Eric had given.</p>
<h3>Other presentations</h3>
<p>I would also like to invite persons interested in presentations other than those discussed in this and the previous entry to the following sites:</p>
<ul>
<li><a href="http://2011.lucene-eurocon.org/">http://2011.lucene-eurocon.org/</a></li>
<li><a href="http://2011.lucene-eurocon.org/pages/program">http://2011.lucene-eurocon.org/pages/program</a></li>
<li><a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011" target="_blank" rel="noopener noreferrer">http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011</a></li>
</ul>
<h3>To sum up</h3>
<p>From my point of view, Lucene Eurocon 2011 conference was much better organized than Lucene Eurocon 2010, even considering the fact that the Lucene Eurocon 2010 conference was very good. You could see that the organizers gained experience <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /> The talks at the conference were miscellaneous &#8211; there were strictly technical subjects and those that focused on the user. Because of the three parallel sessions, it was impossible not to find the presentation for yourself &#8211; the organizers deserve thanks and congratulations, they did a very good job. I hope that we will meet most of this year&#8217;s participants in the Lucene Eurocon 2012 <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f609.png" alt="😉" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2011/11/14/lucene-eurocon-2011-day-two/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>explain.solr.pl: Status</title>
		<link>https://solr.pl/en/2011/11/11/explain-solr-pl-status/</link>
					<comments>https://solr.pl/en/2011/11/11/explain-solr-pl-status/#respond</comments>
		
		<dc:creator><![CDATA[Marek Rogoziński]]></dc:creator>
		<pubDate>Fri, 11 Nov 2011 20:52:52 +0000</pubDate>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[eurocon]]></category>
		<category><![CDATA[explain]]></category>
		<category><![CDATA[explain.solr.pl]]></category>
		<category><![CDATA[lucene]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=378</guid>

					<description><![CDATA[During the Lucene Eurocon 2011 conference we gave a talk about a tool that will enable you to analyze Solr search results. We promised, that the public version of the tool will be released soon. We would like to ensure]]></description>
										<content:encoded><![CDATA[<p>During the <a href="http://solr.pl/en/2011/11/07/lucene-eurocon-2011-day-one/" target="_blank" rel="noopener noreferrer">Lucene Eurocon 2011</a> conference we gave a talk about a tool that will enable you to analyze Solr search results. We promised, that the public version of the tool will be released soon. We would like to ensure that we are getting close to that moment. Right now we are focusing on the following things:</p>
<p><span id="more-378"></span></p>
<ul>
<li>rebuilding Lucene explain information analyzer so it is ready for the changes in Lucene 4.0 (<em>per field similarity</em> and <em>flexible similarity</em>)</li>
<li>user interface rebuild along with some changes to the usability</li>
<li>code cleaning and preparation for publishing sources</li>
</ul>
<p>Fortunately autumn and winter evenings are just perfect for work <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /> We will try to make <a href="http://explain.solr.pl" target="_blank" rel="noopener noreferrer">http://explain.solr.pl</a> available as soon as possible.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2011/11/11/explain-solr-pl-status/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Lucene Eurocon 2011 &#8211; day one</title>
		<link>https://solr.pl/en/2011/11/07/lucene-eurocon-2011-day-one/</link>
					<comments>https://solr.pl/en/2011/11/07/lucene-eurocon-2011-day-one/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Mon, 07 Nov 2011 20:52:18 +0000</pubDate>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Lucene]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[2011]]></category>
		<category><![CDATA[barcelona]]></category>
		<category><![CDATA[eurocon]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[lucene eurocon]]></category>
		<category><![CDATA[presentation]]></category>
		<category><![CDATA[solr]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=376</guid>

					<description><![CDATA[As we wrote a few days ago we are back from this year Lucene Eurocon, which took place in Barcelona. Despite the fact that the videos will be available shortly, we decided to wrote something about those presentations we attended]]></description>
										<content:encoded><![CDATA[<p>As we wrote a few days ago we are <a href="http://solr.pl/en/2011/10/31/another-lucene-eurocon-is-a-history/" target="_blank" rel="noopener noreferrer">back</a> from this year <a href="http://2011.lucene-eurocon.org/" target="_blank" rel="noopener noreferrer">Lucene Eurocon</a>, which took place in Barcelona. Despite the fact that the videos will be available shortly, we decided to wrote something about those presentations we attended to.</p>
<p><span id="more-376"></span></p>
<h3>Keynote (Search + Big Data: It&#8217;s (still) All About the User&#8221;, Grant Ingersoll)</h3>
<p>The first day started at 8:30, at the time of the day which is called a &#8220;night&#8221; by the Spanish <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /> This year the conference was started by Grant Ingersoll with his presentation titled &#8220;Search + Big Data: It&#8217;s (still) All About the User&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Ingersoll_Lucid_Eurocon-keynote_0.pdf" target="_blank" rel="noopener noreferrer">slides</a>, <a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/search_big_data_its_still_all_about_users" target="_blank" rel="noopener noreferrer">video</a>). Grant reminded us that despite all the superb technologies, we deal with every day, we should always focus at the end user. Therefore, we should do everything to ensure that users are satisfied with the application we created. Grant showed that, despite the fact that developers have more and more technologically advanced tools, everything comes down to the same &#8211; the most important is still the user. You should definitely keep this in mind.</p>
<h3>Keynote #2 (Architecting the Future of Big Data &amp; Search, Eric Baldeschwieler)</h3>
<p>The second presentation we attended to during the conference was &#8220;architecting the Future of Big Data &amp; Search&#8221; by Eric Baldeschwieler (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Baldeschwieler_HortonWorks_LuceneEurocon20111018.pdf" target="_blank" rel="noopener noreferrer">slides</a>, <a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/architecting-future-big-data-search" target="_blank" rel="noopener noreferrer">video</a>). During the presentation Eric tried to answer whether Lucene and Hadoop are able to work efficiently within a single system. At the beginning we were given a large dose of information on the Map/Reduce algorithm, Hadoop and HBase. The second part of the presentation was a case study of the implementation of the Apache Hadoop at Yahoo and how Map/Reduce helped in the implementation of daily tasks of the system. At the very end of the we were given information about Lucene and Hadoop integration. Lots of interesting information, I recommend to watch the movie as soon as it gets published.</p>
<h3>The thing about the index structure (Portable Lucene Index Format &amp; Applications, Andrzej Białecki)</h3>
<p>At this time the conference was split into three parallel sessions and we decided to go and listen to Andrzej Bialecki and his presentation titled &#8220;Lucene Index Portable Format &amp; Applications&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/AndrzejBialecki-Eurocon-2011.pdf" target="_blank" rel="noopener noreferrer">slides</a>, <a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/portable-lucene-index-format-applications" target="_blank" rel="noopener noreferrer">video</a>). Andrzej started with information about the structure of Apache Lucene index and why backward compatibility is difficult to achieve. Then, the presentation focused on its title, which is why portable Lucene index format is needed, what are the challenges and what is the state of implementation of PortableCodec. Attendees could also see an interesting example of how to use the SimpleTextCodec and how the index data are stored in plain text:) I recommend the video to watch the whole presentation.</p>
<h3>First topic about Solr (Improving Solr&#8217;s Update Chain, Jan Høydahl)</h3>
<p>We changed the conference room to listen to Jan Høydahl talking about &#8220;Improving Solr&#8217;s Update Chain&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Jan%20Hoydahl_Improving%20Solrs%20Update%20Chain_Eurocon2011%20.pdf" target="_blank" rel="noopener noreferrer">slides</a>). During the presentation Jan showed how to define your own components in the Solr update chain. It turns out that a simple modification of the XML file allows you to expand and adapt the process of indexing the data for your own needs. During the presentation we were also shown how to implement your own update chain components that will enable you to influence the indexation process. A large part of the presentation was the case-study of the implementation of a project for the University of Oslo. At the end of the presentation we were given information about the plans and the functionalities that have been donated to the Apache Software Foundation as patches to Solr.</p>
<h3>Know what your users are doing (Search Analytics: Business Value &amp; BigData NoSQL Backend, Otis Gospodnetic)</h3>
<p>Another presentation focused on what should be one of the most esential things in all search systems &#8211; the user,&nbsp;presented by Otis Gospodnetic under the title &#8220;Search Analytics: Business Value &amp; BigData NoSQL backend&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/otis_gospodnetic_search_analytics_lucene_eurocon_2011.ppt" target="_blank" rel="noopener noreferrer">slides</a>, <a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/search-analytics-business-value-bigdata-nosql-backend" target="_blank" rel="noopener noreferrer">video</a>). If you are using Solr you should look at what Sematext has to offer and what&#8217;s more &#8211; completely for free (at least for now). During the presentation Otis told the audience that the analysis of user behavior and how users use your system, is not an option &#8211; it&#8217;s necessary. Without this it is impossible to tune and refine the search, because I do not know how, nor did we know where. Additionally, without the data about user behavior, we can not say whether the changes that we do are going in the right direction. I highly recommend watching the video.</p>
<h3>Solr and Hadoop (Scaling Search at Trovit with Solr &amp; Hadoop, Marc Sturlese)</h3>
<p>After a short break we decided to listen about integration of Solr and Hadoop during the presentation &#8220;Scaling Search with Solr at Trovit &amp; Hadoop&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/MarcSturleseTrovit.pdf" target="_blank" rel="noopener noreferrer">slides</a>), led by Marc Sturlese. The author focused on how they managed to disperse the indexation based using the Map/Reduce algorithm. In short, the presentation was about how pretty efficiently, using the data stored in HDFS, create smaller indexes, and then combine them into larger indices (or a large index, depending on your needs). Additionally, Marc showed what they did to take advantage of the functionality of Solr (data analysis and deduplication) not running Solr itself.</p>
<h3>Solr and UIMA<strong> (</strong>Natural language search in Solr, Tommaso Teofili)</h3>
<p>The next presentation in which we participated was &#8220;Natural language search in Solr&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/NaturalLanguageSearchInSolr.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Tommaso Teofili. The presentation was about usage of Apache Solr with conjunction of UIMA and a case-study of a real life project. A bit of technical detail, some performance comparisons and information about the necessity of system learning. Interesting topic and very interesting presentation. I recommend to watch the video as soon as it gets published.</p>
<h3>Here comes the new (Improved Search with Lucene 4, Robert Muir)</h3>
<p>The last presentation scheduled for the first day of Lucene Eurocon 2011 was &#8220;Improved Search with Lucene 4&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Muir_Eurocon_ImprovedSearchLucene4.pdf" target="_blank" rel="noopener noreferrer">slides</a>, <a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/improved-search-lucene-4" target="_blank" rel="noopener noreferrer">video</a>), led by Robert Muir. He talked about the changes that await the Apache Lucene in version 4.0. We also heard a lot about the performance of Lucene 4.0, changes in the API, NRT, and deep paging support. An interesting technical presentation about what we can expect from the upcoming version of Lucene. Persons interested in this subject should refer to the video.</p>
<h3>Lighting Talks</h3>
<p>Then again, the conference has been combined into a single session which was called &#8220;Lighting Talks&#8221;. During the session presenters had about 7 minutes for their talk. The talk that I remember the most (because of the large dose of humor) was &#8220;Java 7 and Lucene: the story behind the story&#8221; by Uwe Schindler which in my opinion should be rather called &#8220;Do not use Java 7 for anything&#8221; <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /> A sincere congratulations for amusing the audience <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f609.png" alt="😉" class="wp-smiley" style="height: 1em; max-height: 1em;" /> The lighting talks session consisted of the following presentation:</p>
<ul>
<li>&#8220;Morphological Analysis and Named Entity Recognition for your Lucene/Solr Search Applications&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/INTRAFIND_Goller.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Christoph Goller</li>
<li>&#8220;Java 7 and Lucene: the story behind the story&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Schindler-Java7_eurocon2011.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Uwe Schindler</li>
<li>&#8220;Navigating Subdocuments with Solr&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Khudnev_Lightning_Searching_Sub_documents_with_Solr.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Mikhail Khludnev</li>
<li>&#8220;Powered by Lucene: IBM Content Analytics with Enterprise Search&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Wolf_Lightning_IBM%20Content%20Analytics.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Wolfgang Jung</li>
<li>&#8220;Solr performance monitoring&#8221; led by Otis Gospodnetic</li>
<li>&#8220;Searching in more than 140 years newspaper articles&#8221; (<a href="http://www.lucidimagination.com/sites/default/files/file/Eurocon2011/Provenzov_Lightning_searching_in_more_than_140_years_newspaper_articles_def-3.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Nicola Provenzano</li>
</ul>
<h3>Stump the chump (Chris Hostetter)</h3>
<p>The last session for the day was fun and it was possible to win a few Euros if you asked a good question (100, 50 and 25 as I recall). The whole session was about asking a question that Chris (aka Hoss) would not been able to answer. Summarizing this part does not make sense, I recommend the video which is available at the following URL: <a href="http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/apache-lucene-eurocon-2011-stump-chump" target="_blank" rel="noopener noreferrer">http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011/apache-lucene-eurocon-2011-stump-chump</a>.</p>
<h3>End of the first day</h3>
<p>And that was all about the first day of the conference. Some of the participants then went to FC Barcelona vs FC Viktoria Plzeň, a small part of us went in the direction of their hotels, and all the rest, including us, went to Shoko Club for some cava and something to eat.</p>
<h3></h3>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2011/11/07/lucene-eurocon-2011-day-one/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Another Lucene Eurocon is a history</title>
		<link>https://solr.pl/en/2011/10/31/another-lucene-eurocon-is-a-history/</link>
					<comments>https://solr.pl/en/2011/10/31/another-lucene-eurocon-is-a-history/#respond</comments>
		
		<dc:creator><![CDATA[Marek Rogoziński]]></dc:creator>
		<pubDate>Mon, 31 Oct 2011 20:51:33 +0000</pubDate>
				<category><![CDATA[Conferences]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=374</guid>

					<description><![CDATA[Another Lucene Eurocon is a history. That was a very intense two days, where the basic problem was: which of the three lectures go to. Sometimes the choice was very difficult, the only hope is that this year all the]]></description>
										<content:encoded><![CDATA[<p>Another Lucene Eurocon is a history. That was a very intense two days, where the basic problem was: which of the three lectures go to. Sometimes the choice was very difficult, the only hope is that this year all the presentations were recorded and will soon be available on the web.</p>
<p><span id="more-374"></span></p>
<p>For us the most important was the second day and the presentation of the results of our work: <strong>Understanding &amp; Visualising Solr &#8216;explain&#8217; Information</strong>. The presentation consisted of two parts. First, the theoretical, described what Solr presents about the validity of the returned document. This information, unfortunately, do not belong to the most readable, especially when the search is done on many fields, often using dedicated query parser (the screenshot contains explain fragment describing a hit in only one document (!)).</p>
<p><img loading="lazy" decoding="async" title="explain" src="http://solr.pl/wp-content/uploads/2011/10/explain-300x118.png" alt="Przykładowy explain" width="300" height="118"></p>
<p>The situation gets worse because the questions from the client about the documents positioning are one of the most time consuming ones during the project.</p>
<p>The second part of the presentation showed what we decided to do with this problem. The idea came after seeing <a href="http://explain.depesz.com">explain.depesz.com</a> &#8211; a similar tool for visualization of information provided by the postgreSQL database.</p>
<p>In retrospect, our explain seems to be a good idea. Although its not finished we are using the tool almost everyday. What&#8217;s more &#8211; the Lucene Eurocon presentation showed us that there is also considerable interest in the community for a tool like this. Thank you for your kind words and the promises of sending us your own modules.</p>
<p>The current version explain.solr.pl focuses on reading the information generated by the solr 3.x. We are currently working on going public, and (somewhat later) the opensourcing the code.</p>
<p>To sum up the plans for the nearest future are quite ambitious:)</p>
<ol>
<li>Firstly &#8211; the chronicler&#8217;s duty &#8211; we will try to describe how Lucene Eurocon 2011 looked from our perspective. We plan to publish two entries &#8211; each of which will be dedicated to one day of the conference. That is something to read before they appear on the official videos from the conference.</li>
<li>Opening <a href="http://explain.solr.pl">explain.solr.pl</a> for a wide audience:)</li>
<li>Publication of <a href="http://explain.solr.pl">explain.solr.pl</a> code (github)</li>
<li>Back to more regular posts about Solr.</li>
</ol>
<p>Keep your fingers crossed:)</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2011/10/31/another-lucene-eurocon-is-a-history/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Lucene Eurocon 2010</title>
		<link>https://solr.pl/en/2010/07/12/lucene-eurocon-2010/</link>
					<comments>https://solr.pl/en/2010/07/12/lucene-eurocon-2010/#respond</comments>
		
		<dc:creator><![CDATA[Rafał Kuć]]></dc:creator>
		<pubDate>Mon, 12 Jul 2010 09:17:24 +0000</pubDate>
				<category><![CDATA[Conferences]]></category>
		<guid isPermaLink="false">http://sematext.solr.pl/?p=52</guid>

					<description><![CDATA[Following the announcement by the Apache Software Foundation&#8217;s intention to abandon the organization of ApacheCon conference on the old continent, we Europeans were left with no conference under the sign of the Apache near us. But as we all know,]]></description>
										<content:encoded><![CDATA[<p>Following the announcement by the<strong> Apache Software Foundation&#8217;s</strong> intention to abandon the organization of ApacheCon conference on the old continent, we Europeans were left with no conference under the sign of the <strong>Apache</strong> near us. But as we all know, nature does not like emptiness, and thus the company <a href="http://www.lucidimagination.com/" target="_blank" rel="noopener noreferrer">Lucid Imagination</a>, in cooperation with the sponsors, decided to organize the first conference dedicated to Lucene and Solr &#8211; <a href="http://lucene-eurocon.org/" target="_blank" rel="noopener noreferrer">Lucene Eurocon</a>. Due to the fact that we had the pleasure to participate in this conference, we decided to pass you a short account of its progress.</p>
<p><span id="more-52"></span></p>
<p>The conference was divided into two parts: a training and a typical conference. I wont write about the training part, because I did not participated in this part of the conference. Trainings were conducted by two commiters of Lucene/Solr project: Erik Hatcher and Grant Ingersoll. More information on topics that were addressed in this part of the conference can be found at <a href="http://lucene-eurocon.org/training.html" target="_blank" rel="noopener noreferrer">http://lucene-eurocon.org/training.html</a>.</p>
<h3>The Begining</h3>
<p>Proper conference began Thursday, May 20, with the greeting done spoken by Grant Ingersoll, followed by&nbsp; two presentations: &#8220;<em>The Search Revolution &#8211; How Lucene &amp; Solr Are Changing the World</em>&#8220;, led by Lucid Imagination`s CEO &#8211; Eric Gries (<a href="http://lucene-eurocon.org/slides/The-Search-Revolution-How-Lucene-&amp;-Solr-Are-Changing-the-World_Eric-Gries.pdf" target="_blank" rel="noopener noreferrer">slides</a>) and &#8220;<em>From Publisher To Platform: How The Guardian Used  Content, Search,  and Open Source To Build a Powerful New Business Model</em>&#8221; (<a href="http://lucene-eurocon.org/slides/From-Publisher-To%20Platform-the-Guardian_Stephen-Dunn.pdf" target="_blank" rel="noopener noreferrer">slides</a>), presented by Stephen Dunn &#8211; the head of strategies and technologies department in the The Guardian. Eric Gries in his presentation focused on the increasing demand for information processing, information retrieval and of course on Lucene/Solr project which we can use in both discused cases. Of course, there was an information about growing interest in people with skills in both projects. I must mention that I envy the skills of talking with such audience interest. I just bow my forehead. While the second presentation, led by Stephen Dunn, was a discussion about the new developer platform released by The Guardian named &#8220;The Open Platform&#8221; which allow access to a database of articles published by The Guardian since 1991 (millions of documents and still growing). The author focused on describing the technical details of implementation based on a Solr search engine and the profits of using their platform.</p>
<p>Then, the conference split into two tracks. Due to the fact that physical presence was possible only on one of the two presentations, I will describe only those presentations in which I was present. To be precise &#8211; full agenda is available at <a href="http://lucene-eurocon.org/agenda.html" target="_blank" rel="noopener noreferrer">http://lucene-eurocon.org/agenda.html</a>.</p>
<h3>Tika and the future of Lucene</h3>
<p>I decided that I`ll go for the track one. For the start some interesting and sometimes detailed product information about Tika in presentation called &#8220;<em>Text and Metadata Extraction with Apache Tika</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Text-and-Metadata-Extraction_Jukka-Zitting.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Jukka Zitting. He started with a bit of background information, assumptions and historical overview of the project. Then came purely technical informations including code fragments showing how to use the framework. Overall, from the developer perspective, I think it was a good presentation. Without any break we got a second presentation led by Uwe Schindler and Simon Willnauer under the intriguing title &#8220;<em>Lucene Forecast: Version, Unicode, Flex and Modules</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Lucene-Forecast-Version-Unicode-Flex-and-Mod_Willnauer&amp;Schindler.pdf" target="_blank" rel="noopener noreferrer">slides</a>). In my humble opinion I received enormous amount of information about the future of both Lucene and Solr. The presentation began by explaining the recent moves in the Lucene and Solr projects &#8211; the development merge. From now on the trunk for both projects is collective, same as the commiters. Further more the trunk is now a place where the development of the newest version of Lucene and Solr are kept. We also heard that the 4.0 version of Lucene wont be backwards compatible. Further information provided by speakers were no less interesting &#8211; plans to port the actual faceting mechanism to Lucene. Much time was devoted to discussing the changes associated with full support for Unicode (<em>ICU </em>module of Lucene) and then speakers went on to lead a very interesting topic for me, a <em>Flexible Indexing</em>. I`ll try to write something more about the <em>ICU </em>in the near future.</p>
<p>Below are the titles of two presentations of the second track. Unfortunately due to the fact that I did not take part in them, I can not say anything more about them:</p>
<ul>
<li>&#8220;<em>Use of Solr at Trovit, A Leading Search Engine For Classified Ads</em>&#8220;, led by Mark Sturlese</li>
<li>&#8220;<em>Implementing Solr in Online Media As An Alternative to Commercial Search Products</em>&#8220;, led by Bo Raun</li>
</ul>
<h3>Magic of post-processing</h3>
<p>After a lunch break I decided not to change the trac and I went to see presentation titled &#8220;<em>Munching and Crunching: Lucene Index Post-processing</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Munching-&amp;-crunching-Lucene-index-post-processing-and-applications_Andrzej-Bialecki.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Andrzej Białecki and after the whole presentation I think it was the right choice. I decided that I wont write much about the topic he discussed because in my opinion, the amount of information deserves a separate post. He provided information main about the possibility of Lucene index post-processing like: separation, cleaning, filtering and strong the whole index in RAM memory. As I said, more on this topics in a separate entry. Meanwhile, on the second track, Peter Kiraly led a presentation titled &#8220;<em>Bringing Solr to Drupal: A General, and a Library-Specific Use Case</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Bringing-Solr-to-Drupal-A-General-and-Library-Specific-Use-Case_Peter-Kiraly.pdf" target="_blank" rel="noopener noreferrer">slides</a>).</p>
<h3>Solr and NLP</h3>
<p>I decided to leave the first track and the presentation &#8220;<em>Solr Schema Demystified</em>&#8221; led by Uri Boness in favor of &#8220;<em>Integration of Natural Language Processing tools with Solr</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Integration-of-Natural-Language-Processing-tools-with-Solr_Joan-Codina-Filba.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Joan Codina-Filbà. I though that the topic should be interesting for those seeking to integrate Solr with the linguistic analysis tools &#8211; to save the results returned by these tools and use them. The presented information included use of <a href="http://uima.apache.org/" target="_blank" rel="noopener noreferrer">UIMA</a>, the problems that the developers encountered and how they were resolved. Joan also mentioned how they used Carrot2 in their systems. The presenter also showed differences between stem and lemma, how they diagnosed which part of speech the term is and how this information was used for classification of positive or negative connotation of comments about the products. All this have been said in the context of Lucene and Solr and the use of payloads and without it. It`s a pity that it was discussed only in the context of the Spanish language, but hey, You can`t have everything right ? <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f609.png" alt="😉" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<h3>Document processing and pipe-line</h3>
<p>Due to the fact that early this day I heard a bit about &#8220;<em>The Open Platform</em>&#8221; made by the Guardian, I decided not to go and see a presentation titled &#8220;<em>Solr in the Wild: The Guardian&#8217;s Open Content Platform API</em>&#8221; led by Graham Tackley (<a href="http://lucene-eurocon.org/slides/Solr-In-The-Wild-The-Guardians-Open-Platform-Content-API_Graham-Tackley.pdf" target="_blank" rel="noopener noreferrer">slides</a>) and instead listen to a speech led by Max Charas and Karl Jansson titled &#8220;<em>Modular Document Processing for Solr/Lucene</em>&#8220;(<a href="http://lucene-eurocon.org/slides/A-Pipeline-for-Solr_Charas-Jansson.pdf" target="_blank" rel="noopener noreferrer">slides</a>). Presentation in my opinion, was more about the capabilities of sponsors products, than something very insightful, especially comparing to the previous presentation. That was my impression after this presentation, but maybe it was the result of information overload and general tiredness.</p>
<h3>Solr in IBM</h3>
<p>After another shot break two further presentations began. Given the choice: &#8220;<em>Make Your Domain Objects Searchable with Hibernate Search</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Making-Your-Domain-Objects-Searchable-with-Hibernate-Search_Gustavo-Fernandes.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Gustavo Fernandes and &#8220;<em>Social and Network Discover (SaND) over Lucene</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Social-and-Network-Discovery-over-Lucene_Shai-Erera.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Shai Erera i chose the second one. I short, the presentation concentrated on the application created by IBM to search for information within the company, information like: documents, people, parties, etc. The presenter showed how search was implemented (what can we find), how to narrow the results (faceting, narrowing based on date, location or source), and how they presented the relationship between individuals or documents. In addition to search, we were showed a pretty interesting functionality &#8211; the graph of relationships (slide 25).</p>
<h3>From FAST to Solr</h3>
<p>Being tired already I choose the last presentation of the day, the &#8220;<em>Key Topics When Migrating From FAST to Solr</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Migrating-FAST-to-Solr_Jan-Hoydah.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by John Høydahl over &#8220;<em>Query by Document: When &#8220;More Like This&#8221; Is Insufficient</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Query-By-Document-When-More-like-this-is-insufficient.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Dusan Omercevic. It was interesting to hear about some solutions that have been implemented in FAST that are not in Lucene/Solr projects. John Høydahl spoke quite interestingly, showed differences between the two technologies and how to handle the case of deficiencies of certain functionality, both with one side or the other. For the person who had no commercial experience with FAST, like me, it was interesting to know that most of the data processing in FAST is prepared in the stage of indexing &#8211; for example, sorting must be defined at the stage of indexing. Looking at the FAST can see, what else is missing in Solr &#8211; such as multilingual fields and pipeline. In the later part of the presentation we were shown how to make migration from FAST to Solr, of course, highly simplified, but very informative. Overall, a very good presentation in my opinion.</p>
<h3>End of the first day</h3>
<p>Then, after a 90 minut break, we had a five short presentations, less formal. Due to their nature, and that our thoughts were already at the Czech Beer Festival, I decided only to mention them:</p>
<ul>
<li>&#8220;<em>Social Media Scheduler based on Solr + Hadoop + Amazon EC2</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Social-Media-Scheduler-For-Spanish-Blogs_Pablo-Aragon.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Pablo Aragón</li>
<li>&#8220;<em>Introduction to Collaborative Filtering using Mahout</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Introduction-To-Collaborative-Filtering-Using-Mahout_Frank-Scholten.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Frank Scholten</li>
<li>&#8220;<em>Enterprise Search meets Enterprise CMS &#8211; TYPO3 and Apache Solr</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Enterprise-Search-meets-Enterprise-CMS-Apache-Solr-and-TYPO3_Olivier-Dobberkau.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Olivier Dobberkau</li>
<li>&#8220;<em>BM25 Scoring for Lucene &#8211; From Academia to Industry</em>&#8221; (<a href="http://lucene-eurocon.org/slides/BM25-Scoring-For-Lucene_Yuval-Feinstein.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Yuval Feinstein</li>
<li>&#8220;<em>How We Scaled Solr to 3+ Billion Documents</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Scaling-Solr-to-3Billion-Documents_Jason-Rutherglen.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Jason Rutherglen</li>
</ul>
<h3>Second day</h3>
<p>Similar to the first day, the second day began with a short introduction by Grant Ingersoll.</p>
<h3>So we start again</h3>
<p>Just like in the case of the previous day, at the beginning we got two presentations: &#8220;<em>Software Disruption: How Open Source, Search, Big Date and Cloud Technology are Disrupting IT</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Software-Disruption-How-Open%20Source-Big-Data-and-The-Cloud-Are-Disrupting-IT_Zack-Urlocker.pdf" target="_blank" rel="noopener noreferrer">slides</a>), led by Zack Urlocker and &#8220;<em>Solr 1.5 and Beyond</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Solr-15-and-Beyond_Yonik-Seely.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Yonik Seeley. The first presentation, due to its marketing target didn`t interest me much, which does not change the fact that it has been forwarded to give interesting facts about the state of development of the open source software. There were also predictions about the direction of software development and it`s turn to the processing in the cloud. Zak Urlocker is for sure a speaker with a wide knowledge of open source as one of the fathers of the success of the MySQL. The second presentation was from my point of view much more interesting, because of technical orientation. Yonik Seeley concentrated on several key themes for future of Solr &#8211; merge of developments, extended Dismax parser, integration with Zookeeper (so-called Solr Cloud), spatial search, field collapsing and NRE (near real time) search. In my opinion it is worth keep an eye on changes in Solr, since the future of the project appears very bright, if only everything that was mentioned during the presentation can be achieved and yet we know that not everything was told.</p>
<h3>Grant Ingersoll about relevance</h3>
<p>And again the conference split into two paths. It seemed to me, that I`ll benefit more by listening to Grant`s Ingersoll talk about &#8220;<em>Practical Relevance</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Practical-Relevance_Grant-Ingersoll.pdf" target="_blank" rel="noopener noreferrer">slides</a>) rather than &#8220;<em>Building Multilingual Search Based Applications</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Building-Multilingual-Search-Based-Applications_Steve-Kearns.pdf" target="_blank" rel="noopener noreferrer">slides</a>) led by Steve Kearn. I was not mistaken. Grant spoke about the improvement of search results quality &#8211; how to do it, how to analyze logs, how to do what can be drawn from the search statistics and what needs to be targeted during the process of quality improvement. He referred to the need to collect information from users, because they ultimately decide the success or failure of the application. There were also details how to quickly gain satisfactory results by adding phrase boosting. Approaching the end of the presentation, Grant talked about the advanced methods of influencing the so-called relevance, i.e. by developing own components responsible for counting the validity of the document. He also referred to the &#8220;Open Relevance&#8221; project &#8211; one of the projects of Lucene ecosystem.</p>
<h3>Lucene Connectors Framework</h3>
<p>I decided to stay in the same conference room and listen to Karl Wrigth, who spoke about Lucene Connectors Framework, in a presentation titled &#8220;<em>Lucene Connectors Framework: An Introduction</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Lucene-Connectors-Framework-Introduction_Karl-Wright.pdf" target="_blank" rel="noopener noreferrer">slides</a>). At the same time conference participants could listen to Karel Braeckman presenting &#8220;<em>Unlocking a Breadcaster Video Archive Using Solr</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Unlocking-a-broadcaster-archive-using-Solr_Karel-Braeckman.pdf" target="_blank" rel="noopener noreferrer">slides</a>). Presentation about <em>LCF </em>was mainly providing assumptions and architecture of the framework. We learned that the framework itself is currently in the process of migration from a commercial to open source form, so at this time a lot of functionality may not work, because some of them were based on commercial libraries, which for obvious reasons can not be used in a project open source. While the framework itself will certainly be interesting and useful tool to supplement Solr and Lucene with capabilities such as reading the various data sources (both by PULL and PUSH), the ability to deliver data periodically. Nice thing about Lucene Connectors Framework is it`s security model. In my opinion, at this point, the framework should be regarded as a curiosity, but I really hope this will change soon.</p>
<h3>Solr + Zookeeper = Solr Cloud</h3>
<p>After a lunch break and the free talks with the commiters of Lucene/Solr projects began the afternoon presentation session. Rested and hungry for information I went to see Mark Miller talking about &#8220;<em>Solr in the Cloud</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Solr-In-The-Cloud_Mark-Miller.pdf" target="_blank" rel="noopener noreferrer">slides</a>). At the same time in the next room Tyler Tate and Stefan Olafsson talked about &#8220;<em>The Path to Discovery: Facets and the Scent of Information</em>&#8221; (<a href="http://lucene-eurocon.org/slides/The-Path-To-Discovery-Facets-and-the-Scent-Of-Information_Tate-&amp;-Olafsson.pdf" target="_blank" rel="noopener noreferrer">slides</a>). As you can guess the theme of the presentation it was about so-called Solr Cloud, which is a distributed instance of Solr farm managed using <em>Zookeeper </em>framework. The talk began with the presentation of master &#8211; slave architecture, and how this architecture looks like in the case of distributed index among many shards. Then we had a few words about the index replication &#8211; from shell scripts to new Java based mechanism. Then, Mark Miller, moved to describe the core assumptions that underlie the integration with the <em>Zookeeper </em>&#8211; centralized configuration, fault tolerance, the ability to automatically delete and add more Solr instances or support for checking the status of each instance. Then he discussed what has been done so far with the integration with <em>Zookeeper </em>and what more needs to be done. He also mentioned about what is planned for the future. In addition to matters related to the <em>Zookeeper</em> integration Mark Miller presented the new features in Solr 1.4, ie. <em>LoadBalanced Solr Http Server</em>. In conclusion, the presentation was very interesting, showing a further path of development Solr.</p>
<h3>A moment of grace</h3>
<p>When we talked with other participants of the conference and made some contacts, there were two consecutive presentations:</p>
<ul>
<li>&#8220;<em>Neural Networks, Newspapers and Solr &#8211; A short tour through through extending Solr for real-world text-classification</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Neural-Networks-Newspapers-and-Solr_Sven-Maurmann.pdf" target="_blank" rel="noopener noreferrer">slides</a>), led by Sven Maurmann</li>
<li>&#8220;<em>Rapid Prototyping</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Rapid-Prototyping-with-Solr_Erik-Hatcher.pdf" target="_blank" rel="noopener noreferrer">slides</a>), led by Eric Hatcher</li>
</ul>
<p><strong>Almost the end</strong></p>
<p>The last presentation I watched at the conference was &#8220;<em>Combating Information Overload &#8211; Search in Military Information gathering Systems</em>&#8221; (<a href="http://lucene-eurocon.org/slides/Combatting-Information-Overload-Search-in-Military-Information-Gathering-Systems_Alexandra-Larsson.pdf" target="_blank" rel="noopener noreferrer">slides</a>), the speaker was Alexandra Larsson, Captain of the Swedish Air Force, but at first I went to hear about &#8220;<em>European Language Analysis with Hunspell</em>&#8221; led by Chris Male. After a while I switched to adjacent conference room and listen about how the Swedish military service is using Lucene and Solr in their systems. Because I didn`t listen any of the presentations from the beginning to the end I will not write anything about them. I`ll just say, that the Alexandra Larsson showed some pretty examples of how military is using Solr.</p>
<h3>Summary</h3>
<p>The last word said on the Lucene Eurocon 2010 were spoken by Grant Ingersoll, there were some books given, t-shirts and lottery for a pass to Lucene Revolution in Boston, something fun for the audience <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f609.png" alt="😉" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<p>To sum up the whole event I&#8217;m happy that I could participate. Interesting presentations, the sea of ideas and plans and to make sure the community is that Lucene and Solr are mature products that have not rested on its laurels, and despite the growing interest in their present shape there are peaople behind who care about it, so projects are not standing still. I hope that I will participate in the Lucene Eurocon 2011.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://solr.pl/en/2010/07/12/lucene-eurocon-2010/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
