Category | Uncategorized
Home » Uncategorized ( Page 2 )

The session, “Use Case Diagnosis – When is Solr Really the Best Tool?,” by Michael Hausenblas, Chief Data Engineer at MapR, will present an overview of common big data use cases in the form of a set of questions that can be used to determine what kind of problem you really have. From the answers to these questions, you can quickly find out about what technologies are likely to be most productive, useful, …

During the session, “The First Class Integration of Solr with Hadoop,” Apache Lucene/Solr Committer, Mark Miller talks about how Solr has been integrated into the Hadoop ecosystem to provide full text search at “Big Data” scale. This talk will give an overview of how Cloudera has tackled integrating Solr into the Hadoop ecosystem and highlights some of the design decisions and future plans. Learn how Solr is getting ‘cozy’ with Hadoop, which contributions are going …

Although people usually come to Lucene and related solutions in order to make data searchable, they often realize that it can do much more for them. Indeed, its ability to handle high loads of complex queries makes Lucene a perfect fit for analytics applications and, for some use-cases, even a credible replacement for a primary data-store. It is important to understand the design decisions behind Lucene in order to better understand the problems it can …

The juris portal provides access to legal information (about 6.5 mil documents) and information about German companies (about 23 mil documents). Access is highly personalized:  search, links, and search suggestions are customized according to the documents contained in a user’s product collection. There are many search options, the system stability and reliability have to be high and there are DVD versions of subsets of the complete collection.

The Lucene/Solr Revolution session, “Moving a Complex Application …

Like many web applications in the past, the Solr Admin UI up until 4.0 was entirely server based. It used separate code on the server to generate their dashboards, overviews, and statistics. All that code had to be maintained and still… you weren’t really able to use that kind of data for the things you needed it for. It was wrapped into HTML, most of the time difficult to extract and they changed the structure …

Google+