Tag | Lucene Solr Revolution

Faceted search is a powerful technique that lets users navigate search results easily. It can also be used to build rich user interfaces that give an analyst quick insight into the document space.

In the Lucene/Solr Revolution session, “Faceted Search with Lucene,” Shai Erera, Researcher at IBM, will introduce the Facets module: how to use it, under-the-hood details, optimizations, and best practices. He will also describe advanced faceted search capabilities with Lucene Facets.…

Text classification automates the task of filing documents into predefined categories based on a set of example documents. The first step in automating classification is to transform the documents into feature vectors. Though this step is highly domain-specific, Apache Mahout provides a lot of easy-to-use tooling to help you get started, most of which relies heavily on Apache Lucene for analysis, tokenisation, and filtering.
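To illustrate the document-to-feature-vector step described above, here is a minimal sketch in plain Python. It does not use Mahout or Lucene; the tiny stop-word list, the regex tokeniser, and the function names (`analyze`, `build_vocabulary`, `to_vector`) are all assumptions made for illustration, standing in for what a real analysis chain would do.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real analyzer would use a much larger one.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is"}


def analyze(text):
    """Lowercase, split on non-alphanumeric characters, drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def build_vocabulary(docs):
    """Map each distinct term to a column index, in sorted term order."""
    terms = sorted({t for doc in docs for t in analyze(doc)})
    return {term: i for i, term in enumerate(terms)}


def to_vector(text, vocabulary):
    """Term-frequency vector; terms outside the vocabulary are ignored."""
    counts = Counter(analyze(text))
    # Dict iteration follows insertion order, which matches the indices.
    return [counts.get(term, 0) for term in vocabulary]


training_docs = ["The cat sat", "the dog and the cat"]
vocabulary = build_vocabulary(training_docs)
vector = to_vector("cat cat dog bird", vocabulary)
```

Here the vocabulary is built once from the example documents, and every later document is projected onto the same columns so that vectors stay comparable. Production pipelines typically add weighting such as TF-IDF on top of raw counts.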

The session, “Text Classification Powered by …

In the Lucene/Solr Revolution session, “High Performance JSON Search and Relational Faceted Browsing with Lucene,” Renaud Delbru, Co-Founder of SindiceTech, will discuss how Lucene and Solr can be used for very efficient search of tree-shaped schemaless documents, e.g. JSON or XML, and can then be made to address both graph and relational data search. He will discuss the capabilities of SIREn, a Lucene/Solr plugin they have developed to deal with huge collections of tree-shaped …

The Lucene/Solr Revolution session, “Large Scale Crawling with Apache Nutch and Friends,” by Julien Nioche, Director at DigitalPebble, will give an overview of Apache Nutch. Julien will describe its main components and how it fits with other Apache projects, such as Hadoop, Solr, Tika, and HBase. The second part of the presentation will focus on the latest developments in Nutch, the differences between the 1.x and 2.x branches, and what we can expect …

Solr and Lucene provide a powerful, scalable search server. XQuery provides a rich querying and programming model for use with marked-up text. The session, “Querying Rich Text with XQuery,” by Michael Sokolov, Senior Architect at Safari Books Online, will present Lux, a system that combines these into a powerful XML search engine, which is freely available under an open-source license. Query optimizers often mystify database users: sometimes queries run quickly and sometimes they don’t. …

During the Lucene/Solr Revolution session, “Schemaless Solr and the Solr Schema REST API,” Steve Rowe, Senior Software Engineer at LucidWorks, will show you how and why to use Solr’s new Schemaless Mode, under which document indexing can be performed with no up-front schema configuration. Solr uses content clues to choose among a predefined set of field types and then automatically adds previously unseen fields to the schema.
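The idea of choosing a field type from content clues can be sketched roughly as follows. This is plain Python written for illustration, not Solr's implementation or API: the type names, the order in which types are tried, the single date format, and the helpers `guess_field_type` and `index_document` are all assumptions.

```python
from datetime import datetime


def guess_field_type(value):
    """Pick the first type (from a fixed, ordered set) that the value fits."""
    text = str(value).strip()
    if text.lower() in ("true", "false"):
        return "boolean"
    try:
        int(text)
        return "long"
    except ValueError:
        pass
    try:
        float(text)
        return "double"
    except ValueError:
        pass
    try:
        # Assumed date format for this sketch only.
        datetime.strptime(text, "%Y-%m-%dT%H:%M:%SZ")
        return "date"
    except ValueError:
        pass
    return "text"  # fallback when no narrower type matches


def index_document(doc, schema):
    """Add previously unseen fields to the schema; known fields are kept."""
    for field, value in doc.items():
        if field not in schema:
            schema[field] = guess_field_type(value)
    return schema


schema = {}
index_document(
    {"id": "42", "price": "9.99", "in_stock": "true",
     "added": "2013-11-04T10:00:00Z", "title": "Solr"},
    schema,
)
# A later document does not change the type of an already known field.
index_document({"id": "abc", "pages": "310"}, schema)
```

Note the consequence of guessing from the first value seen: once `id` has been typed as a number, a later non-numeric `id` no longer fits, which is one reason an explicit schema is still worth considering for fields whose type you already know.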

This introductory-level session will take …

During the Lucene/Solr Revolution session, “System Teardown – Solr as a Practical Recommendation Engine,” Michael Hausenblas, Chief Data Engineer at MapR Technologies, will present a detailed tear-down and walk-through of a working soup-to-nuts recommendation engine that uses observations of multiple kinds of behavior to do combined recommendation and cross recommendation. The system is built using Mahout to do off-line analysis and Solr to provide real-time recommendations. The presentation will also include enough theory to …

Many of us tend to hate or simply ignore logs, and rightfully so: they’re typically hard to find, difficult to handle, and cryptic to the human eye. But can we make logs more valuable and more usable if we index them in Solr, so we can search and run real-time statistics on them? Indeed we can, and in this session you’ll learn how to make that happen.

In the first part of the session, …

Kibana4Solr is search-driven, scalable, browser-based, and extremely user-friendly, even for non-technical users. Logs are everywhere. Any device, system, or human can potentially produce a huge amount of information saved in logs. The volume of available logs and their semi-structured nature make meaningful processing in real time quite a difficult task. As a result, valuable business insights stored in logs might never be found. Kibana4Solr is a search-driven approach to handling that challenge. It offers a user-friendly …

In a recent project with the United States Patent and Trademark Office, OpenSource Connections was asked to prototype the next generation of patent search using Solr and Lucene. An important aspect of this project was the implementation of BRS, a specialized search syntax used by patent examiners during the examination process.

In this fast-paced session, John Berryman, Data Architect at Bloom, will relate OpenSource Connections’ experiences and describe how they used a …
