Apache Nutch, a subproject of Apache Lucene, is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats.Apache Nutch 1.0 contains almost 200 resolved issues and improvements such as Solr Integration, new indexing framework and new scoring framework just to mention a few.Nutch 1.0 is available from here.
- About Lucene Solr
- Reference Materials
Blog | Got A Cool Story? Post It Here.
Connect With Us!
- Fast range faceting using segment trees and the Java ASM library
- Coming Soon to Solr: Efficient Cursor Based Iteration of Large Result Sets
- NYC Solr Meetup: Lucene/Solr - The Default Search Engine for Hadoop (Feb. 12)
Announcements Apache ApacheCon Big data Big data ecosystem Chump cloud Cloud Computing Dismax Enterprise Search Erik Hatcher Faceting Function Query Grant Ingersoll Hadoop ISFDB Lucene Lucene/Solr Lucene/Solr Case Studies Lucene/Solr Revolution Lucene Solr Lucene Solr Revolution LucidWorks LucidWorks Search Mahout MapR Mark Miller NoSQL Nutch Open Source Open Source Search PyLucene Query Parser Result Grouping Road to Revolution Ruby Solr Solr 4.0 SolrCloud Solr reference guide Spatial Search Technical Articles Tika Videos & Podcast Whitepapers
You must be logged in to post a comment.