Apache Nutch, a subproject of Apache Lucene, is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats.Apache Nutch 1.0 contains almost 200 resolved issues and improvements such as Solr Integration, new indexing framework and new scoring framework just to mention a few.Nutch 1.0 is available from here.
- About Lucene Solr
- Reference Materials
Blog | Got A Cool Story? Post It Here.
Connect With Us!
- Lucene/Solr Revolution 2014: Community Voting Ends June 30
- End-to-end Payload Example in Solr
- What is Search Relevancy?
Announcements Apache ApacheCon Big data Big data ecosystem Chump cloud Cloud Computing Dismax Enterprise Search Erik Hatcher Faceting Function Query Grant Ingersoll Hadoop ISFDB Lucene Lucene/Solr Lucene/Solr Case Studies Lucene/Solr Revolution Lucene Solr Lucene Solr Revolution LucidWorks LucidWorks Search Mahout MapR Mark Miller NoSQL Nutch Open Source Open Source Search PyLucene Query Parser Release Result Grouping Road to Revolution Ruby Solr Solr 4.0 SolrCloud Spatial Search Technical Articles Tika Videos & Podcast Whitepapers