Author | Grant Ingersoll
Home » Articles posted by Grant Ingersoll ( Page 2 )
For all of those interested in Apache Mahout and scalable machine learning, Lucid Imagination is hosting a Mahout Users Meeting at it’s new office in Redwood City on Nov. 29th. Doors open at 6:30 pm. The night will feature two speakers, Ted Dunning of MapR Technologies and Grant Ingersoll of LucidWorks, along with a social gathering with food and drinks.For more details and to RSVP, please see…
With another Lucene Eurocon successfully behind us (thanks Barcelona, you’ve been awesome!), it’s time to say hello to Vancouver for ApacheCon….  I’ll leave it to others to fill in the blanks on the Barcelona conference other than to say that I am continually amazed by the vibrancy of the Lucene/Solr community and especially grateful to all the committers and contributors who take the time to show up and give talks about how they leverage
You know your (technical) baby is (almost) grown up when the book on the project finally comes out.  Such is the case for Apache Mahout, thanks to Manning Publications shipping Mahout in Action… this week.Mahout in ActionSo, before I start into my review, let me first say congratulations to Sean, Robin, Ted, Ellen and Manning for producing such an excellent product.   The simplest praise I can give it is to put it on the same level
From a quiet start as a pet project to a giant in the industry, Apache Lucene is definitely the little (search) engine that could.  On September 18th, 2001 (at 16:29:48 UTC) Jason Van Zyl made the first official import of Doug Cutting’s Lucene project (which started in 1997 and was hosted on SourceForge) into Apache’s Jakarta project (check out the Wayback machine…).And while I wasn’t around in the beginning, I thought I would
Many times, clients ask us to help them estimate memory usage or disk space usage or to share benchmarks as they build out there search system. Doing so is always an interesting process, as I’ve always been wary of claims about benchmarks (for instance, one of the old tricks of performance benchmark hacking is to “cat XXX > /dev/null” to load everything into memory first, which isn’t what most people do when running their system)…
It’s official, Apache Lucene 3.1.0 and Apache Solr 3.1.0 are officially released.  Keep an eye here for more on the new features and functionality.Here’s the release announcements as just sent to the mailing lists:
March 2011, Apache Lucene 3.1 available
The Lucene PMC is pleased to announce the release of Apache Lucene 3.1.

This release contains numerous bug fixes, optimizations, and
improvements, some of which are highlighted below.  The release
is available for immediate 
Changing Bits: Lucene’s FuzzyQuery is 100 times faster in 4.0….So cool…  I’m in awe daily of what happens in Lucene and Solr open source.  Mike’s post is just a small example of what goes on.  Perhaps Mike or Muir or someone will writeup on how Lucene has improved it’s Unit Testing by several orders of magnitude by some incredibly cool randomization techniques and the use of Jenkins/Hudson.
While we make some of our money off of professional services and support of Apache Lucene and Solr, I thought I would pass along a few freebies when it comes to improving your Lucene or Solr application.  These are things that we usually end up telling most clients at some stage of the game.  Many of them fall under the “broken windows…” theory of software development, so don’t expect anything too earth shattering.
For those of you in the Raleigh, Durham, Chapel Hill (aka RTP) area, there are a couple of upcoming events on Lucene/Solr that may be of interest:Tomorrow, Feb. 11, I will be giving a talk at UNC-Chapel Hill on Apache Lucene and Solr.  The talk is open to the public.  You can read more about the talk at:Introduction to Open Source Search with Apache Lucene and Solr – CRADLE Seminar |….