It’s been a while since I mentioned anything about Lucene, my favorite Java-based open source indexing and search library (which I built the karakoram spider / search application around). Doug Cutting, who created Lucene and who has spent the last couple of years working on Nutch, was recently hired by Yahoo!. I just have a couple of questions:
a) Why would Yahoo! want to hire a guy writing a Java-based web crawler and indexer?
b) Where does he get all the cool names? Nutch? Hadoop?
c) How cool does Hadoop sound? A Hadoop Distributed Filesystem (HDFS) and an implementation of MapReduce. Hmm... where else have I heard those terms bandied about?
I think the answer to your questions can be found here: http://www.sematopia.com/?p=73
In any case, it's really interesting to see these developments unfold.