On April 10th, 2006 I’ll be starting a new job at Jive Software in Portland, Oregon. According to Google Earth, Portland is about 2500 miles away from Mattapoisett as the crow flies so I hope you’ll understand if I don’t post here for the next couple weeks.
Monthly Archives: March 2006
Links: 3-15-2006
- Search Theory – Nutch Wiki
Publicly available white papers, best practices, theories and publications about search related topics.
(categories: nutch search theory )
Links: 3-14-2006
- Tomcat Probe
A self-contained web application, which is designed to dig into Tomcat internal objects to display invaluable runtime information about deployed applications and Tomcat instance in general.
(categories: sysadmin tomcat )
Nutch, Yahoo!, and Hadoop
It’s been awhile since I mentioned anything about Lucene, my favorite Java based open source indexing and search library (which I built the karakoram spider / search application around). Doug Cutting, who created Lucene and who has spent the last couple years working on Nutch, was recently hired by Yahoo!. I just have a couple questions:
a) why would Yahoo want to hire a guy writing a Java based web crawler and indexer?
b) where does he get all the cool names? Nutch? Hadoop?
c) How cool does Hadoop sound? Hadoop Distributed Filesystem (HDFS) and an implementation of MapReduce. Hmm.. where else have I heard about those terms bantered about?
Links: 3-10-2006
- oreilly.com — Online Catalog: Baseball Hacks
Datamining and baseball. Two great subjects combined.
(categories: baseball hacks )