Uncategorized Links: 11-22-2010 November 23, 2010 ajohnson Leave a comment boilerpipe Cool library for text extraction from HTML. (categories: extraction html boilerpipe algorithms )