NLP toolkit by the same team that built the Java Wikipedia database indexer/API. Looks pretty good.
Stet is a cool piece of software that was used to collect public comments during the GPLv3 drafting process. I think there are better tools now, but it's nice that the code is easily available online.