Tags: text (22)

Sort by: Date / Title / URL

  1. Beautiful.
  2. 2007-12-19 to , , by mako - Archived Link
  3. Stet is a cool piece of software that was used in the GPLv3 process. I think there are better tools now, but it's nice that the code is now easily available online.
  4. Another cool looking tool from The King.
  5. I don't understand how this is different than normal wdiff but I like wdiff a lot and have heard that this software is great.
  6. Take an arbitrary regex and create one monster optimized overlapping regex that matches it all. Never loop over regexes again.
    2011-01-28 to , , , , by mako - Archived Link
  7. 2010-07-25 to , , , , by mako - Archived Link
  8. Nice example of the ? replacing the smart quotes.
  9. Cute.
    2009-08-31 to , , , , by mako - Archived Link
  10. "The policy turnaround faces sjpeg opposition in Congress, which twice authorized Constellation with bipartisan support. Even in today’s polarized political environment on Capitol Hill, opposition to the Obama plan last week also was bipartisan."
  11. 2008-10-17 to , , , by mako - Archived Link
  12. NLP toolkit by the same team that built the Java Wikipedia database indexer/API. Looks pretty good.
    updated: 2011-11-08, original: 2011-11-07 to , , , , , , , , by mako - Archived Link
  13. Looks great, but not a mixture model (i.e., apparently it assumes that each document contains a single topic...)
  14. Looks like a pretty seamless way to extract the text of news documents. Looks very simple and pretty easy.

First / Previous / Next / Last / Page 1 of 1