links.mako.cc: mako: textprocessing

mako: textprocessing (3)


BERTopic

Looks great, but it is not a mixture model: it apparently assumes that each document contains a single topic.

updated: 2023-05-29, original: 2023-05-29 to bert, computationalmethods, research, researchmethods, text, textprocessing, topicmodels, transformers

trafilatura: Web scraping tool for text discovery and retrieval — trafilatura 0.8.0 documentation

Looks like a pretty seamless way to extract the text of news documents. Seems very simple to use.

2021-02-21 to data, freesoftware, internetresearch, modules, news, nlp, python, research, text, textprocessing, via:bnewbold, web

dkpro-core-asl - DKPro Core ASL

NLP toolkit by the same team that built the Java Wikipedia database indexer/API. Looks pretty good.

updated: 2011-11-08, original: 2011-11-07 to ai, computerscience, freesoftware, java, nlp, software, text, textprocessing, tools
