Welcome to DKPro

Ready-to-use software components for natural language processing, based on the Apache UIMA framework.
More ›

Collection of open-licensed statistical tools, currently including correlation and inter-rater agreement methods.
More ›

Search-based annotation tool that helps distributed annotation teams find infrequent linguistic phenomena in large corpora.
More ›

Tools for processing the CommonCrawl corpus, including Creative Commons license detection, boilerplate removal, language detection, and near-duplicate removal.
More ›
Framework for keyphrase extraction.
More ›