DKPro JWPL - Welcome

JWPL (Java Wikipedia Library) is a free, Java-based application programming interface that provides access to all information in Wikipedia.

Core features:

  • Fast and efficient access to Wikipedia
  • Parser for the MediaWiki syntax
  • Language independent

In addition to the core functionality, JWPL allows access to Wikipedia’s edit history with the Wikipedia Revision Toolkit.

Features of the Wikipedia Revision Toolkit:

  • Tools for reconstructing past states of Wikipedia (TimeMachine)
  • Efficient access to all article revisions (RevisionMachine)
  • Dedicated revision storage format

How to cite?

If you use the Wikipedia Revision Toolkit (RevisionMachine, TimeMachine) in scientific work, please cite the ACL 2011 demo paper:

Oliver Ferschke and Torsten Zesch and Iryna Gurevych (2011). Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’s Edit History. In: Proceedings of the ACL-HLT 2011 System Demonstrations. (pdf) (bib)

If you only use JWPL Core (API, DataMachine), please cite the LREC 2008 paper:

Torsten Zesch and Christof Müller and Iryna Gurevych (2008). Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary. In: Proceedings of the 6th International Conference on Language Resources and Evaluation. (pdf) (bib)

About

This project was initiated under the auspices of Prof. Iryna Gurevych, Ubiquitous Knowledge Processing Lab (UKP), Technische Universität Darmstadt.

It is now maintained by

License

DKPro JWPL is available under the Apache Software License (ASL) version 2.

Note that while the source code of DKPro JWPL itself is licensed under the ASL, individual components might use third-party libraries or products that are not licensed under the ASL. Please make sure that you are aware of the third-party licenses and respect them.