Wikidata Toolkit 0.3.0
The third release of Wikidata Toolkit is significantly faster than v0.2.0, restores compatibility with current Wikidata file exports, and adds many new examples. The main changes are:
- Support for the new JSON format now used in Wikidata dump files
- Much faster reading of data files (about 15 minutes for the whole dump)
- Extended example module with more, faster, easier-to-read programs
- More convenient ways to construct data objects using the Datamodel class
- Dump files are now downloaded to temporary files first to cope with aborted downloads
- Additional helper methods and iterators for accessing data objects with less code
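
The temporary-file approach to downloads mentioned above can be illustrated with a minimal sketch. This is not the toolkit's own code: the class name `SafeDownload`, the method `fetchTo`, and the `.part` suffix are hypothetical names chosen for illustration, and only standard `java.nio.file` calls are used. The idea is that the final file name only ever appears once the data has been copied completely, so an aborted download cannot be mistaken for a finished one.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical helper, not part of Wikidata Toolkit.
public class SafeDownload {

    /**
     * Copies the stream into a temporary file next to the target and moves
     * it to the final name only after the copy has completed. If the copy
     * fails, the temporary file is deleted and the target is left untouched.
     */
    public static void fetchTo(InputStream source, Path target) throws IOException {
        Path dir = target.toAbsolutePath().getParent();
        Files.createDirectories(dir);
        // Temporary file in the same directory, so the later move stays
        // on one file system.
        Path temp = Files.createTempFile(dir, "download-", ".part");
        try {
            Files.copy(source, temp, StandardCopyOption.REPLACE_EXISTING);
            Files.move(temp, target, StandardCopyOption.REPLACE_EXISTING);
        } finally {
            // No-op after a successful move; cleans up after a failure.
            Files.deleteIfExists(temp);
        }
    }
}
```

A caller would open an input stream for the dump file and pass it to `fetchTo` together with the desired local path. Code that later looks for the dump only needs to check whether the target file exists.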
Artifacts are also released on Maven Central, and using them is strongly recommended. To do so, insert the following dependency into your pom.xml:
<dependency>
    <groupId>org.wikidata.wdtk</groupId>
    <artifactId>wdtk-dumpfiles</artifactId>
    <version>0.3.0</version>
</dependency>
Similar configurations can be used for other modules (e.g., wdtk-rdf or wdtk-datamodel), but most dependencies will be resolved for you by Maven. Detailed instructions are given in the Wikidata Toolkit user documentation.