Interfaces

The dataset comes with two interfaces that were used to annotate and inspect the data. Both these tools were built using Flask, which means they work on every platform, in your favorite web browser. You can download them below (right click, save as).

If you would also like to have the uncorrected automatic transcriptions to try out the annotation tool, download them here.

Requirements

If you have no experience with Python, we highly recommend using the Anaconda distribution.

Annotation tool

The annotation tool was used to manually transcribe and annotate the automatically generated transcriptions of the spoken descriptions.

Picture of the annotation interface.

Exploration tool

The exploration tool allows you to browse through the corpus, but also search for specific terms or annotations. If you search for a particular term, the screen will display the amount of hits, along with the first example. You can then browse through all the results. If you are done exploring the results, click "back to browse" to go back to browsing the entire corpus. Because the annotations take up a lot of space, they can be toggled on or off.

Picture of the browser interface.