ANNIS is an open-source web application that provides access to multi-layer richly annotated corpora. It provides search and visualization functionality for complex constellations of token and token span-based annotations, hierarchical graph structures such as syntax trees or rhetorical annotations, arbitrary labeled pointing relations, used for example in coreference annotation, and also for metadata. Multimodal data can also be aligned with corpora, though this is limited at present by a lack of streaming functionality.
ANNIS2, The current iteration of the ANNIS software, is being used and developed within SFB 632 (“Information structure: the linguistic means for structuring utterances, sentences and texts”). In the SFB, a number of different projects collect and annotate data according to the common SFB Annotation Standard. This data, which is annotated using a small set of annotation tools (EXMARaLDA, annotate (TigerXML), MMAX, RSTTool), is mapped to the encoding standard of the SFB, PAULA (Potsdamer Austauschformat für Linguistische Annotation / Potsdam Interchange Format for Linguistic Annotation), which serves as the basis for further processing.
TLA-team: ANNIS is being used by a number of German researchers as a tool bringing together a variety of corpora and analysing them. It makes use of a relational database, i.e. it is not based on a standardized explicit format and thus certainly not meant for long term storage, but it can be optimized towards various access patterns.