EPIGRAF is a research platform for collecting, annotating, linking and publishing multimodal text data. The data model supports research databases ranging from epistolary editions to social media corpora. Source code and developer documentation are available on GitHub. You want to learn more about the application or try it out? Feel free to contact us!
EPIGRAF includes components for the entire data life cycle:
- Collection: Data sets can be both edited in the application as well as imported from files. The core concepts of Epigraf are articles and properties used in articles.
- Annotation: Every article is composed of sections, that can be flexibly combined, containing text and all relevant metadata (descriptions, comments, categorizations via vocabularies) as well as embedded files or images. A configurable toolbar is available for the annotation of texts.
- Linking: In order to publish data as Linked Open Data according to the FAIR principles (Wilkinson et al. 2016), norm data identifiers (IRIs; W3C 2014) can be created for each article and category. This allows data sets to be reconciled between different databases.
- Analysis: A faceted full-text search is available for the entire database. The vocabularies used for indexing can be used to dive into the data.
- Publication: Camera-ready documents, for example in Word format or in standardized document formats such as TEI (TEI 2022; Elliott et al. 2020) , can be generated using a pipeline system and XSL stylesheets. If required, data sets can be made publicly available in the web interface. A programming interface makes the data available in CSV, JSON, or XML format. For interacting with the data, R and Python packages are under developement.
- Collaboration: Epigraf supports working collaboratively on a database. For the coordination of multiple workplaces, wikis and a file repository are used.
EPIGRAF emerged from the inter-academic edition project "The German Inscriptions of the Middle Ages and the Early Modern Period". It is used in the nine inscription research centers of the six participating academies of sciences. The application is being developed by the Digital Academy of Sciences and Literature | Mainz and the Digital Media & Computational Methods Research Group at the University Münster. All data collections are published in print in the series "Die Deutschen Inschriften" and in digital form on Deutsche Inschriften Online (DIO). The publication of structured research data via application programming interfaces and downloads is currently under development.