Skip to main content

Data management

Data Management is a set of practices and techniques used by researchers to ensure that their data is organised, structured and easily reusable for future research

Resources

  • Using OpenCV for Face Detection

    EN
    OpenCV is a very popular, free and open source software system used for a large variety of computer vision applications. This article is intended to help you get started in experimenting with OpenCV using an example of face detection in images as a case study.
  • Building and Linking Humanities' Digital Spatial Infrastructures

    EN
    This workshop, focussing on "Spatial data medieval to modern", is the first of a series of workshops from the NOS-HS project "Linking, Building, and Sustaining Humanities Digital Spatial Infrastructures for Research in the Nordic Countries". The main aims of this workshop were to define key concepts (spatial infrastructures, Linked Open Data, metadata, ontology), outline major challenges in the field, and to provide an opportunity to share experiences of addressing the issues in individual and national projects across the Nordic countries.
    Authors
    • Alexandra Petrulevich
    • Sara Ellis-Nilsson
    • Peder Gammeltoft
    Read more
  • Exploratory Topic Modelling in Python

    EN
    Topic modelling is a technique by which documents within a corpus are clustered based on how certain groups of terms are used together within the text. The commonalities between such term groupings tend to form what we would normally call “topics”, providing a way to automatically categorise documents by their structural content, rather than a more metadata-based knowledge system. Using resources held with EHRI's collections, this notebook offers learners an introduction to 'LDA' topic modelling using Python in a step-by-step guide.
  • EHRI in TEITOK

    EN
    This blog examines TEITOK, which is a corpus framework used as an alternative to Omeka. TEITOK is centered around texts and is similar to the Omeka interface – both allow you to search through the documents, and display the transcription. The main difference is that Omeka treats the transcription as an object description, whereas TEITOK not only shows that a word appears in a document, but also where it appears and how it is used.
  • CLS-INFRA Training School on Data and Annotation

    EN
    This event, organised and provided by the CLS INFRA project, offers an introductory course to textual data annotation. The workshop introduces learners to how to edit, annotate, and query a text corpus without a single line of code, how to structure texts with the XML-TEI, and how to run an NLP tool to add linguistic information.
    Authors
    • Lisanne van Rossum
    • Maarten Janssen
    • Silvie Cinková
    Read more
  • DARIAH-DE Publikator Video Tutorial

    EN
    This video tutorial provides a step-by-step guide through the DARIAH-DE Publikator, a tool that enables its users to upload data(-sets) into the DARIAH-DE Repository and index them with metadata. The tool is part of the larger DARIAH-DE Data Federation Architecture, aiming to support the FAIRification of research data with regards to the research data life cycle.
  • EOSC for Arts and Humanities Scholars

    EN
    As part of the DARIAH Friday Frontiers in-house webinar series, Erzsébet Tóth-Czifra and Laure Barbot provide an introduction to EOSC and open science projects for researchers and practitioners working in the Arts and Humanities. They include a brief walk through the EOSC landscape, and how different EOSC projects are working towards ensuring open science for all.
  • Polifonia - Making sense of musical heritage on the web

    EN
    Polifonia is a H2020 project that aims at harmonising diverse information sources in the landscape of musical heritage and scholarship. The challenges are many, from data management, to knowledge organisation and dissemination barriers. In this talk, an ontology driven strategy to organise, share, and interact with the wealth of music data on the web, is presented. This include solutions to engage with scholars and lay persons, with an emphasis on data visualisation and storytelling.
  • Archiving Activism - Archiving Reproductive Health

    EN
    This video presentation from Clare Lanigan at the Digital Repository of Ireland (DRI) on the 'Archiving Reproductive Health' project, and discusses archival activism more broadly. In particular she gives a demonstration of the current collections available through the archive, provides details of how items were compiled, and also discusses the more pastoral and welfare issues for archival staff when dealing with items relating to political or social activism.
  • Mixed Reality for CoDesign, Sustainable Urban Reactivation in Historic Cities

    EN
    Learn how community-building projects can engage local stakeholders, pull insights from diverse perspectives, and influence urban redevelopment authorities. Hear state-of-the-art theories and approaches to sustainable heritage, with reflections from experienced architects, academics, and urban thinkers. Identify critical issues of urban gentrification, place-making, and the pressures faced by historic urban neighbourhoods in Southern Europe. See state-of-the-art technologies deployed for rapid 3D reconstruction, documentation, and urban co-design with non-experts. We specifically explore augmented reality as a possible solution to scalable public outreach.
    Authors
    • Carlos Smaniotto
    • Georgios Artopoulos
    • Fabio Montagnino
    Read more
  • Introduction to Persistent Identifiers

    EN
    This webinar focuses on 'Persistent Identifiers' (PIDs) and basic concepts of referencing objects. It discusses why so many PID platforms exist, presents aspects of sustainability, demonstrates some added-value services, and talks about practical experiences and open issues.