eprintid: 29769 rev_number: 32 eprint_status: archive userid: 5787 dir: disk0/00/02/97/69 datestamp: 2021-05-28 13:42:28 lastmod: 2021-05-31 12:27:47 status_changed: 2021-05-28 13:42:28 type: conferenceObject metadata_visibility: show creators_name: Venn, Benedikt creators_name: Schneider, Kevin creators_name: Frey, Kevin creators_name: Weil, H. Lukas creators_name: Werner, Johannes creators_name: Wannenmacher, Fabian creators_name: Zajac, Thomas creators_name: von Suchodoletz, Dirk creators_name: Usadel, Björn creators_name: Krüger, Jens creators_name: Garth, Christoph creators_name: Mühlhaus, Timo corp_creators: Computational Systems Biology, University of Kaiserslautern corp_creators: High Performance and Cloud Computing Group (IT Center), University of Tübingen corp_creators: eScience (IT Center), University of Freiburg corp_creators: IBG-4 Bioinformatics, Forschungszentrum Jülich corp_creators: Scientific Germany Visualization Lab, University of Kaiserslautern title: Fostering the democratization of research data by using the Annotated Research Context (ARC) as practical implementation subjects: ddc-004 divisions: i-704000 pres_type: poster abstract: Research in modern life science increasingly depends on the exchange of interdisciplinary expertise and collaboration and the reuse and integration of large data sets. The advancing digitization in particular, opens up new possibilities for scientific knowledge acquisition, especially for the fundamental plant research community. However, challenges exist specifically in capturing the entire research cycle, including contextualization of data according to the FAIR and linked open data principles for the DataPLANT (https://nfdi4plants.de/) community and beyond. Here, we propose a data structure dubbed Annotated Research Context (ARC - https://github.com/nfdi4plants/ARC) which captures the complete research cycle in a structured way, meeting the FAIR requirements with low friction for the individual researcher. ARCs are self-contained and include assay and measurement data, workflows, and computation results, accompanied by metadata in one package. Their structure allows full user-control over all metadata and facilitates usability, access, publication, and sharing of the research. Thereby, ARCs are a practical implementation of existing standards leveraging the advantages of the ISA model, research crates, and the Common Workflow Language. The ARC concept relies on a structure that partitions assay, workflow and results for granular reuse and development. Assays cover biological, experimental, and instrumental data including its self-contained description using the ISA model. Similarly, workflows describe all digital steps of a study and contain application code, scripts and/or any other executable description of an analysis providing the highest degree of flexibility for the scientists. Further, to ensure persistence and reproducibility, workflows include their own containerized running environment. The result data is linked to the workflows by a minimal Common Workflow Language file specifying the workflow input and output. The suggested structure for ARCs is a starting point for individual research projects and defines a framework for the organization, sharing, versioning, reuse (clone), and evolution (fork/pull request) of research projects in a manner familiar from open-source software development. ARCs will form the basis of our collaborative research platform, the DataPLANT Hub, but will also provide an interface with existing infrastructure aiming at compatibility with public services and existing repositories due to its decentralized conception. Additionally, it will be possible for the DataPLANT community to handle ARCs on the de.NBI Cloud and the Storage-for-Science RDM system and to compute on the bwForCluster BinAC, the de.NBI Cloud, and on Galaxy resources. In the future, we envision ARC publications as a central component of knowledge/data communication and sharing, which can be referenced by classical journal publication. As part of the ARC vision, we will discuss mechanisms for measuring data and metadata quality. date: 2021 id_scheme: DOI id_number: 10.11588/heidok.00029769 collection: c-55 ppn_swb: 1759228761 own_urn: urn:nbn:de:bsz:16-heidok-297693 language: eng bibsort: VENNBENEDIFOSTERINGT2021 full_text_status: public place_of_pub: Heidelberg pages: 1 event_title: E-Science-Tage 2021: Share Your Research Data event_location: Heidelberg event_dates: 04.03. - 05.03.2021 citation: Venn, Benedikt ; Schneider, Kevin ; Frey, Kevin ; Weil, H. Lukas ; Werner, Johannes ; Wannenmacher, Fabian ; Zajac, Thomas ; von Suchodoletz, Dirk ; Usadel, Björn ; Krüger, Jens ; Garth, Christoph ; Mühlhaus, Timo (2021) Fostering the democratization of research data by using the Annotated Research Context (ARC) as practical implementation. [Conference Item] document_url: https://archiv.ub.uni-heidelberg.de/volltextserver/29769/7/Fostering_democratization_of_research_data_E-Science-Tage_2021.pdf