Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Nov 2, 2020
Date Accepted: Oct 14, 2021
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Understanding the Nature of Metadata – A Deep Insight in the Literature
ABSTRACT
Background:
Metadata are created to describe the corresponding data in a detailed and unambiguous way and are used for various applications in different research areas, e.g. data identification and classification. A clear definition of metadata is crucial for their further use. However, experience with the processing and management of metadata has shown that the term "metadata" and its use are not always unambiguous.
Objective:
The goal of this study was to understand the nature of metadata definition and the resulting impact on information reuse.
Methods:
The systematic literature search in this paper was conducted in accordance with the PRISMA guidelines for reporting systematic reviews. Five research questions were identified to streamline the review process, addressing the characteristics, metadata standards, use cases and encountered problems. The review was preceded by a harmonization process in order to achieve a common understanding of the terms used.
Results:
The harmonization process resulted in a clear set of definitions for metadata processing, focusing on data integration. The subsequent literature review was conducted by ten reviewers with different backgrounds, using the harmonized definitions. After several filtering steps to identify the most relevant papers, the review included 81 peer-reviewed papers from the last decade. All five research questions could be answered, resulting in a broad overview of standards, use cases, problems and corresponding solutions for the application of metadata in different research areas.
Conclusions:
Metadata can be a powerful tool for identifying, describing and processing information, but their meaningful creation is costly and challenging. The review process identified many standards, use cases, problems and solutions in dealing with metadata and gave a broad overview of the topic. The harmonized definitions and the new schema should improve the classification and creation of metadata by enabling a common understanding of metadata and their context.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.