CT-EBM-SP - Corpus of Clinical Trials for Evidence-Based-Medicine in Spanish
Data and Resources
Interoperability
Groups
Additional Info
Field | Value |
Identifier | http://hdl.handle.net/10261/285045 |
---|---|
Author | |
Project | |
Name | CT-EBM-SP - Corpus of Clinical Trials for Evidence-Based-Medicine in Spanish |
Description |
A collection of 1200 texts (292 173 tokens) about clinical trials studies and clinical trials announcements in Spanish: - 500 abstracts from journals published under a Creative Commons license, e.g. available in PubMed or the Scientific Electronic Library Online (SciELO). - 700 clinical trials announcements published in the European Clinical Trials Register and Repositorio Español de Estudios Clínicos. Texts were annotated with entities from the Unified Medical Language System semantic groups: anatomy (ANAT), pharmacological and chemical substances (CHEM), pathologies (DISO), and lab tests, diagnostic or therapeutic procedures (PROC). 46 699 entities were annotated (13.98% are nested entities). 10% of the corpus was doubly annotated, and inter-annotator agreement (IAA) achieved a mean F-measure of 85.65% (±4.79, strict match) and a mean F-measure of 93.94% (±3.31, relaxed match). |
Themes |
|
Tags | |
Creation date | 2021-02-22T00:00:00 |
Last updated | 2021-02-22T00:00:00 |
Refresh rate | |
Languages |
|
Geographic coverage |
|
Geographic coverage (International) | |
Time coverage | |
Effective resource | |
Related resources | |
Normative |
|
Institute | |
Publisher | Publicador - Digital.CSIC |
Observations |
Recomended citation: A clinical trials corpus annotated with UMLS© entities to enhance the access to Evidence-Based Medicine. Leonardo Campillos-Llanos, Ana Valverde-Mateos, Adrián Capllonch-Carrión, Antonio Moreno-Sandoval. BMC Medical Informatics and Decision Making (2021) DOI: 10.1186/s12911-021-01395-z |