Impact of clinical data veracity on cancer genomic research
Menée à partir d'une simulation utilisant une série de données génomiques, cette étude analyse l'impact des erreurs et des omissions concernant les informations cliniques accompagnant les données génomiques sur l'interprétation de ces dernières
Genomic analysis of tumours is transforming our understanding of cancer. However, while a great deal of attention is paid to the accuracy of the cancer genomic data itself, less attention has been paid to the accuracy of the associated clinical information that renders the genomic data useful for research. In this Brief Communication, we suggest that omissions and errors in clinical annotations have a major impact on the interpretation of cancer genomic data. We describe our discovery of annotation omissions and errors when reviewing an already carefully annotated colorectal cancer gene expression dataset from our laboratory. The potential significance of clinical annotation omissions and errors was then explored using simulation analyses with an independent genomic dataset. We suggest that the completeness and veracity of clinical annotations accompanying cancer genomic data requires renewed focus by the oncology research community, both when planning new collections and when interpreting existing cancer genomic data.
JNCI Cancer Spectrum , article en libre accès, 2021