ABSTRACT: Capturing, sharing, and publishing cancer biomarker research data are all fundamental challenges of enabling new opportunities to research and understand scientific data. Informatics experts from the National Cancer Institute's (NCI) Early Detection Research Network (EDRN) have pioneered a principled informatics infrastructure to capture and disseminate data from biomarker validation studies, in effect, providing a national-scale, real-world successful example of how to address these challenges. EDRN is a distributed, collaborative network and it requires its infrastructure to support research across cancer research institutions and across their individual laboratories. The EDRN informatics infrastructure is also referred to as the EDRN Knowledge Environment, or EKE. EKE connects information about biomarkers, studies, specimens and resulting scientific data, allowing users to search, download and compare each of these disparate sources of cancer research information. EKE's data is enriched by providing annotations that describe the research results (biomarkers, protocols, studies) and that link the research results to the captured information within EDRN (raw instrument datasets, specimens, etc.). In addition EKE provides external links to public resources related to the research results and captured data. EKE has leveraged and reused data management software technologies originally developed for planetary and earth science research results and has infused those capabilities into biomarker research. This paper will describe the EDRN Knowledge Environment, its deployment to the EDRN enterprise, and how a number of these challenges have been addressed through the capture and curation of biomarker data results.
Cancer biomarkers: section A of Disease markers 01/2011; 9(1-6):511-30. · 1.08 Impact Factor