ArticlePDF Available

Practical issues in using SNOMED CT as a reference terminology

Authors:

Abstract and Figures

SNOMED CT was created by the merger of SNOMED RT (Reference Terminology) and Read Codes Version 3 (also known as Clinical Terms Version 3). SNOMED CT is considered to be among the most extensive and comprehensive biomedical vocabularies available today. It is considered for use as the Reference Terminology of various institutions. We review the adequacy of SNOMED CT as a Reference Terminology and discuss the issues in its use as such. We discuss issues with content coverage of various clinical domains, data integrity and validity, and the update frequency of SNOMED CT, and why SNOMED CT alone is not adequate to serve as the Reference Terminology of a healthcare organization.
Content may be subject to copyright.
Practical Issues in Using SNOMED CT as a Reference Terminology
Senthil K. Nachimuthu, MD, Lee Min Lau, MD, PhD
3M Health Information Systems, Salt Lake City, Utah, USA
Department of Biomedical Informatics, University of Utah, Salt Lake City, Utah, USA
Abstract
SNOMED CT® was created by the merger of SNOMED RT
(Reference Terminology) and Read Codes Version 3 (also
known as Clinical Terms Version 3). SNOMED CT is consid-
ered to be among the most extensive and comprehensive bio-
medical vocabularies available today. It is considered for use
as the Reference Terminology of various institutions. We re-
view the adequacy of SNOMED CT as a Reference Terminol-
ogy and discuss the issues in its use as such. We discuss issues
with content coverage of various clinical domains, data integ-
rity and validity, and the update frequency of SNOMED CT,
and why SNOMED CT alone is not adequate to serve as the
Reference Terminology of a healthcare organization.
Keywords:
SNOMED CT, reference terminology, implementation issues.
Introduction
SNOMED CT® (Systemized Nomenclature of Medicine
Clinical Terms) is one of the most extensive and comprehen-
sive biomedical vocabularies currently available, with 370,702
concepts (US Edition, core distribution, July 2006). It is rec-
ommended for use as the provider of data dictionary for sever-
al electronic health information systems. The SNOMED CT
Technical Reference Guide January 2006 release mentioned
that “SNOMED CT is the most comprehensive clinical refer-
ence terminology available in the world.[1] The July 2006
release of this document redefines it as “SNOMED CT is a
comprehensive clinical terminology that provides clinical con-
tent and expressivity for clinical documentation and report-
ing.[2]
We examine the issues concerning use of SNOMED CT as the
exclusive reference terminology for an electronic clinical in-
formation system. These issues include the coverage of con-
cepts in various clinical domains included in the terminology,
data consistency and the frequency of update. We examine
these issues, and provide comparisons of several clinical do-
mains of content present in SNOMED CT and another refer-
ence terminology. We use a set of objective criteria to evaluate
SNOMED CT, and these criteria and methods may be used to
evaluate any clinical vocabulary for its use in a clinical infor-
mation system.
These three analyses have different objectives and assess dif-
ferent aspects of SNOMED CT. Hence, the materials and
methods, results and discussion are presented individually for
these three analyses for the sake of coherence.
1. Coverage of Content
Materials and Methods
All the active concepts in SNOMED CT (US Edition, July
2006 release) were included in the first experiment. A refer-
ence terminology created by 3M Health Information Systems,
namely the 3M Healthcare Data Dictionary (3M HDD) was
used to aid these comparisons.
The 3M HDD is a concept-based reference terminology that
serves as the interlingua for multiple biomedical vocabularies
which are mapped to it. The content included in the HDD is
derived by importing various standardized terminologies such
as SNOMED CT, LOINC, ICD-9-CM, CPT, HCPCS, First
Databank NDDF, etc. The HDD also includes various legacy
(homegrown) vocabularies such as the US Department of De-
fense lab, pharmacy and other clinical codesets and those from
various other community and university hospitals.
Two subsets of the HDD were created for these comparisons
one including content provided by SNOMED CT alone, and
another including content from the rest of the HDD excluding
SNOMED CT. These two HDD subsets will here onwards be
referred to as SNOMED CT ONLY dataset and NON
SNOMED CT dataset”. The reason that the rest of the HDD
was used for comparison is to provide a real world equivalent
of concepts present in reference terminologies of very large
enterprises. Similar results may be expected by comparing the
content coverage of SNOMED CT with that of the enterprise
reference terminology of any large multi-facility multi-
specialty hospital organization, or to a standard reference ter-
minology such as the UMLS.
SNOMED CT includes various domains of clinical concepts.
The content coverage of SNOMED CT is organized into 19
top-level hierarchies shown in Table 1. Various subhierarchies
of concepts are organized under these nineteen top level hier-
archies. The fully specified name of each SNOMED CT con-
cept includes a suffix which roughly corresponds to the clini-
cal domain that concept belongs to.
The suffix helps to disambiguate between two concepts that
might belong in different top level hierarchies, but have the
same textual name. For example, the suffix can help to differ-
entiate between a disease and a finding at a glance without
having to query its location in the SNOMED CT hierarchy.
Two such examples are Endometriosis (disorder) vs. Endome-
triosis (body structure) and Aspirin (substance) vs. Aspirin
(product).
Table 1. SNOMED CT top-level hierarchies
Body structure
Procedure
Clinical finding
Qualifier value
Environment or geographical
location
Situation with explicit
context
Event
Record artifact
Linkage concept
Social context
Observable entity
Special concept
Organism
Specimen
Substance
Staging and scales
Physical force
Pharmaceutical / biolog-
ic product
Physical object
Table 2 provides a partial listing of suffixes. Each top-level
hierarchy may contain concepts with one or more suffixes.
Table 2. SNOMED CT Concept Suffixes (partial list)
body structure
organism
disorder
specimen
procedure
substance
event
occupation
finding
product
We chose some of the clinical domains denoted by the top-
level hierarchies or concept suffixes listed in tables 1 and 2 to
compare the extent of coverage. We compared the concepts
present in SNOMED CT ONLY dataset with the NON
SNOMED CT dataset. The following results reflect compari-
sons of several domains of content included in SNOMED CT
with NON SNOMED CT content included in the real world
clinical information systems of various hospitals, both home-
grown and vendor-developed. Examples of concepts present in
these datasets are given for some of the domains to better ex-
plain the results.
Results
a. Pharmaceutical Ingredients
The SNOMED CT Substance top-level domain contains
pharmaceutical ingredients under the ‘Drug or medicament
(substance)’ subhierarachy. This is a clearly definable clinical
domain without any ambiguity, and hence this domain was
used for comparison. It is more constrained and universal, and
it is included in the core SNOMED CT data. This is in contrast
with pharmaceutical products which vary from one country to
another. In case of the US, pharmaceutical products are in-
cluded in the SNOMED CT US Drugs Extension, which is not
available free of cost, rather than the core US Edition.
Comparison was done by querying the specific domain of con-
tent in the NON SNOMED CT dataset and the SNOMED CT
ONLY dataset. The results of comparison of various ‘levels’
of pharmaceutical ingredients are presented in Table 3. First
Databank’s National Drug Data File (NDDF) was one of the
primary pharmacy content providers for the 3M HDD.
Comparisons were made at the level of Therapeutic class (An-
tibiotics, Antipyretics, Sedatives, etc.), Drug Class (Penicil-
lins, Aminoglycosides, Calcium Channel Blockers, etc.), In-
gredients (Diclofenac, Ampicillin, etc) and Ingredient-salt
(Diclofenac Sodium, Diclofenac Potassium, Ampicillin Sodi-
um, etc) levels. At all these levels, the NON SNOMED CT
dataset had significantly better coverage for pharmaceutical
ingredients than the SNOMED CT ONLY dataset. For exam-
ple, ‘Ketanserin and ‘Moxaverine’ are some concepts present
in NON SNOMED CT dataset but not in SNOMED CT
ONLY dataset.
Table 3. Pharmaceutical Ingredients
DOMAIN
NON
SNOMED CT
SNOMED
CT ONLY
Therapeutic Class
134
22
Drug Class
873
118
Ingredient
7,265
2,647
Ingredient Salt
3,167
1,135
b. Microorganisms
Microorganisms constitute another domain of content that can
be clearly defined, and hence the coverage of this domain was
compared between SNOMED CT ONLY dataset and NON
SNOMED CT dataset. We compared coverage of the entire
microorganisms domain as well as some subdomains.
The following results in Table 4 show that the SNOMED CT
ONLY dataset had about 80% to 90% content coverage com-
pared as that of the microorganisms domain of the NON
SNOMED CT dataset. For example, ‘Heterosporium species’
is a concept present in NON SNOMED CT dataset but not in
SNOMED CT ONLY dataset.
Table 4. Microorganisms
DOMAIN
SNOMED
CT ONLY
Microorganism
11,516
Bacterium
7,393
Fungus
1,321
Parasite
2,899
Virus
1,835
c. Problems
The clinical problems domain is among those that are not
standardized and are highly variable. This domain will under-
go much refinement over time. However, this domain is often
used for picklists or dictionaries to manage electronic problem
lists. This clinical domain is becoming increasingly important
as clinical information systems try to implement problem-
oriented documentation and maintain a longitudinal or histori-
cal record of patients’ problems. Several authors have found
that SNOMED CT has around 80% to 90% coverage of prob-
lems included in medical problem list domains of individual
enterprise reference terminologies.[3][4][5]
We compared the problems domain of SNOMED CT with that
of our reference terminology, which is the union of multiple
enterprise reference terminologies, and our results indicate
approximately 75% coverage. This domain is highly subjective
and variable; however, the results still show very significant
differences. The high variability of the domain is reflected in
the concepts. For example, the NON SNOMED CT dataset
contains the concept ‘Intercostal bulging’ which is not present
in the SNOMED CT ONLY dataset.
Table 5. Problems
DOMAIN
NON
SNOMED CT
SNOMED
CT ONLY
Problem
48,971
36,633
d. Specimens
Clinical specimens are an important part of medical records,
and are an essential component of most biochemistry, patholo-
gy, microbiology and immunology lab orders. The following
table shows the numbers of clinical specimens present in the
NON SNOMED CT dataset and the number of specimens in
SNOMED CT ONLY dataset. SNOMED CT ONLY dataset
contained a significantly larger number of specimens com-
pared to the NON SNOMED CT dataset, which is not surpris-
ing considering that SNOMED was created by the College of
American Pathologists. For example, the concepts ‘Liver cyst
fluid’ is present in NON SNOMED CT dataset but not in the
SNOMED CT ONLY dataset. On the contrary, the concept
‘BCG site swab’ is present in SNOMED CT ONLY dataset
but not in the NON SNOMED CT dataset.
Table 6. Specimens
DOMAIN
NON
SNOMED CT
SNOMED
CT ONLY
Clinical specimen
823
1,048
e. Body Structure
The body structure domain contains concepts denoting various
normal and abnormal anatomical structures at the organ, organ
system or tissue level. The results in Table 7 show that the
NON SNOMED CT dataset has significantly higher coverage
compared to the SNOMED CT ONLY dataset.
Table 7. Body Structure
DOMAIN
NON
SNOMED CT
SNOMED
CT ONLY
Body Structure
34,257
31,999
f. Procedures
The procedures domain includes medical and surgical proce-
dures performed for diagnostic or therapeutic procedures, ad-
ministrative procedures (admission, discharge, transfer, bill-
ing, etc.), and so on. Procedures form an important part of
several orders, clinical or administrative. The following results
in Table 8 show that the NON SNOMED CT dataset contained
significantly higher number of procedures compared to
SNOMED CT ONLY dataset.
Table 8. Procedures
DOMAIN
NON
SNOMED CT
SNOMED
CT ONLY
Procedure
59,311
53,052
Discussion
The results above show that SNOMED CT contains an exten-
sive coverage of concepts but still lags behind a reference ter-
minology which serves as the superset of several standardized
and legacy vocabularies. Part of this difference is due to the
differences in granularity. But a reference terminology will
need to provide concepts with varying granularities, and may
not expect the users to perform the composition-
decomposition themselves. Lack of concepts with differing
levels of granularity causes problems with internal data storage
as well as external data exchange. Some of the differences are
due to the absence of several concepts in SNOMED CT which
are present in other standard or local vocabularies. These re-
sults show that continuous authoring and updating of concepts
will be required to keep SNOMED CT comprehensive and up
to date for its use as a reference terminology.
Studies by other authors have also shown that SNOMED CT
has anywhere between 30% to 90% coverage for various clini-
cal domains.[6][7][8] These studies have each compared a
single domain of content between SNOMED CT and a single
enterprise reference terminology. Comparing multiple domains
of content between SNOMED CT and the superset of multiple
reference terminologies also yields similar results as shown
above by this experiment.
Furthermore, we have not discussed several domains of con-
tent which are not covered by SNOMED CT, such as billing
and reimbursement vocabularies, lab test names (provided by
LOINC, etc. This denotes that several vocabularies need to be
used together to complement each other and to build the enter-
prise reference terminology of a healthcare organization.
2. Data Consistency
Materials and Methods
The SNOMED CT ‘core’ consists of three tables concepts,
descriptions and relationships. We used the core tables provid-
ed by SNOMED CT and the US English dialect subset (both
obtained from the US Edition, July 2006 release) to assess
data consistency in SNOMED CT. We examined the compli-
ance of the descriptions table to the constraints defined in
SNOMED CT Technical Reference guide and User guide.
Table 9. SNOMED CT Representations
Representation
Type
Fully Specified Name
Scar (morphologic abnormality)
Preferred Term
Scar
Synonym
Scar tissue
Synonym
Fibrous scar
Synonym
Cicatrix
These constraints guide the importing of SNOMED CT into
the reference terminology of an institution, use of SNOMED
CT as the exclusive reference terminology of an institution,
creating subsets, applying extensions to the core content, ver-
sioning and updating, and so on. The subset mechanism is es-
pecially important as SNOMED CT is too large to be used
directly for a clinical application. SNOMED CT Subsets are
often created for specific clinical domains or applications.
According to the SNOMED CT User Guide, each concept has
one fully specified name, one preferred term and zero or more
synonyms.[9] The fully specified name gives the definition of
a concept, and is more explanatory than its preferred term.
Preferred term is a more concise name which is used in clinical
records. Synonyms are concise forms as well, and may be used
as alternatives for the preferred term. An example is given in
Table 9 all these representations denote a single concept. We
encountered some problems and inconsistencies while import-
ing the SNOMED CT representations (textual names or sur-
face forms of concepts) into the 3M HDD. We present the
results below.
Results
The US Edition of SNOMED CT contains both American and
British English representations for several concepts. The rep-
resentations table in the core distribution denotes both the US
(en-US) and UK (en-GB) English representations with an “un-
specified type” for several concepts and a preferred term is not
defined in these cases. The SNOMED CT US Dialect Subset
redefines one of these representations as the Preferred Term
and the rest as synonyms (often including the en-GB represen-
tation) for US English.[10] We first imported the core repre-
sentations table into a relational database, and then applied the
US Dialect Subset to it.
After applying the US dialect subset to the core descriptions
table for the July 2006 US Edition, we found that 10,852 con-
cepts had multiple preferred terms, often with different status-
es (active, retired, limited, etc). The correct preferred term
needs to be selected based on the concept status in these cases.
Not all of these 10,852 concepts’ preferred terms could be
automatically selected based on the concept status and/or de-
scription status alone. 335 concepts had multiple preferred
terms with the same status. E.g. ConceptId 103497003, “Strep-
tococcus penumoniae 3 (organism) [sic] has two preferred
terms in a “current” status.
Several of these occurrences are in direct violations of data
integrity constraints defined in the SNOMED CT User Guide.
These caused issues in automated importing SNOMED CT
data into our reference terminology. This was overcome by
defining one of the preferred terms or synonyms as the pre-
ferred term for some concepts by one of the authors. This task
was labor intensive, and required several hours of the domain
expert’s time.
Discussion
The above results show that SNOMED CT needs stricter qual-
ity analysis before it is released to the end users. Failure of
data consistency and violation of such constraints will lead to
problems in importing and using the terminology by the target
organizations. More studies of data consistency and integrity
need to be done which involve other components of SNOMED
CT. However, the authors wish to mention that SNOMED CT
is one of the well designed terminologies, and has less data
consistency issues than many other standard vocabularies.
3. Frequency of update
Updates to SNOMED CT content are done through the regu-
larly scheduled versioning mechanism. A new version is pub-
lished once every six months, in January and July of every
year. SNOMED has a web-based request mechanism for users
and contributors to request changes. These changes include
new concepts, representations, subsets, or extension
namespaces through which third parties can extend the content
in SNOMED CT. The users can also request modifications to
existing content. However, these changes are incorporated and
published once every six months through the regular release
mechanism. Each new version of SNOMED CT has intro-
duced anywhere between 2,500 and 8,000 new concepts. Sta-
tistics of changes in various core SNOMED CT tables over the
years have been studied by other authors.[11]
Most organizations update their data dictionaries on a continu-
ous basis, and some enterprise reference terminologies are
updated as often as every day. This makes it harder to use
SNOMED CT as the exclusive reference terminology for an
organization. This can be overcome by creating local exten-
sions to SNOMED CT via the SNOMED CT extension mech-
anism. The extension may be reconciled with the subsequent
SNOMED CT version by inactivating the concepts in the ex-
tension and superseding them with equivalent concepts that are
added to SNOMED CT. This process needs specialized exper-
tise in vocabulary and ontology creation and maintenance, and
requires contribution from subject matter experts in several
content domains. An alternative is to use an enterprise refer-
ence terminology which includes SNOMED CT as one of the
vocabulary sources along with other vocabularies.
Conclusion
The above analyses show that SNOMED CT does not have the
necessary coverage to be the exclusive content provider for the
enterprise reference terminology of an organization.
SNOMED CT has one of the most extensive coverage among
biomedical vocabularies, and has a well defined ontology and
semantic relationships. These features make SNOMED CT as
an essential component of an enterprise reference terminology.
However, it needs to be complemented with other standard
vocabularies, and in most cases, ‘homegrown’ vocabularies to
build a reference terminology.
SNOMED CT also has some data inconsistency issues which
need manual intervention for implementation. We expect that
these issues will be addressed in future releases. SNOMED CT
also has a slower update schedule than would be required for
the maintenance of a production enterprise reference terminol-
ogy.
Due to these reasons, we conclude that most healthcare organ-
izations will need further work to implement or adapt
SNOMED CT for their requirements and use it in combination
with other biomedical vocabularies, and may not be able to use
it as a plug-and-play vocabulary.
Acknowledgements
The authors thank Shaun C. Shakib, MPH for his valuable help with
this study.
References
[1] SNOMED CT® Technical Reference Guide. January 2006
Release. Jan 2006: 22.
[2] SNOMED CT® Technical Reference Guide. July 2006
Release. Jul 2006: 22.
[3] Elkin PL, Brown SH, Husser CS, Bauer BA, Wahner-
Roedler D, Rosenbloom ST, Speroff T. Evaluation of the
content coverage of SNOMED CT: ability of SNOMED
clinical terms to represent clinical problem lists. Mayo Clin
Proc. 2006 Jun;81(6):741-8.
[4] Penz JF, Brown SH, Carter JS, Elkin PL, Nguyen VN,
Sims SA, Lincoln MJ. Evaluation of SNOMED coverage
of Veterans Health Administration terms. Medinfo.
2004;11(Pt 1):540-4.
[5] Wasserman H, Wang J. An applied evaluation of
SNOMED CT as a clinical vocabulary for the computer-
ized diagnosis and problem list. AMIA Annu Symp Proc.
2003;699-703.
[6] Brown SH, Elkin PL, Bauer BA, Wahner-Roedler D,
Husser CS, Temesgen Z, Hardenbrook SP, Fielstein EM,
Rosenbloom ST. SNOMED CT®: Utility for a General
Medical Evaluation Template. Proc AMIA Symp 2006:
101-105.
[7] van der Kooij J, Goossen WT, Goossen-Baremans AT, de
Jong-Fintelman M, van Beek L. Using SNOMED CT
codes for coding information in electronic health records
for stroke patients. Stud Health Technol Inform.
2006;124:815-23.
[8] Kim H, Harris MR, Savova G, Chute CG. Content cover-
age of SNOMED-CT toward the ICU nursing flowsheets
and the acuity indicators. Stud Health Technol Inform.
2006;122:722-6.
[9] SNOMED CT® User Guide. January 2006 Release. Sec-
tion 3.2; Jan 06: 11.
[10]SNOMED CT® Technical Reference Guide. July 2006
Release. Jul 06: 83.
[11] Spackman KA. Rates of change in a large clinical termi-
nology: three years experience with SNOMED Clinical
Terms. AMIA Annu Symp Proc. 2005:714-8.
Address for correspondence
Senthil K. Nachimuthu, MD. Email: snachimuthu@mmm.com
... ISO 13606's EHR interoperability has not supported the implementation of messaging components, and its implementation is quite difficult [29]. At the same time, SNOMED CT and LOINC represent the comprehensive clinical reference terminology available in the medical world [30]. Therefore, we chose the FHIR as an interoperability standard for future work in this study. ...
Article
Full-text available
Although Health Level Seven (HL 7) message standards (v2, v3, Clinical Document Architecture (CDA)) have been commonly adopted, there are still issues associated with them, especially the semantic interoperability issues and lack of support for smart devices (e.g., smartphones, fitness trackers, and smartwatches), etc. In addition, healthcare organizations in many countries are still using proprietary electronic health record (EHR) message formats, making it challenging to convert to other data formats—particularly the latest HL7 Fast Health Interoperability Resources (FHIR) data standard. The FHIR is based on modern web technologies such as HTTP, XML, and JSON and would be capable of overcoming the shortcomings of the previous standards and supporting modern smart devices. Therefore, the FHIR standard could help the healthcare industry to avail the latest technologies benefits and improve data interoperability. The data representation and mapping from the legacy data standards (i.e., HL7 v2 and EHR) to the FHIR is necessary for the healthcare sector. However, direct data mapping or conversion from the traditional data standards to the FHIR data standard is challenging because of the nature and formats of the data. Therefore, in this article, we propose a framework that aims to convert proprietary EHR messages into the HL7 v2 format and apply an unsupervised clustering approach using the DBSCAN (density-based spatial clustering of applications with noise) algorithm to automatically group a variety of these HL7 v2 messages regardless of their semantic origins. The proposed framework’s implementation lays the groundwork to provide a generic mapping model with multi-point and multi-format data conversion input into the FHIR. Our experimental results show the proposed framework’s ability to automatically cluster various HL7 v2 message formats and provide analytic insight behind them.
... Application of this standard will standardize and formalize the input and storage of medical information, allowing accumulation of information in the regional and global databases and providing great opportunities for the use of decision support systems, statistical and other data processing, as well as an opportunity to exchange medical information between medical information systems [1]. ...
... With the RDF, the knowledge pertaining to medical image interpretation can be stored as a set of triples, where each element of the triple can be referenced by an explicit uniform resource identifier (URI) to which resources can be linked. By linking to welldefined medical terminologies, such as RadLex [12][13][14], Foundational Model of Anatomy (FMA) [15], and SNOMED CT [16,17], an RDF-based approach can explicitly refer to a formalized set of concepts. With query languages that make use of triple-based patterns, such as the Simple Protocol and RDF Query Language (SPARQL), it is possible to create detailed queries for specific information. ...
Article
As the use of positron emission tomography-computed tomography (PET-CT) has increased rapidly, there is a need to retrieve relevant medical images that can assist image interpretation. However, the images themselves lack the explicit information needed for query. We constructed a semantically structured database of nuclear medicine images using the Annotation and Image Markup (AIM) format and evaluated the ability the AIM annotations to improve image search. We created AIM annotation templates specific to the nuclear medicine domain and used them to annotate 100 nuclear medicine PET-CT studies in AIM format using controlled vocabulary. We evaluated image retrieval from 20 specific clinical queries. As the gold standard, two nuclear medicine physicians manually retrieved the relevant images from the image database using free text search of radiology reports for the same queries. We compared query results with the manually retrieved results obtained by the physicians. The query performance indicated a 98 % recall for simple queries and a 89 % recall for complex queries. In total, the queries provided 95 % (75 of 79 images) recall, 100 % precision, and an F1 score of 0.97 for the 20 clinical queries. Three of the four images missed by the queries required reasoning for successful retrieval. Nuclear medicine images augmented using semantic annotations in AIM enabled high recall and precision for simple queries, helping physicians to retrieve the relevant images. Further study using a larger data set and the implementation of an inference engine may improve query results for more complex queries.
... Data harmonization is used when data standardization is not possible to achieve interoperability across systems. This work is laborious and entails a trans-disciplinary approach wherein informaticists, measurement and topical experts, biostatisticians, and ethicists combine their knowledge to ensure the integrity and security of harmonized data [5] [6]. One method applied to harmonise terminology is the abstraction network. ...
Article
Most patients with chronic disease are prescribed multiple medications, which are recorded in their personal health records. This is rich information for clinical public health researchers but also a challenge to analyse. This paper describes the method that was undertaken within the Public Health Research Data Management System (PHReDMS) to map medication data retrieved from individual patient health records for population health researcher's use. The PHReDMS manages clinical, health service, community and survey research data within a secure web environment that allows for data sharing amongst researchers. The PHReDMS is currently used by researchers to answer a broad range of questions, including monitoring of prescription patterns in different population groups and geographic areas with high incidence/prevalence of chronic renal, cardiovascular, metabolic and mental health issues. In this paper, we present the general notion of abstraction network, a higher level network that sits above a terminology and offers compact and more easily understandable view of its content. We demonstrate the utilisation of abstraction network methodology to examine medication data from electronic medical records to allow a compact and more easily understandable view of its content.
... Schulz et al. (2009Schulz et al. ( , 2007 highlighted the ontological and logical issues associated with SCT. Issues in using SCT as a clinical reference terminology were discussed by Nachimuthu and Lau (2007). As a result of the inconsistencies within SCT as is some users are refraining from using SCT in clinical applications (Rector et al., 2011). ...
Article
SNOMED CT has been regarded as the most prominent clinical health terminology to be used in Electronic Health Records. However, modelling inconsistencies are preventing SNOMED CT from providing proper support for clinical use. This study introduces positional similarity sets as an effective contextual technique to identify such inconsistencies and improve the modelling of SNOMED CT concepts. Positional similarity sets are sets of lexically similar concepts having only one different word at the same position of their names. A technique to incorporate three structural indicators into the selected sets is provided to improve the likelihood of finding inconsistently modelled concepts. The results show that the likelihood of finding inconsistencies using such positional similarity sets is up to 41.6%. Such quality assurance methods can be used to supplement IHTSDO's own efforts in order to improve the quality of SNOMED CT.
... Advocates of SNOMED CT argue that it should be utilized by healthcare providers in the electronic medical record (EMR), whereas ICD is better suited for administrative purposes and statistical reporting. Critics of SNOMED CT point out that there are several ''holes'' and conflicts in its extensive vocabulary, with clinical entities missing or hierarchical relationships not fully worked out (Nachimuthu, 2007). ...
Article
On or after October 1, 2015, the United States will require use of the Clinical Modification of the International Classification of Diseases, 10th Revision (ICD-10-CM) for diagnostic coding. This primer was written to assist the cleft care community with understanding and use of ICD-10-CM for diagnostic coding related to cleft lip and/or palate (CL/P).
Chapter
Healthcare interoperability has been a major challenge since the early 1980s and still remains an unsolved issue. Attempts in the past have addressed this problem using bespoke solutions, but a generic solution still eludes the healthcare community. This chapter discusses the various levels at which interoperability needs to be preserved for an uninterrupted exchange of healthcare information. The chapter reviews the various healthcare standards developed in an attempt to solve the interoperability problem at a syntactic level and then moves on to examine medical ontologies developed to solve the problem at a semantic level. The chapter explains the features of semantic web technology that can be leveraged at each level. A literature survey is carried out to gage the current contribution of semantic web technologies in this area along with an analysis of how semantic web technologies can be improved to better suit the health informatics domain and solve the healthcare interoperability challenge.
Article
Recently, the structured data format in RDF/OWL has played an increasingly vital role in the semantic web. We converted pediatric and oncologic nuclear medicine imaging reports in free text into RDF/OWL format and evaluated the usefulness of nuclear medicine imaging reports in RDF/OWL by comparing SPARQL query results with the manually retrieved results by physicians from the reports in free text. SPARQL query showed 95% recall for simple queries and 91% recall for dedicated queries. In total, SPARQL query retrieved 93% (51 lesions of 55) recall and 100% precision for 20 clinical query items. All query results missed by SPARQL query were of some inference. Nuclear medicine imaging reports in the format of RDF/OWL were very useful for retrieving simple and dedicated query results using SPARQL query. Further study using more number of cases and knowledge for inference is warranted.
Article
An effective image retrieval system is required as the amount of medical imaging data is increasing recently. Authors reviewed the recent development of text-based medical image retrieval including the use of controlled vocabularies - RadLex (Radiology Lexicon), FMA (Foundational Model of Anatomy), etc - natural language processing, semantic ontology, and image annotation and markup.
Article
Full-text available
Computerized Provider Order Entry Systems are supposed to enhance healthcare quality through decreasing the number of prescriptions and medical errors while increasing prescription relevance. These bene ts are still theoretical because prescription computerization faces many problems, including software interoperability and usability. Using interface terminologies in terminolgy ows could help solving them. The main objective of this work was to model and create such a ow of terminologies for biological and radiological examination purposes. The potential bene ts of these solutions in terms of interoperability and usability were then evaluated. Analyzing work ows allowed us to make a model of terminology ows that is probably shared among many di erent domains. Creating these ows was based on interface terminologies produced for this purpose and national or international health terminology standards. Some speci c assessment methods have been developed for the integration of an iconic interface terminology into a guidelines search engine and a medical health record. They were adapted and applied to our speci c context. Created terminology ows led to important loss of information between information systems. In radiology, the ordering interface terminology was signi cantly more usable than other terminologies. There was no evidence of such a di erence in biology. Terminology ows are not yet functional. However, interface terminologies are available for any health facility or software publisher and should ease the implementation of computerization provider order entry systems.
Article
Full-text available
Veterans Health Administration (VHA) is now evaluating use of SNOMED-CT. This paper reports the first phase of this evaluation, which examines the coverage of SNOMED-CT for problem list entries. Clinician expressions in VA problem lists are quite diverse compared to the content of the current VA terminology Lexicon. We selected a random set of 5054 narratives that were previously "unresolved" against the Lexicon. These narratives were mapped to SNOMED-CT using two automated tools. Experts reviewed a subset of the tools' matched, partly matched, and un-matched narratives. The automated tools produced exact or partial matches for over 90% of the 5054 unresolved narratives. SNOMED-CT has promise as a coding system for clinical problems. In subsequent studies, VA will examine the coverage of SNOMED for other clinical domains, such as drugs, allergies, and physician orders.
Article
Full-text available
The Systematized Nomenclature of Medicine, Clinical Terms (SNOMED CT) was produced by merging SNOMED Reference Terminology (RT) with Clinical Terms version 3 (CTV3). It was first released in January 2002. This paper summarizes the overall size of the terminology and its rates of change over a period of three calendar years, comprising six subsequent releases each occurring at six month intervals. Rates of change in raw table size are reported for the concepts, descriptions, and relationships tables. Other measures of change are the number of identifiers made inactive and the reasons for this, as well as the number and rate of changes in the subsumption hierarchies and defining relationships. Awareness of the rate of change in the terminology can help terminology developers focus attention on needed infrastructure support and capacity for handling updates and refinements, and can help application developers by highlighting the need for managing terminology change in applications.
Article
Full-text available
For a project on development of an Electronic Health Record (EHR) for stroke patients, medical information was organised in care information models (templates). All (medical) concepts in these templates need a unique code to make electronic information exchange between different EHR systems possible. When no unique code could be found in an existing coding system, a code was made up. In the study presented in this article we describe our search for unique codes in SNOMED CT to replace the self made codes. This to enhance interoperability by using standardized codes. We wanted to know for how many of the (self made) codes we could find a SNOMED CT code. Next to that we were interested in a possible difference between templates with individual concepts and concepts being part of (scientific) scales. Results of this study were that we could find a SNOMED CT code for 58% of the concepts. When we look at the concepts with a self made code, 54.9% of these codes could be replaced with a SNOMED CT code. A difference could be detected between templates with individual concepts and templates that represent a scientific scale or measurement instrument. For 68% of the individual concepts a SNOMED CT could be found. However, for the scientific scales only 26% of the concepts could get a SNOMED CT code. Although the percentage of SNOMED CT codes found is lower than expected, we still think SNOMED CT could be a useful coding system for the concepts necessary for the continuity of care for stroke patients, and the inclusion in Electronic Health Records. Partly this is due to the fact that SNOMED CT has the option to request unique codes for new concepts, and is currently working on scale representation.
Article
Full-text available
The U.S. government has licensed SNOMED CT to permit broad-based evaluation and use of the terminology. We evaluated the ability of SNOMED CT to represent terms used for interface objects (e.g., labels and captions) and concepts used for data and branching logic in a general medical evaluation template in use within the Department of Veterans Affairs. The general medical evaluation form definition, report definition, and script files were parsed and 1573 expressions were mapped into SNOMED CT. Compositional expressions required to represent 1171 concepts. Double independent reviews were conducted. Exact concept level matches were used to evaluate reference coverage. Exact term level matches were required for interface terms. Semantics were analyzed for a randomly selected subset of 20 terms. Sensitivity of SNOMED CT as a reference terminology was 63.8% , ranging from 29.3% for history items to 92.4% for exam items. SNOMED CT's sensitivity as an "interface terminology" was 55.0%. 80% of the necessary linking semantics for the subset were present. Subgroup statistics are presented. SNOMED CT is promising as a terminology for knowledge representation underlying a large general medical evaluation. Its performed less well as an interface terminology.
Article
The use of a standardized controlled terminology allows diverse systems and applications throughout the enterprise to translate data. In developing a customized enterprise-wide vocabulary for clinical terminology, we implemented SNOMED CT as a base vocabulary, while facilitating the addition of site-specific clinical terms or concepts not represented in SNOMED CT. In this paper, we evaluate the breadth of SNOMED CT terms and concepts for the coding of diagnosis and problem lists by clinicians within a computerized physician order entry (CPOE) system. Clinicians selected diagnosis and problem list terms from a lexicon based on SNOMED CT, submitting requests for clinical terms that were not found in the controlled vocabulary. For each "missing" term, we assigned one of four mapping types, representing the relationship of this new terminology entry to the SNOMED CT reference terminology. Our results show that the majority of diagnosis/problem list terms (88.4%) were found in SNOMED CT. Of the 145 missing terms, only 20 represented significant concepts missing from SNOMED CT, resulting in concept coverage of 98.5%. Our results show that SNOMED CT is a relatively complete standardized terminology on which to base a vocabulary for the clinical problem list.
Article
To evaluate the ability of SNOMED CT (Systematized Nomenclature of Medicine Clinical Terms) version 1.0 to represent the most common problems seen at the Mayo Clinic in Rochester, Minn. We selected the 4996 most common nonduplicated text strings from the Mayo Master Sheet Index that describe patient problems associated with inpatient and outpatient episodes of care. From July 2003 through January 2004, 2 physician reviewers compared the Master Sheet Index text with the SNOMED CT terms that were automatically mapped by a vocabulary server or that they identified using a vocabulary browser and rated the "correctness" of the match. If the 2 reviewers disagreed, a third reviewer adjudicated. We evaluated the specificity, sensitivity, and positive predictive value of SNOMED CT. Of the 4996 problems in the test set, SNOMED CT correctly identified 4568 terms (true-positive results); 36 terms were true negatives, 9 terms were false positives, and 383 terms were false negatives. SNOMED CT had a sensitivity of 92.3%, a specificity of 80.0%, and a positive predictive value of 99.8%. SNOMED CT, when used as a compositional terminology, can exactly represent most (92.3%) of the terms used commonly in medical problem lists. Improvements to synonymy and adding missing modifiers would lead to greater coverage of common problem statements. Health care organizations should be encouraged and provided incentives to begin adopting SNOMED CT to drive their decision-support applications.
Article
Nursing flowsheets contain discrete and detailed patient data that can be reused in various clinical decisions including patient acuity estimation. To be reused, patient data should be represented consistently and unambiguously with standardized terminologies. This study investigated the domain completeness of SNOMED-CT toward the ICU nursing flowsheets and the acuity indicators used at Mayo Clinic. We first paraphrased the terms and the semantic relations appeared in the source materials into short clinical statement. Then, the entire prepared statement was mapped to SNOMED-CT and validated by expert reviewers. SNOMED-CT provided complete matches and partial matches to 78.13% and 14.1% of the flowsheets statements respectively. It provided complete matches and partial matches to 60.8% and 39.2% of the acuity statements respectively. The average observed agreement among the reviewers ran around 85%. Lack of appropriate terms was the major cause of the incomplete matches with the flowshee statements. Majority of the incomplete matches with the statements from the acuity indicators were caused by limitations in the concept models.
Sec-tion 3.2; Jan 06: 11. [10]SNOMED CT® Technical Reference Guide July 2006 Release Jul 06: 83. [11] Spackman KA. Rates of change in a large clinical termi-nology: three years experience with SNOMED Clinical Terms Address for correspondence Senthil K
  • Snomed User
SNOMED CT® User Guide. January 2006 Release. Sec-tion 3.2; Jan 06: 11. [10]SNOMED CT® Technical Reference Guide. July 2006 Release. Jul 06: 83. [11] Spackman KA. Rates of change in a large clinical termi-nology: three years experience with SNOMED Clinical Terms. AMIA Annu Symp Proc. 2005:714-8. Address for correspondence Senthil K. Nachimuthu, MD. Email: snachimuthu@mmm.com