ArticlePDF AvailableLiterature Review

Artificial intelligence in orthopaedics: A scoping review

PLOS
PLOS One
Authors:

Abstract and Figures

There is a growing interest in the application of artificial intelligence (AI) to orthopaedic surgery. This review aims to identify and characterise research in this field, in order to understand the extent, range and nature of this work, and act as springboard to stimulate future studies. A scoping review, a form of structured evidence synthesis, was conducted to summarise the use of AI in orthopaedics. A literature search (1946–2019) identified 222 studies eligible for inclusion. These studies were predominantly small and retrospective. There has been significant growth in the number of papers published in the last three years, mainly from the USA (37%). The majority of research used AI for image interpretation (45%) or as a clinical decision tool (25%). Spine (43%), knee (23%) and hip (14%) were the regions of the body most commonly studied. The application of artificial intelligence to orthopaedics is growing. However, the scope of its use so far remains limited, both in terms of its possible clinical applications, and the sub-specialty areas of the body which have been studied. A standardized method of reporting AI studies would allow direct assessment and comparison. Prospective studies are required to validate AI tools for clinical use.
This content is subject to copyright.
RESEARCH ARTICLE
Artificial intelligence in orthopaedics: A
scoping review
Simon J. FedererID
¤
*
, Gareth G. Jones
MSk Lab, Sir Michael Uren Hub, Imperial College London, London, United Kingdom
These authors contributed equally to this work.
¤Current address: Department of Trauma & Orthopaedics, St Richard’s Hospital, Chichester, United
Kingdom
*simon.federer@nhs.net
Abstract
There is a growing interest in the application of artificial intelligence (AI) to orthopaedic sur-
gery. This review aims to identify and characterise research in this field, in order to under-
stand the extent, range and nature of this work, and act as springboard to stimulate future
studies. A scoping review, a form of structured evidence synthesis, was conducted to sum-
marise the use of AI in orthopaedics. A literature search (1946–2019) identified 222 studies
eligible for inclusion. These studies were predominantly small and retrospective. There has
been significant growth in the number of papers published in the last three years, mainly
from the USA (37%). The majority of research used AI for image interpretation (45%) or as a
clinical decision tool (25%). Spine (43%), knee (23%) and hip (14%) were the regions of the
body most commonly studied. The application of artificial intelligence to orthopaedics is
growing. However, the scope of its use so far remains limited, both in terms of its possible
clinical applications, and the sub-specialty areas of the body which have been studied. A
standardized method of reporting AI studies would allow direct assessment and compari-
son. Prospective studies are required to validate AI tools for clinical use.
Introduction
Interest in the application of artificial intelligence (AI) in healthcare has surged in recent years
[1]. Computer systems are increasingly able to perform tasks that normally require human
intelligence, facilitated by improvements in data storage and computer processing. Despite the
interest, incorporation of AI into clinical practice is in its infancy [2]. AI tools are currently in
use, for example; in segmentation of three-dimensional optical coherence tomography scans
to aid referrals in ophthalmology [3], and detection of atrial fibrillation by a smartphone algo-
rithm and a single lead electrocardiography device in primary care [4]. The increase in digital
medical imaging and information collected in databases and orthopaedic registries, provide
large datasets ideal for the development of AI algorithms. These have the potential to improve
patient care at a number of levels including; diagnosis, management, research and systems
analysis [5].
PLOS ONE
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 1 / 11
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPEN ACCESS
Citation: Federer SJ, Jones GG (2021) Artificial
intelligence in orthopaedics: A scoping review.
PLoS ONE 16(11): e0260471. https://doi.org/
10.1371/journal.pone.0260471
Editor: Thippa Reddy Gadekallu, Vellore Institute of
Technology: VIT University, INDIA
Received: May 19, 2021
Accepted: November 11, 2021
Published: November 23, 2021
Peer Review History: PLOS recognizes the
benefits of transparency in the peer review
process; therefore, we enable the publication of
all of the content of peer review and author
responses alongside final, published articles. The
editorial history of this article is available here:
https://doi.org/10.1371/journal.pone.0260471
Copyright: ©2021 Federer, Jones. This is an open
access article distributed under the terms of the
Creative Commons Attribution License, which
permits unrestricted use, distribution, and
reproduction in any medium, provided the original
author and source are credited.
Data Availability Statement: The dataset may be
found at this URL: https://data.mendeley.com/
datasets/xvkr6t263v/1.
Funding: The author(s) received no specific
funding for this work.
The volume and variety of data collected from individuals has facilitated the advancement of AI
across multiple industries. Concerns regarding how personal data is stored and utilised prompted
legislation to protect this information. The General Data Protection Regulation (GDPR) was intro-
duced in the European Union (EU) in 2018, and some medical registries have struggled to gather
data in the same volume since. However, registries where patient consent has been a priority, such
as the National Joint Registry (NJR) in the UK, have not seen a sharp decrease. The NJR holds
information on over 3 million arthroplasty procedures since 2003 [6]. Orthopaedic registries are
some of the largest in healthcare and are primed for the application of AI.
Artificial intelligence remains a relatively new field for most orthopaedic surgeons, and
understanding the extent, range and nature of work conducted so far is useful as a springboard
to identify potential new applications and areas for research. With this goal in mind, we con-
ducted a scoping review, which is a form of structured evidence synthesis suited to this task.
The aims were to: 1) identify the number of research studies using AI in orthopaedics and 2)
summarize how and where these studies have applied AI to the field of orthopaedics.
Methods
A scoping review was chosen due to the breadth of the research topic and the expected varia-
tion in study design, and was conducted using the Arksey and O’Malley framework [7]. The
PRISMA-ScR checklist was utilised to ensure completeness (S1 Table) [8].
Literature search and eligible studies
A literature search of studies in English was conducted (1946–2019) using Ovid (Embase &
Medline) and Scopus. The search timeframe was chosen to ensure early studies were not
missed. The literature search was performed on 30/8/19. The search strategy is shown in Fig 1.
The search terms used are shown in S2 and S3 Tables.
The review focused on summarising the use of AI in applications relevant to clinical prac-
tice rather than related basic science. Hence, the following inclusion criteria were used: (a)
studies which directly applied artificial intelligence to orthopaedic clinical practice or b) the
outcomes of the study had the potential to be directly applied to orthopaedic clinical practice.
Abstracts, conference proceedings, articles not in English and review, commentary or edi-
torial articles were not eligible for inclusion. Articles relating to the following were also
excluded: cancer/oncology, biomechanics, gait analysis without clinical application, image seg-
mentation alone without a direct clinical application, basic science, neuromuscular disorders,
rehabilitation, prosthetics, natural language processing of radiology reports and wearable sen-
sors. These articles were excluded to ensure the review maintained a clinical focus and was
applicable to a general orthopaedic audience.
The literature search was performed by one investigator (SF). Abstract screening and full
text reviews were performed independently by two investigators (SF and GJ). There was full
agreement on the studies selected for inclusion. References from the literature search were
imported into Mendeley (v1.19.6, Elsevier, Amsterdam, Netherlands) where duplicates were
removed. Covidence systematic review software (Veritas Health Innovation Ltd, Melbourne,
Australia. Available at www.covidence.org) was used to synthesize and extract eligible studies.
Data extraction and collation
Data was extracted from eligible studies into an evidence table to summarize the following:
year of publication, country, area of body, procedure, health condition, orthopaedic care func-
tion, study design and number of patients. A formal quality appraisal of eligible studies was
not performed as this is beyond the remit of a scoping review. The data collected in the
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 2 / 11
Competing interests: The authors have declared
that no competing interests exist.
evidence table was used to define the main themes of research and the summarised data repre-
sented below.
Results
Searches
After removal of duplicates, the search retrieved 3649 documents for title and abstract screen-
ing. Of those, 512 met the eligibility criteria for full text screening and 222 met the final
Fig 1. Literature search and study identification strategy. PRISMA flow diagram showing the search strategy and number of included and
excluded studies.
https://doi.org/10.1371/journal.pone.0260471.g001
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 3 / 11
inclusion criteria. A reference list of included studies can be found in S4 Table. The study with
the earliest publication date, 1989, used a machine learning method (inductive learning) to
predict operative findings of disc prolapse or nerve entrapment [9]. 139 studies used one AI
technique and 83 used more than one. Machine learning techniques were used 236 times and
deep learning techniques 162 times. The most used machine learning techniques were Support
Vector Machines, 55 times, and Random Forests, 38 times. Of the studies that used deep learn-
ing techniques, 26 implemented convolution layers in their neural networks. Characteristics of
all the studies are summarised below in categories of data extraction.
Imaging
101 studies used AI to interpret an imaging modality to establish a diagnosis. A number of
early papers assessed and quantified the curvature of the spine in scoliosis [1012], and devel-
oped algorithms capable of calculating the Cobb angle using surface topography before using
radiographs and three-dimensional imaging. Subsequently, AI was applied to the detection of
other spinal pathologies e.g disc herniation or vertebral fractures [1316]. More recently the
scope of AI to aid diagnostic imaging has expanded outside of the spine, with uses ranging
from the identification of hip fractures to soft tissue meniscal tears in the knee [1719]. There
has also been a shift to algorithms providing a more nuanced grading of disease, rather than
binary outputs [20].
Orthopaedic care function
106 studies used AI to aid diagnostic decision support and 95 studies used AI to predict an
aspect of a patient’s care. The first paper to use AI in orthopaedics predicted operative findings
during low back surgery [9]. The data comprised of preoperative clinical features and was ana-
lysed using an inductive learning method. More recently, research has focused on algorithms
predicting patient outcomes post-surgery, utilizing the large orthopaedic data sets collected at
local and national level. In particular, two centres in the USA have developed algorithms using
local hospital data across different patient groups and procedures [2125].
Area of body
96 studies focused on the spine, 51 on the knee, 31 on the hip and 24 involved multiple areas.
Other areas had 5 publications or fewer (Fig 2).
Health condition
68 publications related to spinal pathologies, 64 to trauma and 62 to arthritis. Other conditions
were reported in 5 studies or fewer.
Procedure
141 publications did not relate to a specific orthopaedic procedure. 34 related to arthroplasty
and 26 to spinal procedures. Other procedures were reported in 5 publications or fewer.
Size of dataset used
There was a large range in the size of dataset used in the studies. The largest dataset used
1106234 patients [26], the smallest only 4 [27]. The median number of patients used was 250.
68 studies had a dataset of fewer than 100 patients. Arthroplasty registries were the sources of
some of the larger datasets with information from over 1 million patients being used to build
AI models [23,26,2831].
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 4 / 11
Year of publication
The number of studies has increased in the last half a decade, with 14 publications in 2016 and 70
in 2019 (Fig 3). Between 1989 and 2010 the maximum number of publications per year was 6.
Geographical location
83 studies (37%) were published from the USA, 24 from Canada, 23 from China, 11 from
South Korea and 10 from India. Other countries had fewer than 10 published studies (Fig 4).
Several papers from the USA emanate from the same institution, who have applied similar AI
models to a range of applications [24,32,33].
Discussion
We have reviewed and summarised the characteristics of 222 publications that included AI and
orthopaedics. This scoping review was conducted to establish where and how AI has been used
in orthopaedics. We have described the overarching features of these publications to highlight
where the research has been focused and guide future avenues of research. The predominant
findings were 1) Nearly half of the publications related to imaging interpretation to establish a
diagnosis; 2) The spine was the most studied musculoskeletal region; and 3) Predicting patient
outcomes is an emerging area of interest. Overall, research in AI and orthopaedics is at an early
stage when compared to radiology [34], for example, but entering a phase of significant growth.
AI was used in 101 publications (45%) to interpret an imaging modality to establish a diag-
nosis. This focus can be explained by the large volume of organized data acquired during imag-
ing and the relative ease with which AI models can be built to interpret this data. Radiology,
Fig 2. Publication count by orthopaedic area of interest. A graph showing the number of papers published with regards to the
area of the body.
https://doi.org/10.1371/journal.pone.0260471.g002
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 5 / 11
accordingly, has seen one of the biggest increases in the use of AI to interpret scans [34]. The
overlap between radiology and orthopaedics, for example, in fracture detection [13] or Cobb
angle measurement from radiographs [35] could also explain the predominance of imaging
related studies.
The initial search identified many publications relating to image segmentation, whereby an
algorithm is used to automatically segment a specific structure(s), such as an intervertebral
disc, from an imaging modality [36]. Papers that described segmentation of normal scans or
were unable to detect pathology were not felt to be of direct clinical relevance and hence were
excluded. Segmentation is, however, an important step in the process of establishing a diagno-
sis from imaging and it is relevant to mention the volume of research to date in this area. The
use of real-time image segmentation with augmented reality is now being used as a navigation
tool in spinal surgery [37], and this technique could be applied elsewhere in orthopaedics.
The spine, hip and knee were the regions most studied. The joint management of spinal
pathology with neurosurgery could explain the greater proportion of papers on the spine.
Large arthroplasty registries could suggest why hip and knee have seen more interest than the
sub-specialty areas of foot & ankle and hand. More research should be focused on sub-spe-
cialty areas other than spine, hip and knee.
Fig 3. Number of papers by year of publication and by country of origin.A graph showing the number of papers published per year and by the country of origin. The
five countries with the most publications are listed. Countries with fewer than 10 publications have been grouped into ‘Other’.
https://doi.org/10.1371/journal.pone.0260471.g003
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 6 / 11
A significant volume of research found through the literature search related to translational
engineering. A number of studies were published in engineering journals and so may not have
reached readers from a clinical background [3840]. Comparatively few papers from rheuma-
tology were found in this study [41,42]. This may be due to the inclusion criteria used and the
health conditions of interest. The interplay between different specialties and industries pres-
ents an opportunity to promote interdisciplinary research. Specialists in data science are
needed to progress AI in healthcare, and joint projects between specialties will make future
research more efficient.
AI works best with high quality, large datasets. It was noted that the size of dataset in the
published literature was highly variable. Sixty-eight (31%) of the studies had fewer than 100
patients. Whilst there is no set minimum dataset size for AI algorithms, the reliability of stud-
ies performed using small numbers may be questioned. Registries provided the largest sources
of data in publications identified in this study [23,26,2831]. They will continue to be a valu-
able resource for further studies predicting personalised patient outcomes. Albeit, there is con-
cern that population-based data may be unable to solve clinical problems at a patient level
[43]. Data sharing is needed for ongoing training and improvement of AI algorithms [2]. Leg-
islation, such as GDPR, ensures that consent for data sharing is obtained and appropriate secu-
rity measures are in place for the storage of data. Data privacy and protection is of utmost
importance going forward.
Fig 4. Number of papers by country of origin. A graph showing the number of papers published by the country of origin of the first author.
https://doi.org/10.1371/journal.pone.0260471.g004
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 7 / 11
There is scope for AI tools to assist in decision making regarding the management of
patients. AI models that have been developed to retrospectively look at registry data could be
used to design prospective studies. A decision-making aide would be a useful adjunct, for
example, in understanding which patients will have favourable outcomes after arthroplasty.
Predictive models will also provide insights into cost savings and efficiencies that will be of
interest to healthcare providers.
AI is a rapidly advancing discipline with new algorithmic models constantly in develop-
ment, often described using new and different terminology. Machine learning, deep learning
and neural networks are some of the terms encountered in the literature that come under the
umbrella term of AI. This variation in terminology has led to differences in how the papers are
keyworded and recorded in databases. A PubMed (PubMed.gov, National Center for Biotech-
nology Information, Bethesda, MD, USA) search of “Artificial Intelligence Orthopaedics” in
August 2019 yielded a mere 120 results. It was clear that many appropriate papers were missed
and led to refinement of the search strategy for this study. A standardised method of reporting
AI studies is currently lacking and would allow direct assessment and comparison of studies.
Similarly, consistency in terminology and keywords would allow researchers to search for rele-
vant papers more easily. “Artificial Intelligence” is, perhaps, too broad, and not clearly defined
to be used as an umbrella term for keyword searches. We propose that the umbrella term
“Machine learning” should be included on all papers for standardisation.
There was a geographical split in the location of papers published. As represented in Fig 3,
most papers (n = 83) originated from the USA, followed by Canada (n = 24) and China
(n = 23). These results may have been skewed by our inclusion only of papers written in
English but highlights the dominance of institutions from the USA. Additionally, it is impor-
tant to note that the search terms, whilst broader than a previous literature review [1] were not
exhaustive, and despite our best efforts valid publications may have been missed. Some time
has passed since the literature search was performed, and progress has been made in AI in
orthopaedics and more widely in healthcare. Efforts to quantify the diagnostic accuracy of
deep learning in medical imaging and guidelines for reporting such studies are two examples
of how the field has progressed [44,45].
Conclusion
The use of AI in orthopaedics is increasing. Studies using large datasets exist and novel AI
tools with the ability to have clinical impact are being developed. More research is needed
before the potential of AI can translate to a significant change in the day-to-day clinical prac-
tice of orthopaedic surgeons.
Supporting information
S1 Table. PRISMA-ScR checklist.
(DOCX)
S2 Table. Database search terms for Ovid—Embase and Medline.
(DOCX)
S3 Table. Database search terms for Scopus.
(DOCX)
S4 Table. Reference list of papers included in study.
(DOCX)
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 8 / 11
Author Contributions
Conceptualization: Gareth G. Jones.
Data curation: Simon J. Federer.
Formal analysis: Simon J. Federer.
Investigation: Simon J. Federer.
Methodology: Simon J. Federer.
Project administration: Gareth G. Jones.
Supervision: Gareth G. Jones.
Writing original draft: Simon J. Federer.
Writing review & editing: Gareth G. Jones.
References
1. Cabitza F, Locoro A, Banfi G. Machine learning in orthopedics: A literature review. Front Bioeng Bio-
technol. 2018; 6:75. https://doi.org/10.3389/fbioe.2018.00075 PMID: 29998104
2. He J, Baxter SL, Xu J, Xu J, Zhou X, Zhang K. The practical implementation of artificial intelligence tech-
nologies in medicine. Nat Med. 2019; 25(1):30–6. https://doi.org/10.1038/s41591-018-0307-0 PMID:
30617336
3. De Fauw J, Ledsam JR, Romera-Paredes B, Nikolov S, Tomasev N, Blackwell S, et al. Clinically appli-
cable deep learning for diagnosis and referral in retinal disease. Nat Med. 2018; 24(9):1342–50. https://
doi.org/10.1038/s41591-018-0107-6 PMID: 30104768
4. Himmelreich JCL, Karregat EPM, Lucassen WAM, van Weert HCPM, de Groot JR, Louis Handoko M,
et al. Diagnostic accuracy of a smartphone-operated, single-lead electrocardiography device for detec-
tion of rhythm and conduction abnormalities in primary care. Ann Fam Med. 2019; 17(5):403–11.
https://doi.org/10.1370/afm.2438 PMID: 31501201
5. Panchmatia JR, Visenio MR, Panch T. The role of artificial intelligence in orthopaedic surgery. Br J
Hosp Med. 2018; 79(12):676–81. https://doi.org/10.12968/hmed.2018.79.12.676 PMID: 30526106
6. Registry NJ. National Joint Registry - 17th Annual Report 2020. Natl Jt Regist. 2020;(December
2019):138. PMID: 33439585
7. Arksey H, O’Malley L. Scoping studies: Towards a methodological framework. Int J Soc Res Methodol
Theory Pract. 2005; 8(1):19–32.
8. Tricco AC, Lillie E, Zarin W, O’Brien KK, Colquhoun H, Levac D, et al. PRISMA extension for scoping
reviews (PRISMA-ScR): Checklist and explanation. Ann Intern Med. 2018; 169(7):467–73. https://doi.
org/10.7326/M18-0850 PMID: 30178033
9. Mathew B, Norris D, Mackintosh I, Waddell G. Artificial intelligence in the prediction of operative findings
in low back surgery. Br J Neurosurg. 1989; 3(2):161–70. https://doi.org/10.3109/02688698909002791
PMID: 2679685
10. Jaremko JL, Poncet P, Ronsky J, Harder J, Dansereau J, Labelle H, et al. Estimation of spinal deformity
in scoliosis from torso surface cross sections. Spine (Phila Pa 1976). 2001; 26(14):1583–91. https://doi.
org/10.1097/00007632-200107150-00017 PMID: 11462091
11. Ramirez L, Durdle NG, Raso VJ, Hill DL. A support vector machines classifier to assess the severity of
idiopathic scoliosis from surface topography. IEEE Trans Inf Technol Biomed. 2006; 10(1):84–91.
https://doi.org/10.1109/titb.2005.855526 PMID: 16445253
12. Duong L, Cheriet F, Labelle H. Three-dimensional classification of spinal deformities using fuzzy clus-
tering. Spine (Phila Pa 1976). 2006; 31(8):923–30. https://doi.org/10.1097/01.brs.0000209312.62384.
c1 PMID: 16622383
13. Eller-Vainicher C, Chiodini I, Santi I, Massarotti M, Pietrogrande L, Cairoli E, et al. Recognition of mor-
phometric vertebral fractures by artificial neural networks: Analysis from gismo Lombardia database.
PLoS One. 2011; 6(11):e27277. https://doi.org/10.1371/journal.pone.0027277 PMID: 22076144
14. Koh J, Chaudhary V, Dhillon G. Disc herniation diagnosis in MRI using a CAD framework and a two-
level classifier. Int J Comput Assist Radiol Surg. 2012; 7(6):861–9. https://doi.org/10.1007/s11548-012-
0674-9 PMID: 22392057
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 9 / 11
15. Oktay AB, Albayrak NB, Akgul YS. Computer aided diagnosis of degenerative intervertebral disc dis-
eases from lumbar MR images. Comput Med Imaging Graph. 2014; 38(7):613–9. https://doi.org/10.
1016/j.compmedimag.2014.04.006 PMID: 24972858
16. Hao S, Jiang J, Guo Y, Li H. Active learning based intervertebral disk classification combining shape
and texture similarities. Neurocomputing. 2013; 101:252–7.
17. Saygılı A, Albayrak S. An efficient and fast computer-aided method for fully automated diagnosis of
meniscal tears from magnetic resonance images. Artif Intell Med. 2019; 97:118–30. https://doi.org/10.
1016/j.artmed.2018.11.008 PMID: 30527276
18. Carballido-Gamio J, Yu A, Wang L, Su Y, Burghardt AJ, Lang TF, et al. Hip Fracture Discrimination
Based on Statistical Multi-parametric Modeling (SMPM). Ann Biomed Eng. 2019; 47(11):2199–212.
https://doi.org/10.1007/s10439-019-02298-x PMID: 31240508
19. Couteaux V, Si-Mohamed S, Nempont O, Lefevre T, Popoff A, Pizaine G, et al. Automatic knee menis-
cus tear detection and orientation classification with Mask-RCNN. Diagn Interv Imaging. 2019; 100
(4):235–42. https://doi.org/10.1016/j.diii.2019.03.002 PMID: 30910620
20. Huber FA, Stutz S, Vittoria de Martini I, Mannil M, Becker AS, Winklhofer S, et al. Qualitative versus
quantitative lumbar spinal stenosis grading by machine learning supported texture analysis—Experi-
ence from the LSOS study cohort. Eur J Radiol. 2019; 114:45–50. https://doi.org/10.1016/j.ejrad.2019.
02.023 PMID: 31005175
21. Haeberle HS, Helm JM, Navarro SM, Karnuta JM, Schaffer JL, Callaghan JJ, et al. Artificial Intelligence
and Machine Learning in Lower Extremity Arthroplasty: A Review. J Arthroplasty. 2019; 34(10):2201–3.
https://doi.org/10.1016/j.arth.2019.05.055 PMID: 31253449
22. Lee HK, Jin R, Feng Y, Bain PA, Goffinet J, Baker C, et al. An Analytical Framework for TJR Readmis-
sion Prediction and Cost-Effective Intervention. IEEE J Biomed Heal Informatics. 2019; 23(4):1760–72.
https://doi.org/10.1109/JBHI.2018.2859581 PMID: 30047916
23. Ramkumar PN, Karnuta JM, Navarro SM, Haeberle HS, Scuderi GR, Mont MA, et al. Deep Learning
Preoperatively Predicts Value Metrics for Primary Total Knee Arthroplasty: Development and Validation
of an Artificial Neural Network Model. J Arthroplasty. 2019; 34(10):2220–2227.e1. https://doi.org/10.
1016/j.arth.2019.05.034 PMID: 31285089
24. Karhade A V., Ogink PT, Thio QCBS, Broekman MLD, Cha TD, Hershman SH, et al. Machine learning
for prediction of sustained opioid prescription after anterior cervical discectomy and fusion. Spine J.
2019; 19(6):976–83. https://doi.org/10.1016/j.spinee.2019.01.009 PMID: 30710731
25. Karhade A V., Ogink P, Thio Q, Broekman M, Cha T, Gormley WB, et al. Development of machine
learning algorithms for prediction of discharge disposition after elective inpatient surgery for lumbar
degenerative disc disorders. Neurosurg Focus. 2018; 45(5):E6. https://doi.org/10.3171/2018.8.
FOCUS18340 PMID: 30453463
26. Hyer JM, Ejaz A, Tsilimigras DI, Paredes AZ, Mehta R, Pawlik TM. Novel Machine Learning Approach
to Identify Preoperative Risk Factors Associated with Super-Utilization of Medicare Expenditure Follow-
ing Surgery. JAMA Surg. 2019; 154(11):1014–21. https://doi.org/10.1001/jamasurg.2019.2979 PMID:
31411664
27. D’Lima DD, Patil S, Steklov N, Colwell CW. ‘Lab’-in-a-Knee: In vivo knee forces, kinematics, and con-
tact analysis. Clin Orthop Relat Res. 2011; 469(10):2953–70. https://doi.org/10.1007/s11999-011-
1916-9 PMID: 21598121
28. Huber M, Kurz C, Leidl R. Predicting patient-reported outcomes following hip and knee replacement
surgery using supervised machine learning. BMC Med Inform Decis Mak. 2019; 19(1):3. https://doi.org/
10.1186/s12911-018-0731-6 PMID: 30621670
29. Karnuta JM, Navarro SM, Haeberle HS, Helm JM, Kamath AF, Schaffer JL, et al. Predicting Inpatient
Payments Prior to Lower Extremity Arthroplasty Using Deep Learning: Which Model Architecture Is
Best? J Arthroplasty. 2019; 34(10):2235–2241.e1. https://doi.org/10.1016/j.arth.2019.05.048 PMID:
31230954
30. Harris AHS, Kuo AC, Weng Y, Trickey AW, Bowe T, Giori NJ. Can Machine Learning Methods Produce
Accurate and Easy-to-use Prediction Models of 30-day Complications and Mortality after Knee or Hip
Arthroplasty? Clin Orthop Relat Res. 2019; 477(2):452–60. https://doi.org/10.1097/CORR.
0000000000000601 PMID: 30624314
31. Han SS, Azad TD, Suarez PA, Ratliff JK. A machine learning approach for predictive models of adverse
events following spine surgery. Spine J. 2019; 19(11):1772–81. https://doi.org/10.1016/j.spinee.2019.
06.018 PMID: 31229662
32. Karhade A V., Schwab JH, Bedair HS. Development of Machine Learning Algorithms for Prediction of
Sustained Postoperative Opioid Prescriptions After Total Hip Arthroplasty. J Arthroplasty. 2019; 34
(10):2272–2277.e1. https://doi.org/10.1016/j.arth.2019.06.013 PMID: 31327647
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 10 / 11
33. Ogink PT, Karhade A V., Thio QCBS, Gormley WB, Oner FC, Verlaan JJ, et al. Predicting discharge
placement after elective surgery for lumbar spinal stenosis using machine learning methods. Eur Spine
J. 2019; https://doi.org/10.1007/s00586-019-05928-z PMID: 30941521
34. Saba L, Biswas M, Kuppili V, Cuadrado Godia E, Suri HS, Edla DR, et al. The present and future of
deep learning in radiology. Eur J Radiol. 2019; 114(September 2018):14–24. https://doi.org/10.1016/j.
ejrad.2019.02.038 PMID: 31005165
35. Zhang J, Li H, Lv L, Zhang Y. Computer-Aided Cobb Measurement Based on Automatic Detection of
Vertebral Slopes Using Deep Neural Network. Int J Biomed Imaging. 2017; 2017:9083916. https://doi.
org/10.1155/2017/9083916 PMID: 29118806
36. Zhu X, He X, Wang P, He Q, Gao D, Cheng J, et al. A method of localization and segmentation of inter-
vertebral discs in spine MRI based on Gabor filter bank. Biomed Eng Online. 2016; 15(1):32. https://doi.
org/10.1186/s12938-016-0146-5 PMID: 27000749
37. Auloge P, Cazzato RL, Ramamurthy N, de Marini P, Rousseau C, Garnon J, et al. Augmented reality
and artificial intelligence-based navigation during percutaneous vertebroplasty: a pilot randomised clini-
cal trial. Eur Spine J. 2020; 29(7):1580–9. https://doi.org/10.1007/s00586-019-06054-6 PMID:
31270676
38. Baka N, Leenstra S, Van Walsum T. Ultrasound Aided Vertebral Level Localization for Lumbar Surgery.
IEEE Trans Med Imaging. 2017; 36(10):2138–47. https://doi.org/10.1109/TMI.2017.2738612 PMID:
28809678
39. Chalmers E, Pedrycz W, Lou E. Human experts’ and a fuzzy model’s predictions of outcomes of scolio-
sis treatment: A comparative analysis. IEEE Trans Biomed Eng. 2015; 62(3):1001–7. https://doi.org/10.
1109/TBME.2014.2377594 PMID: 25494498
40. Duong L, Cheriet F, Labelle H. Automatic detection of scoliotic curves in posteroanterior radiographs.
IEEE Trans Biomed Eng. 2010; 57(5):1143–51. https://doi.org/10.1109/TBME.2009.2037214 PMID:
20142161
41. Hussain D, Han SM. Computer-aided osteoporosis detection from DXA imaging. Comput Methods Pro-
grams Biomed [Internet]. 2019; 173:87–107. Available from: http://www.elsevier.com/locate/cmpb
https://doi.org/10.1016/j.cmpb.2019.03.011 PMID: 31046999
42. Zhang M, Gong H, Zhang K, Zhang M. Prediction of lumbar vertebral strength of elderly men based on
quantitative computed tomography images using machine learning. Osteoporos Int. 2019; 30
(11):2271–82. https://doi.org/10.1007/s00198-019-05117-0 PMID: 31401661
43. Bayliss L, Jones LD. The role of artificial intelligence and machine learning in predicting orthopaedic out-
comes. Bone Jt J. 2019; 101-B(12):1476–8. https://doi.org/10.1302/0301-620X.101B12.BJJ-2019-
0850.R1 PMID: 31786999
44. Aggarwal R, Sounderajah V, Martin G, Ting DSW, Karthikesalingam A, King D, et al. Diagnostic accu-
racy of deep learning in medical imaging: a systematic review and meta-analysis. npj Digit Med. 2021; 4
(1). https://doi.org/10.1038/s41746-021-00438-z PMID: 33828217
45. Sounderajah V, Ashrafian H, Golub RM, Shetty S, De Fauw J, Hooft L, et al. Developing a reporting
guideline for artificial intelligence-centred diagnostic test accuracy studies: the STARD-AI protocol.
BMJ Open. 2021; 11(6):e047709. https://doi.org/10.1136/bmjopen-2020-047709 PMID: 34183345
PLOS ONE
Artificial intelligence in orthopaedics: A scoping review
PLOS ONE | https://doi.org/10.1371/journal.pone.0260471 November 23, 2021 11 / 11
... The inclusion criteria of participants and their clinical information obtained a high risk of bias, due to the nature of the studies that did not focus on a follow-up of the patients but more about the prediction capability of the ML models. [8]. ...
... Future research should focus on refining these models for broader clinical applications and standardizing the evaluation metrics used. This finding aligns with recent literature that demonstrates AI's ability to optimize image quality in orthopaedic diagnostics, leading to more reliable clinical outcomes [8]. ...
Article
Full-text available
Purpose While several artificial intelligence (AI)‐assisted medical imaging applications are reported in the recent orthopaedic literature, comparison of the clinical efficacy and utility of these applications is currently lacking. The aim of this systematic review is to evaluate the effectiveness and reliability of AI applications in orthopaedic imaging, focusing on their impact on diagnostic accuracy, image segmentation and operational efficiency across various imaging modalities. Methods Based on the PRISMA guidelines, a comprehensive literature search of PubMed, Cochrane and Scopus databases was performed, using combinations of keywords and MeSH descriptors ('AI', 'ML', 'deep learning', 'orthopaedic surgery' and 'imaging') from inception to March 2024. Included were studies published between September 2018 and February 2024, which evaluated machine learning (ML) model effectiveness in improving orthopaedic imaging. Studies with insufficient data regarding the output variable used to assess the reliability of the ML model, those applying deterministic algorithms, unrelated topics, protocol studies, and other systematic reviews were excluded from the final synthesis. The Joanna Briggs Institute (JBI) Critical Appraisal tool and the Risk Of Bias In Non‐randomised Studies‐of Interventions (ROBINS‐I) tool were applied for the assessment of bias among the included studies. Results The 53 included studies reported the use of 11.990.643 images from several diagnostic instruments. A total of 39 studies reported details in terms of the Dice Similarity Coefficient (DSC), while both accuracy and sensitivity were documented across 15 studies. Precision was reported by 14, specificity by nine, and the F1 score by four of the included studies. Three studies applied the area under the curve (AUC) method to evaluate ML model performance. Among the studies included in the final synthesis, Convolutional Neural Networks (CNN) emerged as the most frequently applied category of ML models, present in 17 studies (32%). Conclusion The systematic review highlights the diverse application of AI in orthopaedic imaging, demonstrating the capability of various machine learning models in accurately segmenting and analysing orthopaedic images. The results indicate that AI models achieve high performance metrics across different imaging modalities. However, the current body of literature lacks comprehensive statistical analysis and randomized controlled trials, underscoring the need for further research to validate these findings in clinical settings. Level of evidence Systematic Review; Level of evidence IV.
... The automation of these tasks via AI can impact several aspects of the patient journey and support the healthcare professionals in providing superior quality of care. With specific reference to the orthopedics field, AI has shown promising results in improving diagnostic accuracy, supporting clinical decision-making, optimizing and personalizing surgical and/or treatment planning and monitoring patients' outcome [2,5]. In 2020, a study by Federer et al. [5] provided an overview of such applications in orthopedics. ...
... With specific reference to the orthopedics field, AI has shown promising results in improving diagnostic accuracy, supporting clinical decision-making, optimizing and personalizing surgical and/or treatment planning and monitoring patients' outcome [2,5]. In 2020, a study by Federer et al. [5] provided an overview of such applications in orthopedics. Since then, a growing body of research has followed with numerous publications. ...
Article
Full-text available
Background and Objectives: This scoping review explores the current state of the art of AI-based applications in the field of orthopedics, focusing on its implementation in diagnostic imaging and preoperative planning of knee joint procedures. Materials and Methods: The search was carried out using the recognized scholarly databases PubMed, Medline and Embase and was set to identify original research addressing AI applied to imaging in knee diagnosis and surgical planning, written in English and published up to January 2025. Results: The search produced 1612 papers, of which 36 were included in our review. All papers addressed AI applied to common imaging methods in clinical practice. Of these, thirty integrated AI-based tools with X-rays, one applied AI to X-rays to produce CT-like 3D reproductions, and two studies applied AI to MRI. Conclusions: Several AI tools have already been validated for enhancing the accuracy of measurements and detecting additional parameters in a shorter time compared to standard assessments. We expect these may soon be introduced into routine clinical practice to streamline a number of technical tasks and in some cases to replace the need for human intervention.
... By enhancing the disease diagnosis and treatment workflow with AI, the sensitivity of early OA screening can be significantly improved, and personalized treatment plans can be developed diagnosis, promoting timely and accurate intervention strategies and broadening the scope of predictive analysis in OA care. 90 As illustrated in Fig. 3, the analysis and diagnosis of OA from imaging data typically follow a generally consistent and wellestablished workflow, encompassing preprocessing, segmentation of regions of interest, extraction of texture features, and ultimately, OA diagnosis. Traditional methods have long provided clinicians with reliable protocols for each step of imaging analysis. ...
Article
Full-text available
Osteoarthritis (OA) is a degenerative joint disease with significant clinical and societal impact. Traditional diagnostic methods, including subjective clinical assessments and imaging techniques such as X-rays and MRIs, are often limited in their ability to detect early-stage OA or capture subtle joint changes. These limitations result in delayed diagnoses and inconsistent outcomes. Additionally, the analysis of omics data is challenged by the complexity and high dimensionality of biological datasets, making it difficult to identify key molecular mechanisms and biomarkers. Recent advancements in artificial intelligence (AI) offer transformative potential to address these challenges. This review systematically explores the integration of AI into OA research, focusing on applications such as AI-driven early screening and risk prediction from electronic health records (EHR), automated grading and morphological analysis of imaging data, and biomarker discovery through multi-omics integration. By consolidating progress across clinical, imaging, and omics domains, this review provides a comprehensive perspective on how AI is reshaping OA research. The findings have the potential to drive innovations in personalized medicine and targeted interventions, addressing longstanding challenges in OA diagnosis and management.
... While the potential of AI programs in the field of orthopedic imaging is considerable, further research is required to facilitate the widespread integration of these technologies into clinical applications. The development of robust studies designed in accordance with standard reporting guidelines is encouraged to facilitate the easy adaptation of AI models to real-world conditions [21,22]. Consequently, it is evident that artificial intelligence technologies offer significant innovations in the field of orthopedic imaging, with a growing variety of applications in this domain. ...
Article
Full-text available
Background/Objectives: Artificial intelligence (AI) has attracted great interest due to its applicability in many fields. The adoption of visual illustration techniques produced by AI in the field of graphic design has further expanded the field of use of this technology. This study focuses on foot anatomy illustrations generated by Adobe Firefly and Microsoft Designer Image Creator applications, evaluating them based on detail, clarity, anatomical realism, accuracy, and aesthetic appeal. Methods: The illustrations were created using text-based scripts, and five anatomists compared them to traditional illustrations from the Sobotta Atlas of Human Anatomy. Results: Fleiss’ Kappa statistic was used to analyze consistency among expert evaluations. For the four figures generated by both AI applications, Fleiss’ Kappa agreement was high. Adobe Firefly performed slightly better in illustrating phalanx and ankle bones, but its anatomical accuracy was lower for tarsal and metatarsal bones. Microsoft Designer Image Creator excelled in illustrating metatarsal bones, while its tarsal and phalanx illustrations were less anatomically accurate than Adobe Firefly and the atlas drawings. Both programs showed average realism in ankle structures, while the tarsal bones had low realism. Conclusions: Artificial intelligence applications within the scope of the study showed fast performance. Aesthetic appeal is dominant at first glance in the resulting drawings. In general, both applications have struggled to reflect anatomical reality.
... Looking to the future, 30.8% agreed that it would be useful to improve studying ML and DL to predict postoperative outcomes. This purpose has often been the subject of research, but it is still not possible to use it routinely due to the current poor accuracy, and further research is needed in this direction [5,6]. In the present study, 28% of the participants proposed to study more ML and DL to increase the image-based diagnosis of OA. ...
Article
Full-text available
Purpose Through an analysis of findings from a survey about the use of artificial intelligence (AI) in orthopaedics, the aim of this study was to establish a scholarly foundation for the discourse on AI in orthopaedics and to elucidate key patterns, challenges and potential future trajectories for AI applications within the field. Methods The International Society of Arthroscopy, Knee Surgery and Orthopaedic Sports Medicine (ISAKOS) Young Professionals Task Force developed a survey to collect feedback on issues related to the use of AI in the orthopaedic field. The survey included 26 questions. Data obtained from the completed questionnaires were transferred to a spreadsheet and then analyzed. Results Two hundred and eleven orthopaedic surgeons completed the survey. The survey encompassed responses from a diverse cohort of orthopaedic professionals, predominantly comprising males (92.9%). There was wide representation across all geographic regions. A notable proportion (52.1%) reported uncertainty or lack of differentiation among AI, machine learning and deep learning (47.9%). Respondents identified imaging‐based diagnosis (60.2%) as the primary field of orthopaedics poised to benefit from AI. A considerable proportion (25.1%) reported using AI in their practice, with primary reasons including referencing scientific literature/publications (40.3%). The vast majority expressed interest in leveraging AI technologies (95.3%), demonstrating an inclination towards incorporating AI into orthopaedic practice. Respondents indicated specific areas of interest for further study, including prediction of patient outcomes after surgery (30.8%) and image‐based diagnosis of osteoarthritis (28%). Conclusions This survey demonstrates that there is currently limited use of AI in orthopaedic practice, mainly due to a lack of knowledge about the subject, a lack of proven evidence of its real utility and high costs. These findings are in accordance with other surveys in the literature. However, there is also a high level of interest in its use in the future, in increased study and further research on the subject, so that it can be of real benefit and make AI an integral part of the orthopaedic surgeon's daily work. Level of Evidence Level IV, survey study.
... Orthopedics, with its demanding environmental requirements, has witnessed a revolutionary advancement in the field of automation efforts in measuring scales for various diagnoses [31] and treatment planning [32]. The HKA angle, a critical measurement in patient diagnosis, has been the primary focus of such attempts. ...
Article
Full-text available
Background Accurate measurement of the hip-knee-ankle (HKA) angle is essential for informed clinical decision-making in the management of knee osteoarthritis (OA). Knee OA is commonly associated with varus deformity, where the alignment of the knee shifts medially, leading to increased stress and deterioration of the medial compartment. The HKA angle, which quantifies this alignment, is a critical indicator of the severity of varus deformity and helps guide treatment strategies, including corrective surgeries. Current manual methods are labor-intensive, time-consuming, and prone to inter-observer variability. Developing an automated model for HKA angle measurement is challenging due to the elaborate process of generating handcrafted anatomical landmarks, which is more labor-intensive than the actual measurement. This study aims to develop a ResNet-based deep learning model that predicts the HKA angle without requiring explicit anatomical landmark annotations and to assess its accuracy and efficiency compared to conventional manual methods. Methods We developed a deep learning model based on the variants of the ResNet architecture to process lower limb radiographs and predict HKA angles without explicit landmark annotations. The classification performance for the four stages of varus deformity (stage I: 0°–10°, stage II: 10°–20°, stage III: > 20°, others: genu valgum or normal alignment) was also evaluated. The model was trained and validated using a retrospective cohort of 300 knee OA patients (Kellgren-Lawrence grade 3 or higher), with horizontal flip augmentation applied to double the dataset to 600 samples, followed by fivefold cross-validation. An extended temporal validation was conducted on a separate cohort of 50 knee OA patients. The model's accuracy was assessed by calculating the mean absolute error between predicted and actual HKA angles. Additionally, the classification of varus deformity stages was conducted to evaluate the model's ability to provide clinically relevant categorizations. Time efficiency was compared between the automated model and manual measurements performed by an experienced orthopedic surgeon. Results The ResNet-50 model achieved a bias of − 0.025° with a standard deviation of 1.422° in the retrospective cohort and a bias of − 0.008° with a standard deviation of 1.677° in the temporal validation cohort. Using the ResNet-152 model, it accurately classified the four stages of varus deformity with weighted F1-score of 0.878 and 0.859 in the retrospective and temporal validation cohorts, respectively. The automated model was 126.7 times faster than manual measurements, reducing the total time from 49.8 min to 23.6 sec for the temporal validation cohort. Conclusions The proposed ResNet-based model provides an efficient and accurate method for measuring HKA angles and classifying varus deformity stages without the need for extensive landmark annotations. Its high accuracy and significant improvement in time efficiency make it a valuable tool for clinical practice, potentially enhancing decision-making and workflow efficiency in the management of knee OA.
Article
Orthopedics is undergoing a transformative shift driven by personalized medical technologies that enhance precision, efficiency, and patient outcomes. Virtual surgical planning, robotic assistance, and real-time 3D navigation have revolutionized procedures like total knee arthroplasty and hip replacement, offering unparalleled accuracy and reducing recovery times. Integrating artificial intelligence, advanced imaging, and 3D-printed patient-specific implants further elevates surgical precision, minimizes intraoperative complications, and supports individualized care. In sports orthopedics, wearable sensors and motion analysis technologies are revolutionizing diagnostics, injury prevention, and rehabilitation, enabling real-time decision-making and improved patient safety. Health-tracking devices are advancing recovery and supporting preventative care, transforming athletic performance management. Concurrently, breakthroughs in biologics, biomaterials, and bioprinting are reshaping treatments for cartilage defects, ligament injuries, osteoporosis, and meniscal damage. These innovations are poised to establish new benchmarks for regenerative medicine in orthopedics. By combining cutting-edge technologies with interdisciplinary collaboration, the field is redefining surgical standards, optimizing patient care, and paving the way for a highly personalized and efficient future.
Book
Ortopedi 4.0: Dijitalleşme, Yapay Zeka ve Yenilikçi Teknolojiler Nihat Demirhan DEMİRKIRAN Ramadan ÖZMANEVRA Geleceğin Ortopedisi: Dijitalleşme ve Yapay Zekanın Rolü Kadir GÜLNAHAR Ortopedide Dijitalleşmenin Tarihçesi ve Geleceği Burak YILDIRIM Kişiselleştirilmiş Ortopedik Tedavi İlyas KABAN Hasta Sonuçlarını İyileştirmek İçin Yapay Zekâ Kullanımı Ahmet ACAR Ortopedik Acil Durumlarda Dijital Çözümler ve Yapay Zeka Destekli Müdahaleler Cengiz Han KANTAR Ortopedik Tedavide 3 Boyutlu Baskı ve Yapay Zeka İdris PERKTAŞ Chatgpt İle Ortopedik Tanı ve Tedavi Süreçlerinde Destek Mehmet Yiğit GÖKMEN Chatgpt’nin Ortopedik Hasta Takibinde Kullanımı: Olgu Sunumları ve Örnekler Ömer TORUN Ortopedik Eğitimde Dijital Araçlar ve Simülasyonlar Cengiz Han KANTAR Ortopedik Hastaların Bilgilendirmesinde Chatgpt’nin Rolü Eren BULUT Robotik Cerrahi ve Ortopedi: Yeni Nesil Cerrahlar Tahir Burak SARITAŞ
Article
Full-text available
Orthopaedic diseases, which affect millions of people globally, present significant diagnostic challenges, often leading to long-term disability and chronic pain. There is an ongoing debate across the literature regarding the trustworthiness of artificial intelligence (AI) in detecting orthopaedic diseases. This systematic review aims to provide a comprehensive taxonomy of AI applications in orthopaedic disease detection. A thorough literature search was conducted across five major databases (Science Direct, Scopus, IEEE Xplore, PubMed, and Web of Science) covering publications from January 2019 to 2024. Following rigorous screening on the basis of predefined inclusion criteria, 85 relevant studies were identified and critically evaluated. For the first time, this review classifies AI contributions into six key categories of orthopaedic conditions on the basis of medical perspective: arthritis, tumours, deformities, fractures, osteoporosis, and general bone abnormalities. In addition to analyzing motivations, challenges, and recommendations for future research, this review highlights the various AI techniques employed, including deep learning (DL), machine learning (ML), explainable AI (XAI), fuzzy logic, and multicriteria decision-making (MCDM), as well as the datasets utilized. Furthermore, the trustworthiness of AI models is evaluated on the basis of seven AI trustworthiness components, aligned with European Union guidelines, within each category. These findings underscore the need for high-quality research to ensure that AI computational systems in orthopaedic disease detection are reliable, safe, and ethical. Future research should focus on optimizing AI algorithms, improving dataset diversity, and addressing ethical and regulatory challenges to ensure successful integration into clinical practice.
Article
Background Prerequisites of artificial intelligence (AI) are a huge unbiased data set, linking them with different “clouds,” a powerful computer with high processing ability, and application of statistical methods to produce a complex algorithm. The concept “can machine think” developed in the early 1940s with the turning rule. The progress was slow till 2000 and then steadily increased and accelerated since 2012. Data scientists used complex statistical mathematics and computer engineers developed machines that allow machine learning, deep learning, and artificial neural network as subsets of AI. These nodes in layers can send feedback to refine its own decision. Among various fields, applications in orthopedics are in stage of validation. Clinical applications are growing fast. Use in orthopedic subfields such as joint disorders and arthroplasty, spine, fractures, sports medicine, and orthopedic oncology are promising. Aims and Objectives Orthopedic clinicians have limited scope to be accustomed with the enmeshed statistical basis. They will be more interested in the application of AI in orthopedics in their practice. This review article is focused on some historical background and applicability of different ML models in various orthopedic domains. The future benefits and limitations are also outlined. Methodology In this descriptive narrative exploratory review, qualitative information is collected randomly from a variety of sources. Conclusion AI is the revolution in industrial development. It has reached the present state by the efforts and endeavors by engineers and data scientists. Its utility has been validated in orthopedic fields and is ready to use in regular practice. However, ethical issues including the “Job-Killing” effect, identification of accountable persons in situations where AI makes some mistakes, and biased data are not yet addressed. Regulating bodies are working on it.
Article
Full-text available
Introduction Standards for Reporting of Diagnostic Accuracy Study (STARD) was developed to improve the completeness and transparency of reporting in studies investigating diagnostic test accuracy. However, its current form, STARD 2015 does not address the issues and challenges raised by artificial intelligence (AI)-centred interventions. As such, we propose an AI-specific version of the STARD checklist (STARD-AI), which focuses on the reporting of AI diagnostic test accuracy studies. This paper describes the methods that will be used to develop STARD-AI. Methods and analysis The development of the STARD-AI checklist can be distilled into six stages. (1) A project organisation phase has been undertaken, during which a Project Team and a Steering Committee were established; (2) An item generation process has been completed following a literature review, a patient and public involvement and engagement exercise and an online scoping survey of international experts; (3) A three-round modified Delphi consensus methodology is underway, which will culminate in a teleconference consensus meeting of experts; (4) Thereafter, the Project Team will draft the initial STARD-AI checklist and the accompanying documents; (5) A piloting phase among expert users will be undertaken to identify items which are either unclear or missing. This process, consisting of surveys and semistructured interviews, will contribute towards the explanation and elaboration document and (6) On finalisation of the manuscripts, the group’s efforts turn towards an organised dissemination and implementation strategy to maximise end-user adoption. Ethics and dissemination Ethical approval has been granted by the Joint Research Compliance Office at Imperial College London (reference number: 19IC5679). A dissemination strategy will be aimed towards five groups of stakeholders: (1) academia, (2) policy, (3) guidelines and regulation, (4) industry and (5) public and non-specific stakeholders. We anticipate that dissemination will take place in Q3 of 2021.
Article
Full-text available
Deep learning (DL) has the potential to transform medical diagnostics. However, the diagnostic accuracy of DL is uncertain. Our aim was to evaluate the diagnostic accuracy of DL algorithms to identify pathology in medical imaging. Searches were conducted in Medline and EMBASE up to January 2020. We identified 11,921 studies, of which 503 were included in the systematic review. Eighty-two studies in ophthalmology, 82 in breast disease and 115 in respiratory disease were included for meta-analysis. Two hundred twenty-four studies in other specialities were included for qualitative review. Peer-reviewed studies that reported on the diagnostic accuracy of DL algorithms to identify pathology using medical imaging were included. Primary outcomes were measures of diagnostic accuracy, study design and reporting standards in the literature. Estimates were pooled using random-effects meta-analysis. In ophthalmology, AUC’s ranged between 0.933 and 1 for diagnosing diabetic retinopathy, age-related macular degeneration and glaucoma on retinal fundus photographs and optical coherence tomography. In respiratory imaging, AUC’s ranged between 0.864 and 0.937 for diagnosing lung nodules or lung cancer on chest X-ray or CT scan. For breast imaging, AUC’s ranged between 0.868 and 0.909 for diagnosing breast cancer on mammogram, ultrasound, MRI and digital breast tomosynthesis. Heterogeneity was high between studies and extensive variation in methodology, terminology and outcome measures was noted. This can lead to an overestimation of the diagnostic accuracy of DL algorithms on medical imaging. There is an immediate need for the development of artificial intelligence-specific EQUATOR guidelines, particularly STARD, in order to provide guidance around key issues in this field.
Article
Full-text available
Purpose To assess technical feasibility, accuracy, safety and patient radiation exposure of a novel navigational tool integrating augmented reality (AR) and artificial intelligence (AI), during percutaneous vertebroplasty of patients with vertebral compression fractures (VCFs). Material and methods This prospective parallel randomised open trial compared the trans-pedicular access phase of percutaneous vertebroplasty across two groups of 10 patients, electronically randomised, with symptomatic single-level VCFs. Trocar insertion was performed using AR/AI-guidance with motion compensation in Group A, and standard fluoroscopy in Group B. The primary endpoint was technical feasibility in Group A. Secondary outcomes included the comparison of Groups A and B in terms of accuracy of trocar placement (distance between planned/actual trajectory on sagittal/coronal fluoroscopic images); complications; time for trocar deployment; and radiation dose/fluoroscopy time. Results Technical feasibility in Group A was 100%. Accuracy in Group A was 1.68 ± 0.25 mm (skin entry point), and 1.02 ± 0.26 mm (trocar tip) in the sagittal plane, and 1.88 ± 0.28 mm (skin entry point) and 0.86 ± 0.17 mm (trocar tip) in the coronal plane, without any significant difference compared to Group B (p > 0.05). No complications were observed in the entire population. Time for trocar deployment was significantly longer in Group A (642 ± 210 s) than in Group B (336 ± 60 s; p = 0.001). Dose–area product and fluoroscopy time were significantly lower in Group A (182.6 ± 106.7 mGy cm² and 5.2 ± 2.6 s) than in Group B (367.8 ± 184.7 mGy cm² and 10.4 ± 4.1 s; p = 0.025 and 0.005), respectively. Conclusion AR/AI-guided percutaneous vertebroplasty appears feasible, accurate and safe, and facilitates lower patient radiation exposure compared to standard fluoroscopic guidance. Graphic abstract These slides can be retrieved under Electronic Supplementary Material. Open image in new window
Article
Full-text available
Studies using quantitative computed tomography (QCT) and data-driven image analysis techniques have shown that trabecular and cortical volumetric bone mineral density (vBMD) can improve the hip fracture prediction of dual-energy X-ray absorptiometry areal BMD (aBMD). Here, we hypothesize that (1) QCT imaging features of shape, density and structure derived from data-driven image analysis techniques can improve the hip fracture discrimination of classification models based on mean femoral neck aBMD (Neck.aBMD), and (2) that data-driven cortical bone thickness (Ct.Th) features can improve the hip fracture discrimination of vBMD models. We tested our hypotheses using statistical multi-parametric modeling (SMPM) in a QCT study of acute hip fracture of 50 controls and 93 fragility fracture cases. SMPM was used to extract features of shape, vBMD, Ct.Th, cortical vBMD, and vBMD in a layer adjacent to the endosteal surface to develop hip fracture classification models with machine learning logistic LASSO. The performance of these classification models was evaluated in two aspects: (1) their hip fracture classification capability without Neck.aBMD, and (2) their capability to improve the hip fracture classification of the Neck.aBMD model. Assessments were done with 10-fold cross-validation, areas under the receiver operating characteristic curve (AUCs), differences of AUCs, and the integrated discrimination improvement (IDI) index. All LASSO models including SMPM-vBMD features, and the majority of models including SMPM-Ct.Th features performed significantly better than the Neck.aBMD model; and all SMPM features significantly improved the hip fracture discrimination of the Neck.aBMD model (Hypothesis 1). An interesting finding was that SMPM-features of vBMD also captured Ct.Th patterns, potentially explaining the superior classification performance of models based on SMPM-vBMD features (Hypothesis 2). Age, height and weight had a small impact on model performances, and the model of shape, vBMD and Ct.Th consistently yielded better performances than the Neck.aBMD models. Results of this study clearly support the relevance of bone density and quality on the assessment of hip fracture, and demonstrate their potential on patient and healthcare cost benefits.
Article
Full-text available
Background: Rates of adverse events following spine surgery vary widely by patient-, diagnosis-, and procedure-related factors. It is critical to understand the expected rates of complications and to be able to implement targeted efforts at limiting these events. Purpose: To develop and evaluate a set of predictive models for common adverse events after spine surgery. Study design: A retrospective cohort study. Patient samples: We extracted 345,510 patients from the Truven MarketScan (MKS) and MarketScan Medicaid Databases and 760,724 patients from the Centers for Medicare and Medicaid Services (CMS) Medicare database who underwent spine surgeries between 2009 and 2013. Outcome measures: Overall adverse event (AE) occurrence and types of AE occurrence during the 30-day postoperative follow-up. Methods: We applied a least absolute shrinkage and selection operator regularization method and a logistic regression approach for predicting the risks of an overall AE and the top six most commonly observed AEs. Predictors included patient demographics, location of the spine procedure, comorbidities, type of surgery performed, and preoperative diagnosis. Results: The median ages of MKS and CMS patients were 49 years and 69, respectively. The most frequent individual AE was a cardiac dysfunction in CMS (10.6%) patients and a pulmonary complication (4.7%) in MKS. The area under the curve (AUC) of a prediction model for an overall AE was 0.7. Among the six individual prediction models, the model for predicting the risk of a pulmonary complication showed the greatest accuracy (AUC 0.76), and the range of AUC for these six models was 0.7 and 0.76. Medicaid status was one of the most important factors in predicting the occurrences of AEs; Medicaid recipients had increased odds of AEs by 20%-60% compared with non-Medicaid patients (odds ratios 1.28-1.6; p<10-10). Logistic regression showed higher AUCs than least absolute shrinkage and selection operator across these different models. Conclusions: We present a set of predictive models for AEs following spine surgery that account for patient-, diagnosis-, and procedure-related factors which can contribute to patient-counseling, accurate risk adjustment, and accurate quality metrics.
Article
This annotation briefly reviews the history of artificial intelligence and machine learning in health care and orthopaedics, and considers the role it will have in the future, particularly with reference to statistical analyses involving large datasets. Cite this article: Bone Joint J 2019;101-B:1476–1478
Article
Purpose: To validate a smartphone-operated, single-lead electrocardiography (1L-ECG) device (AliveCor KardiaMobile) with an integrated algorithm for atrial fibrillation (AF) against 12-lead ECG (12L-ECG) in a primary care population. Methods: We recruited consecutive patients who underwent 12L-ECG for any nonacute indication. Patients held a smartphone with connected 1L-ECG while local personnel simultaneously performed 12L-ECG. All 1L-ECG recordings were assessed by blinded cardiologists as well as by the smartphone-integrated algorithm. The study cardiologists also assessed all 12L-recordings in random order as the reference standard. We determined the diagnostic accuracy of the 1L-ECG in detecting AF or atrial flutter (AFL) as well as any rhythm abnormality and any conduction abnormality with the simultaneously performed 12L-ECG as the reference standard. Results: We included 214 patients from 10 Dutch general practices. Mean ± SD age was 64.1 ± 14.7 years, and 53.7% of the patients were male. The 12L-ECG diagnosed AF/AFL, any rhythm abnormality, and any conduction abnormality in 23, 44, and 28 patients, respectively. The 1L-ECG as assessed by cardiologists had a sensitivity and specificity for AF/AFL of 100% (95% CI, 85.2%-100%) and 100% (95% CI, 98.1%-100%). The AF detection algorithm had a sensitivity and specificity of 87.0% (95% CI, 66.4%-97.2%) and 97.9% (95% CI, 94.7%-99.4%). The 1L-ECG as assessed by cardiologists had a sensitivity and specificity for any rhythm abnormality of 90.9% (95% CI, 78.3%-97.5%) and 93.5% (95% CI, 88.7%-96.7%) and for any conduction abnormality of 46.4% (95% CI, 27.5%-66.1%) and 100% (95% CI, 98.0%-100%). Conclusions: In a primary care population, a smartphone-operated, 1L-ECG device showed excellent diagnostic accuracy for AF/AFL and good diagnostic accuracy for other rhythm abnormalities. The 1L-ECG device was less sensitive for conduction abnormalities.
Article
Importance Typically defined as the top 5% of health care users, super-utilizers are responsible for an estimated 40% to 55% of all health care costs. Little is known about which factors may be associated with increased risk of long-term postoperative super-utilization. Objective To identify clusters of patients with distinct constellations of clinical and comorbid patterns who may be associated with an elevated risk of super-utilization in the year following elective surgery. Design, Setting, and Participants A retrospective longitudinal cohort study of 1 049 160 patients who underwent abdominal aortic aneurysm repair, coronary artery bypass graft, colectomy, total hip arthroplasty, total knee arthroplasty, or lung resection were identified from the 100% Medicare inpatient and outpatient Standard Analytic Files at all inpatient facilities performing 1 or more of the evaluated surgical procedures from 2013 to 2015. Data from 2012 to 2016 were used to evaluate expenditures in the year preceding and following surgery. Using a machine learning approach known as Logic Forest, comorbidities and interactions of comorbidities that put patients at an increased chance of becoming a super-utilizer were identified. All comorbidities, as defined by the Charlson (range, 0-24) and Elixhauser (range, 0-29) comorbidity indices, were used in the analysis. Higher scores indicated higher comorbidity burden. Data analysis was completed on November 16, 2018. Main Outcome and Measures Super-utilization of health care in the year following surgery. Results In total, 1 049 160 patients met inclusion criteria and were included in the analytic cohort. Their median (interquartile range) age was 73 (69-78) years, and approximately 40% were male. Super-utilizers comprised 4.8% of the overall cohort (n = 79 746) yet incurred 31.7% of the expenditures. Although the difference in overall expenditures per person between super-utilizers (4049)andlowusers(4049) and low users (2148) was relatively modest prior to surgery, the difference in expenditures between super-utilizers (79698)vslowusers(79 698) vs low users (2977) was marked in the year following surgery. Risk factors associated with super-utilization of health care included hemiplegia/paraplegia (odds ratio, 5.2; 95% CI, 4.4-6.2), weight loss (odds ratio, 3.5; 95% CI, 2.9-4.2), and congestive heart failure with chronic kidney disease stages I to IV (odds ratio, 3.4; 95% CI, 3.0-3.9). Conclusions and Relevance Super-utilizers comprised only a small fraction of the surgical population yet were responsible for a disproportionate amount of Medicare expenditure. Certain subpopulations were associated with super-utilization of health care following surgical intervention despite having lower overall use in the preoperative period.
Article
The parameters extracted from quantitative computed tomography (QCT) images were used to predict vertebral strength through machine learning models, and the highly accurate prediction indicated that it may be a promising approach to assess fracture risk in clinics. Introduction Vertebral fracture is common in elderly populations. The main factor contributing to vertebral fracture is the reduced vertebral strength. This study aimed to predict vertebral strength based on clinical QCT images by using machine learning. Methods Eighty subjects with QCT data of lumbar spine were randomly selected from the MrOS cohorts. L1 vertebral strengths were computed by QCT-based finite element analysis. A total of 58 features of each L1 vertebral body were extracted from QCT images, including grayscale distribution, grayscale values of 39 partitioned regions, BMDQCT, structural rigidity, axial rigidity, and BMDQCTAmin. Feature selection and dimensionality reduction were used to simplify the 58 features. General regression neural network and support vector regression models were developed to predict vertebral strength. Performance of prediction models was quantified by the mean squared error, the coefficient of determination, the mean bias, and the SD of bias. Results The 58 parameters were simplified to five features (grayscale value of the 60% percentile, grayscale values of three specific partitioned regions, and BMDQCTAmin) and nine principal components (PCs). High accuracy was achieved by using the five features or the nine PCs to predict vertebral strength. Conclusions This study provided an effective approach to predict vertebral strength and showed that it may have great potential in clinical applications for noninvasive assessment of vertebral fracture risk.
Article
Background: Driven by the rapid development of big data and processing power, artificial intelligence and machine learning (ML) applications are poised to expand orthopedic surgery frontiers. Lower extremity arthroplasty is uniquely positioned to most dramatically benefit from ML applications given its central role in alternative payment models and the value equation. Methods: In this report, we discuss the origins and model specifics behind machine learning, consider its progression into healthcare, and present some of its most recent advances and applications in arthroplasty. Results: A narrative review of artificial intelligence and ML developments is summarized with specific applications to lower extremity arthroplasty, with specific lessons learned from osteoarthritis gait models, joint-specific imaging analysis, and value-based payment models. Conclusion: The advancement and employment of ML provides an opportunity to provide data-driven, high performance medicine that can rapidly improve the science, economics, and delivery of lower extremity arthroplasty.