A preview of this full-text is provided by Springer Nature.
Content available from Stochastic Environmental Research and Risk Assessment
This content is subject to copyright. Terms and conditions apply.
ORIGINAL PAPER
Stochastic Environmental Research and Risk Assessment (2025) 39:613–637
https://doi.org/10.1007/s00477-024-02885-y
1 Introduction
In recent years, there has been increased attention to land-
slides and related slope movements due to their uncontrol-
lable nature (Khalil et al. 2022). These natural events, which
can cause signicant damage to lives and property, occur
frequently worldwide, especially in mountainous regions
(Sarkar and Kanungo 2004; Bera et al. 2019). Landslides,
characterized by the sudden collapse of rock or soil on a
slope (Varnes 1978; Goetz et al. 2011; Chakraborty and
Pradhan 2012), are often triggered by factors such as heavy
rainfall, earthquakes, rapid snowmelt, and human activities
(Mersha and Meten 2020). Despite their smaller scale than
other natural disasters, landslides are common and can be
highly destructive (Trigila et al. 2015; Nohani et al. 2019).
In India, the region most vulnerable to landslides include
the Himalayan mountain ranges, the Meghalaya plateau,
and the Western Ghats (Mondal and Mandal 2019).
Prasanya Sarkar
sarkarprasanya@gmail.com
Madhumita Mondal
mgayen79@gmail.com
Alok Sarkar
aloksarkar162@gmail.com
Shasanka Kumar Gayen
gshasanka@gmail.com
1 Department of Geography, Cooch Behar Panchanan Barma
University, Vivekananda Street, Cooch Behar,
West Bengal 736101, India
2 Department of Geography, Bhairab Ganguli College,
Kolkata, West Bengal 700056, India
3 University of Calcutta, Kolkata, West Bengal 700019, India
Abstract
Landslide susceptibility mapping is crucial for reducing risks in culturally and historically signicant areas like the
Darjeeling Toy Train route, a UNESCO World Heritage site. In this study, the risk of landslides along this road is evalu-
ated using Geographic Information System (GIS) tools and advanced machine learning models, such as Support Vec-
tor Machine (SVM), Gradient Boosting Machine (GBM), Logistic Regression, and Classication and Regression Trees
(CART). It uses a set of 512 landslide and non-landslide sites, with a 70:30 split between training and testing. Within the
research area, thirteen topographical, hydrological, and geological factors linked to landslides are shown as GIS layers to
make maps of landslide susceptibility (LSM). The study area particularly vulnerable to various types of landslides, includ-
ing debris slides, rock falls, and soil slips. ROC–AUC results show that the SVM model did the best (0.813), followed by
GBM (0.807), Logistic Regression (0.797), and CART (0.781). SVM had the highest accuracy rate at 83.2%, followed by
GBM at 81.5% and LR at 80.3%. CART had the lowest overall accuracy rate at 78.6%. Furthermore, confusion matrix
analysis showed that SVM and Logistic Regression were better at nding actual landslide-prone areas, with 84.6% and
82.1% recall rates, respectively. This made them more accurate in predicting high-risk areas. Susceptibility levels were
categorized, revealing high-risk areas like Darjeeling and Rishihat and safer areas like Kurseong and Mohanbari. For
lowering the risk of landslides and protecting this historic route, these results are very useful for land management and
disaster preparation.
Keywords Darjeeling Toy Train · Landslide-prone areas · Support vector machine (SVM) · Gradient boosting machine
(GBM) · Logistic regression · CART · Disaster preparedness
Accepted: 11 December 2024 / Published online: 31 December 2024
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024
Landslide susceptibility assessment for the Darjeeling Toy Train route:
a GIS and machine learning approach
PrasanyaSarkar1· MadhumitaMondal2· AlokSarkar3· Shasanka KumarGayen1
1 3
Content courtesy of Springer Nature, terms of use apply. Rights reserved.