Yiran Yang’s research while affiliated with Academy of Chinese Culture and Health Sciences and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (11)


Beyond the limitation of monocular 3D detector via knowledge distillation
  • Conference Paper

October 2023

·

5 Reads

·

3 Citations

Yiran Yang

·

·

·

[...]

·

Xinming Li

GAL: Graph-Induced Adaptive Learning for Weakly Supervised 3D Object Detection

September 2023

·

40 Reads

·

5 Citations

IEEE Transactions on Intelligent Transportation Systems

Weakly Supervised 3D Object Detection (WS3DOD) aims to perform 3D object detection with little reliance on 3D labels, which greatly reduces the cost of 3D annotations. In recent literature, the pseudo-label-based approach brings impressive performance, which generates 3D pseudo-labels from 2D bounding boxes. Despite their success, two key issues remain unresolved that reduce the quality of 3D pseudo-labels: 1) the existing local object locating algorithm can not capture complete clusters of points globally, and 2) the existing algorithm can not capture sparse points caused by the unevenly distributed points obtained by LiDAR cameras. Hence, we propose GAL, a Graph-induced Adaptive Learning algorithm, to generate 3D pseudo-labels. First, we propose the Cluster Locating algorithm based on the Minimum Spanning Tree (MST) to globally locate the objects, which can leverage the characteristic that points inside an object are compact while points between objects are discrete. Second, we propose a density-guided adaptive learning algorithm to optimise the Cluster Locating algorithm, named Cuboid Drift. Cuboid Drift considers the inhomogeneous distribution of reflected points on different reflective surfaces of LiDAR imaging. Finally, 3D pseudo-labels generated by GAL are leveraged to train 3D detectors. Extensive experiments on the challenging KITTI and DAIR-V2X-V dataset demonstrate that GAL without 3D labels can be comparable with strongly supervised approaches and outperforms the previous state-of-the-art WS3DOD methods. Moreover, our method saves 88% of the time spent on pseudo-label generation.



Basic information of dumpsite dataset and typical examples of four categories
a The proportion of the four categories. b Quantity distribution of dumpsite samples in different countries. (The number of dumpsites in China is relatively large since data in China is easier to obtain in this work, and our validation results show that the proposed method has the generalisability to detect dumpsites globally.) c Geographic location of all selected cities in our dataset for training and verification. d Typical examples and characteristics of the four types of dumpsites.
Presentation and confirmation of dumpsite detection results
a–c Photo of a domestic waste dump located in Beijing and its feature distribution in the model, which are shown with the owner’s permission. Supplementary Figure 6 provides an enlarged view of the highlighted areas in c. d Geographic distribution of dumpsites in the selected area in Shanghai. e–g Our model extracts the characteristics of an agricultural waste and uses a rectangular box to mark its location.
Geographical distribution and Global Dumpsite Index (GDI) of selected areas for spatial statistics experiments
a Global distribution and quantity comparison of 28 areas for spatial analysis. Red and black circles represent two areas of the same country, and the area of the circle represents the number of dumpsites. b GDI and percentage of different types of dumpsites in 28 areas. The order is determined by GDI, and different colours represent different types of dumpsites.
Changes in the three types of dumpsites and the total number of dumpsites in the four selected cities from 2015 to 2019
The four tables record the trends of Domestic Waste, Construction Waste, Agricultural Waste, and total dumpsites in these areas from 2015 to 2019.
Changes in the location of four dumpsites in five years
Line I, Demolition of houses forms construction waste, which is then built into lawns. Line II, Illegal dumpsites are cleared and built into a green belt. Line III, A dedicated dumpsite. Line IV, Construction waste formed by the demolition of old houses were subsequently built into office buildings.

+2

Revealing influencing factors on global waste distribution via deep-learning based dumpsite detection from satellite imagery
  • Article
  • Full-text available

March 2023

·

562 Reads

·

56 Citations

With the advancement of global civilisation, monitoring and managing dumpsites have become essential parts of environmental governance in various countries. Dumpsite locations are difficult to obtain in a timely manner by local government agencies and environmental groups. The World Bank shows that governments need to spend massive labour and economic costs to collect illegal dumpsites to implement management. Here we show that applying novel deep convolutional networks to high-resolution satellite images can provide an effective, efficient, and low-cost method to detect dumpsites. In sampled areas of 28 cities around the world, our model detects nearly 1000 dumpsites that appeared around 2021. This approach reduces the investigation time by more than 96.8% compared with the manual method. With this novel and powerful methodology, it is now capable of analysing the relationship between dumpsites and various social attributes on a global scale, temporally and spatially.

Download


MiCro: Modeling Cross-Image Semantic Relationship Dependencies for Class-Incremental Semantic Segmentation in Remote Sensing Images

January 2023

·

15 Reads

·

9 Citations

IEEE Transactions on Geoscience and Remote Sensing

Continual learning is an effective way to overcome catastrophic forgetting (CF) in incremental learning for semantic segmentation. The existing continual semantic segmentation (CSS) methods of remote sensing (RS) ignore the semantic relationships among pixels across different images, which will lead to disappointing segmentation results, such as edge pixel misclassification and small object omission. In this article, we propose a framework for modeling cross-image semantic relationship dependencies (MiCro), which aims to learn an interclass separable and intraclass cohesive feature space from the pixel relationships across various images to ensure that learned categories can prevent CF in the incremental process. Specifically, we exploit the relationships among pixels of images in minibatch to construct three losses: 1) cross-image feature relationship distillation (CFRD) loss, which builds a well-structured feature space; 2) cross-image intraclass feature cohesion (CIFC) loss, which is devised to make intraclass features more cohesive; and 3) cross-image class-area weighted cross-entropy (CCWCE) loss, which is mainly employed to inversely weight the proportion of category area in minibatch. The effectiveness of the proposed approach is demonstrated by extensive experiments on three RS semantic segmentation datasets from ISPRS Vaihingen, ISPRS Potsdam, and iSAID. MiCro is superior to the current most advanced methods in most incremental settings, especially improving mIoU by 11.59% on ISPRS Vaihingen, 13.17% on ISPRS Potsdam, and 15.01% on iSAID in the most difficult incremental settings, which promotes the CSS to a state-of-the-art (SOTA) level. The code will be available at https://github.com/RongXueE/MiCro .


Category Correlation and Adaptive Knowledge Distillation for Compact Cloud Detection in Remote Sensing Images

January 2022

·

17 Reads

·

13 Citations

IEEE Transactions on Geoscience and Remote Sensing

Cloud detection relying on deep convolutional neural networks (DCNNs) obtains remarkable accuracy gains at the expense of high computation and storage costs, which are difficult to deploy to resource-constrained devices, such as intelligent satellites. Recently, knowledge distillation (KD) has been a promising solution for a compact model. However, most existing KD methods only transfer the feature relationship of pairwise pixel, which fails to cope with thin clouds and cloud-like objects in complex scenes. Furthermore, those KD methods directly imitate the output of the complicated model regardless of the correctness. In this article, we propose a novel category correlation and adaptive KD (CAKD) framework for the lightweight cloud detection network. We design a category relational context (CRC) module to refine the structured pixel-category correlation from the teacher and student network. Then, we perform the category correlation distillation (CCD) to make the student model better address the intraclass consistency and the interclass difference, thus reducing the category confusion. Besides, a pixel-adaptive distillation (PAD) module is utilized to adaptively transfer the soft-output knowledge of a teacher model by extracting the teacher’s pixel prediction probability. Extensive experiments on Landsat 8, Landsat 7, Gaofen-2, Gaofen-1, and Google Earth dataset report the effectiveness and universality of our distillation method. The CAKD allows MobileNetV2 with 2.31M parameters and 4.63G float-point operations (FLOPs) to outperform advanced cloud detection methods without the added inference overhead.


Adaptive Knowledge Distillation for Lightweight Remote Sensing Object Detectors Optimizing

January 2022

·

32 Reads

·

80 Citations

IEEE Transactions on Geoscience and Remote Sensing

Lightweight object detector is currently gaining more and more popularity in remote sensing. In general, it is hard for lightweight detectors to achieve competitive performance compared with the traditional deep models, while knowledge distillation (KD) is a promising training method to tackle the issue. Since the background is more complicated and the object size varies extremely in remote sensing images, it will deliver lots of noise and affect the training performance when directly applying the existing KD methods. To tackle the above problems, we propose an adaptive reinforcement supervision distillation (ARSD) framework to promote the detection capability of the lightweight model. First, we put forward a multiscale core features imitation (MCFI) module for transferring the knowledge of features, which can adaptively select the multiscale core features of objects for distillation and focus more on the features of small objects by an area-weighted strategy. In addition, a strict supervision regression distillation (SSRD) module is designed to select the optimal regression results for distillation, which facilitates the student to effectively imitate the more precise regression output of the teacher network. Massive experiments on a large-scale Dataset for Object deTection in Aerial images (DOTA), object DetectIon in Optical Remote sensing images (DIOR), and Northwestern Polytechnical University Very-High-Resolution 10-class (NWPU VHR) datasets prove that ARSD outperforms the existing distillation state-of-the-art (SOTA) methods. Moreover, the performance of the lightweight model trained with our method transcends other classic heavy and lightweight detectors, which beneficiates the development of lightweight models.


Optimal Partition Assignment for Universal Object Detection

January 2022

·

30 Reads

·

2 Citations

IEEE Transactions on Multimedia

The label assignment problem is a core task in object detection, which mainly focuses on how to define the positive/negative samples during the training phase. Recent works have proved that label assignment is significant for performance improvement of the detector. In this paper, we propose an exquisite strategy that can dynamically assign labels according samples' joint scores (classification and location). Moreover, our strategy can apply to both 2D and 3D monocular detectors. In our strategy, we formulate label assignment as an optimization problem. Concretely, we first calculate the classification and location costs of each sample, which are treated as points in a two-dimensional coordinate system. Then an optimal divider line that minimizes the sum of point-to-line distances is designed to separate the positive/negative samples. An iterative Genetic Algorithm is employed in acquiring the optimal solution. Furthermore, a GIoU auxiliary branch is devised to keep sample selection consistent during the training and testing phase. Benefitting from the non-maximum suppression (NMS) that utilizes the joint scores of classification and location, excellent detection performance is achieved. Extensive experiments conducted on MS COCO, PASCAL VOC (2D object detection), and KITTI (3D object detection) verify the effectiveness and universality of our proposed Optimal Partition Assignment (OPA).


Statistical Sample Selection and Multivariate Knowledge Mining for Lightweight Detectors in Remote Sensing Imagery

January 2022

·

16 Reads

·

18 Citations

IEEE Transactions on Geoscience and Remote Sensing

In recent years, more concerns are shed on the lightweight detection model in remote sensing (RS), but it is difficult to reach a competitive performance relative to the deep model. Knowledge distillation has been verified as a promising method, which can promote the performance of the lightweight model without extra parameters. While there are two key issues of detection distillation, one is the sample selection and the other is the knowledge selection. Since the varying object size and complex features in RS, the existing methods based on the fixed threshold are incapable of selecting the optimal distillation samples and they also ignore the potential multivariate knowledge among RS samples simultaneously. In this article, we propose a statistical sample selection and multivariate knowledge mining framework. The statistical sample selection module formulates the task as the modeling and splitting of the probability distribution of sample selection cost, which is more suitable for dynamically choosing multiscale samples in RS and eliminates the distortion of previous static distillation selection. Furthermore, to mine the complex feature knowledge of samples in RS, we design a multivariate knowledge mining module, in which knowledge includes explicit and implicit knowledge. The proposed module validly delivers the core knowledge from the teacher model to the lightweight model. Massive experiments on three challenging RS datasets [a large-scale Dataset for Object deTection in Aerial images (DOTA), Northwestern Polytechnical University very-high-resolution 10-class (NWPU VHR-10), and object DetectIon in Optical Remote sensing images (DIOR)] prove that our method achieves state-of-the-art performance.


Citations (10)


... The encoded 2D feature maps are projected into BEV feature maps via LSS with a grid size of 0.2 m. To match the 0.4 m grid size used in K-Radar, we add a convolutional layer with stride 2. During training, we employ the depth information projected from LiDAR point clouds onto the image and, inspired by [51], we use the LiDAR backbone's (SECOND) BEV feature maps for distillation. As shown in Fig. 3, camera images have limited utility in adverse weather, so we train the camera encoder only on normal or overcast scenes (sequences 1-20) from K-Radar. ...

Reference:

Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera
Beyond the limitation of monocular 3D detector via knowledge distillation
  • Citing Conference Paper
  • October 2023

... Although the aforementioned methods have demonstrated relatively favorable outcomes in the field of vision classification, the investigation of parameter-efficient fine-tuning (PEFT) methods for dense prediction tasks remains scarce. Lorand (Yin et al. 2023), designed specifically for dense prediction tasks, utilizes a low-rank synthesis approach. In light of the current state of research, it inspires us to explore more effective approaches in this realm. ...

1% VS 100%: Parameter-Efficient Low Rank Adapter for Dense Predictions
  • Citing Conference Paper
  • June 2023

... These multiple inputs can have different shapes or lack semantic alignment, with the goal being to find their common mapping target. It can be used for many tasks, such as multi-phase radiology image analysis (Hu et al. 2023) and label assignment problem (Wei et al. 2023). Our manyto-one scheme differs from conventional multi-scale methods by offering a structure that can easily scale to accommodate different numbers and different shapes of inputs. ...

Guide the Many-to-One Assignment: Open Information Extraction via IoU-aware Optimal Transport
  • Citing Conference Paper
  • January 2023

... CIL tasks have disjointed data labeling spaces, and task identifiers are provided only during training. Currently, most of the research focuses on CIL tasks [39], [40], and fewer studies addressing DIL. This article primarily addresses the DIL problem. ...

MiCro: Modeling Cross-Image Semantic Relationship Dependencies for Class-Incremental Semantic Segmentation in Remote Sensing Images
  • Citing Article
  • January 2023

IEEE Transactions on Geoscience and Remote Sensing

... In light of these challenges, weak supervision seeks to leverage annotations that do not fully correspond to the targeted task, but that are substantially cheaper to label. In the case of the 3D object detection task, instead of annotating the complete 3D box, a cheaper annotation can be the center of the 3D box [32], the center in bird-eyeview [14] or a 2D box annotated from a synchronized camera view [12,13,16,27,30]. These weakly supervised models are often used as 3D pseudo-labelers providing 3D box supervision for off-the-shelf fully supervised 3D detectors. ...

GAL: Graph-Induced Adaptive Learning for Weakly Supervised 3D Object Detection
  • Citing Article
  • September 2023

IEEE Transactions on Intelligent Transportation Systems

... Several techniques have been proposed in literature to combine satellite and GIS data (Notarnicola, et al., 2004), to apply traditional CV based on spectral or textural features on RS images (Parrilli, et al., 2021;Vambol, et al., 2019) and, recently, to exploit Convolutional Neural Networks (CNN) to address advanced tasks, such as scene classification (Lavender, 2022;Parrilli, et al., 2021;Torres, et al., 2021) or accurate waste localization (Kruse, et al., 2023;Sun, et al., 2023;Yailymova, et al., 2022;Yang, et al., 2022;Zhou, et al., 2023). ...

Revealing influencing factors on global waste distribution via deep-learning based dumpsite detection from satellite imagery

... To address these challenges, researchers have proposed lightweight methods based on knowledge distillation, such as the self-supervised diversity knowledge distillation framework (SSD-KD), which effectively enhances the performance of lightweight models in classification tasks by integrating the instance relationship knowledge while significantly reducing the parameter count and computational requirements [28]. Additionally, through instance interaction and attribute-aware strategies, the dynamic interaction learning (DIL) framework achieves an outstanding performance for lightweight remote sensing detection models in complex, multi-scale scenarios [29]. ...

Dynamic Interactive Learning for Lightweight Detectors in Remote Sensing Imagery
  • Citing Article
  • January 2022

IEEE Transactions on Geoscience and Remote Sensing

... Object Detection in Remote Sensing Images (Yang et al. 2022b), but it neglected the relationship between different instances. Our method, instead, employs an information fidelity distillation strategy that enhances key knowledge transfer to the student while focusing on refining feature and minimizing noise interference. ...

Statistical Sample Selection and Multivariate Knowledge Mining for Lightweight Detectors in Remote Sensing Imagery
  • Citing Article
  • January 2022

IEEE Transactions on Geoscience and Remote Sensing

... We compared our DIPKD method with several state-ofthe-art knowledge distillation techniques, including FKD (Zhang and Ma 2020b), FGD (Yang et al. 2022c), MGD (Yang et al. 2022d), FRS (Zhixing et al. 2021), ARSD (Yang et al. 2022a), TWA , and IFKD the same experimental conditions and datasets. As shown in Table 2, on the SSDD dataset, other distillation methods overlooked the purity or imbalance of the teacher model's knowledge, leading to a decline in the student model's performance after distillation. ...

Adaptive Knowledge Distillation for Lightweight Remote Sensing Object Detectors Optimizing
  • Citing Article
  • January 2022

IEEE Transactions on Geoscience and Remote Sensing

... Its applications span various fields, including urban planning, disaster monitoring, road extraction, agricultural estimation, etc. [1][2][3][4]. Recently, popular deep neural networks [5] (DNNs) have achieved remarkable progress in this task, relying mainly on training on specific annotated datasets and testing [6][7][8][9][10][11]. However, these supervised learning methods require expensive and laboriously labeled images to obtain satisfactory performance. ...

Category Correlation and Adaptive Knowledge Distillation for Compact Cloud Detection in Remote Sensing Images
  • Citing Article
  • January 2022

IEEE Transactions on Geoscience and Remote Sensing