Yixing Fu’s research while affiliated with Southwest Forestry University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (4)


Some bird images, Mel spectrograms, and WT spectrograms.
Framework diagram of this paper.
Data preprocessing process.
ISTA-Net⁺ framework.
Schematic diagram of the kth stage of CBAM_ISTA-Net⁺.

+11

Combining CBAM and Iterative Shrinkage-Thresholding Algorithm for Compressive Sensing of Bird Images
  • Article
  • Full-text available

September 2024

·

38 Reads

Dan Lv

·

·

Danjv Lv

·

[...]

·

Zhun Li

Bird research contributes to understanding species diversity, ecosystem functions, and the maintenance of biodiversity. By analyzing bird images and the audio of birds, we can monitor bird distribution, abundance, and behavior to better understand the health of ecosystems. However, bird images and audio involve a vast amount of data. To improve the efficiency of data transmission and storage efficiency and save bandwidth, compressive sensing can overcome this challenge. Compressive sensing is a technique that uses the sparsity of signals to recover original data from a small number of linear measurements. This paper introduces a deep neural network based on the Iterative Shrinkage Thresholding Algorithm (ISTA) and a Convolutional Block Attention Module (CBAM), CBAM_ISTA-Net⁺, for the compressive reconstruction of bird images, audio Mel spectrograms and wavelet transform spectrograms. Using 45 bird species as research subjects, including 20 bird images, 15 audio-generated Mel spectrograms, and 10 audio wavelet transform (WT) spectrograms, the experimental results show that CBAM_ISTA-Net⁺ achieves a higher peak signal-to-noise ratio (PSNR) at different compression ratios. At a compression ratio of 50%, the average PSNR of the three datasets reaches 33.62 dB, 55.76 dB, and 38.59 dB, while both the Mel spectrogram and wavelet transform spectrogram achieve more than 30 dB at compression ratios of 25–50%. These results highlight the effectiveness of CBAM_ISTA-Net⁺ in maintaining high reconstruction quality even under significant compression, demonstrating its potential as a valuable tool for efficient data management in ecological research.

Download

Improved Broad Learning System for Birdsong Recognition

October 2023

·

65 Reads

·

5 Citations

Birds play a vital and indispensable role in biodiversity and environmental conservation. Protecting bird diversity is crucial for maintaining the balance of nature, promoting ecosystem health, and ensuring sustainable development. The Broad Learning System (BLS) exhibits an excellent ability to extract highly discriminative features from raw inputs and construct complex feature representations by combining feature nodes and enhancement nodes, thereby enabling effective recognition and classification of various birdsongs. However, within the BLS, the selection of feature nodes and enhancement nodes assumes critical significance, yet the model lacks the capability to identify high quality network nodes. To address this issue, this paper proposes a novel method that introduces residual blocks and Mutual Similarity Criterion (MSC) layers into BLS to form an improved BLS (RMSC-BLS), which makes it easier for BLS to automatically select optimal features related to output. Experimental results demonstrate the accuracy of the RMSC-BLS model for the three construction features of MFCC, d M F C C , and d s q u e n c e is 78.85%, 79.29%, and 92.37%, respectively, which is 4.08%, 4.50%, and 2.38% higher than that of original BLS model. In addition, compared with other models, our RMSC-BLS model shows superior recognition performance, has higher stability and better generalization ability, and provides an effective solution for birdsong recognition.



A Novel Wood Log Measurement Combined Mask R-CNN and Stereo Vision Camera

February 2023

·

208 Reads

·

6 Citations

Wood logs need to be measured for size when passing through customs to verify their quantity and volume. Due to the large number of wood logs needs through customs, a fast and accurate measurement method is required. The traditional log measurement methods are inefficient, have significant errors in determining the long and short diameters of the wood, and are difficult to achieve fast measurements in complex wood stacking environments. We use a Mask R-CNN instance segmentation model to detect the contour of the wood log and employ a binocular stereo camera to measure the log diameter. A rotation search algorithm centered on the wood contour is proposed to find long and short diameters and to optimal log size according to the Chinese standard. The experiments show that the Mask R-CNN we trained obtains 0.796 average precision and 0.943 IOUmask, and the recognition rate of wood log ends reaches 98.2%. The average error of the short diameter of the measurement results is 5.7 mm, the average error of the long diameter is 7.19 mm, and the average error of the diameter of the wood is 5.3 mm.

Citations (3)


... The dataset for this study was derived from Xeno-canto [33]. It comprises 263 audio recordings from 16 bird species belonging to 7 orders, 9 families, and 15 genera. ...

Reference:

A Bird Vocalization Classification Method Based on Bidirectional FBank with Enhanced Robustness
Improved Broad Learning System for Birdsong Recognition

... Classification complexity is inversely related to the quality of training data, often measured in terms of sparsity and dimensionality (Ho and Basu, 2002;Lorena et al., 2019). While deep-learning classifiers can be trained with incomplete, sparse, or weakly labeled data (e.g., few-shot learning) (Fu et al., 2023;Wang et al., 2020), the classification performance may be insufficient for many ecological applications. A regional scope may also facilitate better model tuning through more resources per species, for example species-specific frequency ranges and spectrogram resolution, or adding additional training data to address common pairwise misclassifications. ...

Classification of birdsong spectrograms based on DR-ACGAN and dynamic convolution
  • Citing Article
  • August 2023

Ecological Informatics

... LiDAR technology uses lasers to measure distance, creating detailed 3D point clouds of the object being measured, which can be incredibly precise for volume calculations [2], and could be applied for autonomous forestry tasks, such as, navigation, tree detection, and species classification [14]. An additional innovative technique is through detection and measurement of diameters using computer vision techniques [15,16]. ...

A Novel Wood Log Measurement Combined Mask R-CNN and Stereo Vision Camera