Bingtao Hu’s research while affiliated with Hefei University of Technology and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (4)


Crack image and the corresponding ground truth. The white crack area is only a very small part of the overall image
Diagram of the different backbone architectures. Both a and b are encoder-decoder networks, but b adds additional lateral connections to recover high-resolution feature maps. c is the proposed bilateral crack detection model. The green module represents the detail branch and the yellow one represents the semantic branch
Overall diagram of the proposed BiCrack. SPPM stands for Simple Pyramid Pooling Module. FFM stands for Feature Fusion Module. RB denotes the Residual Block. UPS stands for upsampling, which is implemented by deconvolution in this paper
a Stem block. Conv denotes convolution, MaxPooling denotes Max pooling, and C denotes the concatenate operation. N represents the base number of feature map channels, which is set to 64 in the experiments. The activation function layer of the first 3×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document}3 convolution is removed. b the residual block
Simple Pyramid Pooling Module. Avgpool and Resize represent global averaging pooling and upsampling, respectively, where upsampling uses a simple bilinear interpolation. Add indicates the add operation

+6

Bicrack: a bilateral network for real-time crack detection
  • Article
  • Publisher preview available

November 2024

·

19 Reads

International Journal of Machine Learning and Cybernetics

Sailei Wang

·

Rongsheng Lu

·

Bingtao Hu

·

[...]

·

Mingtao Fang

Crack detection is an important task to ensure structural safety. Traditional manual detection is extremely time-consuming and labor-intensive. However, existing deep learning-based methods also commonly suffer from low inference speed and continuous crack interruption. To solve the above problems, a novel bilateral crack detection network (BiCrack) is proposed for real-time crack detection tasks. Specifically, the network fuses two feature branches to achieve the best trade-off between accuracy and speed. A detail branch with a shallow convolutional layer is first designed. It preserves crack detail to the maximum and generates high-resolution features. Meanwhile, the semantic branch with fast-downsampling strategy is used to obtain enough high-level semantic information. Then, a simple pyramid pooling module (SPPM) is proposed to aggregate multi-scale context information with low computational cost. In addition, to enhance feature representation, an attention-based feature fusion module (FFM) is introduced, which uses space and channel attention to generate weights, and then fuses input fusion features with weights. To demonstrate the effectiveness of the proposed method, it was evaluated on 5 challenging datasets and compared with state-of-the-art crack detection methods. Extensive experiments show that BiCrack achieves the best performance in the crack detection task compared to other methods.

View access options


YOLOv8-FCS: A more focused YOLOv8 model for defect detection in images of steel surface

May 2024

·

128 Reads

Defect detection in steel surface is crucial for engineering quality control. Traditional methods for detecting surface defects on steel materials have issues such as low detection accuracy, slow speed, low level of intelligence, and insufficient utilization of images. In response to these challenges, this paper proposes an improved YOLOv8 model for efficient and accurate detection of defects on steel surface. Firstly, we introduce a single-channel adversarial input strategy (AIS) to enhance the utilization of single-channel images and improve the network's detection effectiveness. Secondly, we utilize various attention modules to enhance the Neck and detection head of the network, thereby further improving the network's expressive power and detection performance. Finally, experiments were conducted on three open datasets, achieving a mAP (mean average precision) of 77.3% on the NEU-DET dataset, outperforming YOLOv8 at 74.1%, a mAP of 65.5% on the GC10 dataset, outperforming YOLOv8 at 64.0%, and a mAP of 73.8% on the Magnetic-tile-defect-datasets, outperforming YOLOv8 at 71.2%. Additionally, the average detection speed of this model is 93 frames per second, effectively balancing detection accuracy and efficiency.


Citations (1)


... YOLOv8, an advanced object detection algorithm, effectively strikes a balance between detection accuracy and speed (Wan et al., 2024;Zhu et al., 2024). Consequently, in this study, we selected and enhanced the YOLOv8n model to develop a new network architecture termed ELM-YOLOv8n.The YOLOv8 architecture comprises four components: Input, Backbone, Neck, and Head. ...

Reference:

An enhanced lightweight model for apple leaf disease detection in complex orchard environments
YOLO-MIF: Improved YOLOv8 with Multi-Information fusion for object detection in Gray-Scale images
  • Citing Article
  • July 2024

Advanced Engineering Informatics