The proposed U-HDN architecture consists of three components: U-net architecture, multidilation module, and hierarchical feature learning module. The red dotted box presents the modified U-net; the green dotted box is a multi-dilation module; the blue dotted box shows the hierarchical feature learning module.

The proposed U-HDN architecture consists of three components: U-net architecture, multidilation module, and hierarchical feature learning module. The red dotted box presents the modified U-net; the green dotted box is a multi-dilation module; the blue dotted box shows the hierarchical feature learning module.

Source publication
Preprint
Full-text available
Inspired by the development of deep learning in computer vision and object detection, the proposed algorithm considers an encoder-decoder architecture with hierarchical feature learning and dilated convolution, named U-Hierarchical Dilated Network (U-HDN), to perform crack detection in an end-to-end method. Crack characteristics with multiple conte...

Contexts in source publication

Context 1
... by above observations, in this paper a new network called U-HDN, to fuse multi-scale features in encoder-decoder network based on U-net for crack detection is proposed. The flowchart and the proposed U-HDN architecture are shown in Figure 2 and Figure 3, and the proposed method consists of three components: U-net architecture, multi-dilation module (MDM), and hierarchical feature (HF) learning module. Firstly, an U-net is divided into encoder and decoder networks, which have the same scale at each stage. ...
Context 2
... the proposed method is designed and calculated the number of feature maps. In this paper, we employ spatial domain to calculate the feature maps, and the number of the feature maps are shown in Figure 3 (shown on the green boxes). ...
Context 3
... is shown in Figure 3, the red dotted box presents the modified U-net. Contracting path consists of two 3 × 3 convolution layers, each followed by the activation function rectified linear unit (ReLU) [70], and a 2 × 2 max pooling layers for down-sampling. ...

Similar publications

Article
Full-text available
In order to better detect objects of different scales, detectors need different resolutions and inputs from different receptive fields. Currently, advanced detectors usually combine the structure of feature pyramid to achieve the fusion of multi-scale object features. Top-down and bottom-up network structure is the basic strategy of multi-scale fea...
Article
Full-text available
Potholes on road surfaces pose a serious hazard to vehicles and passengers due to the difficulty detecting them and the short response time. Therefore, many government agencies are applying various pothole-detection algorithms for road maintenance. However, current methods based on object detection are unclear in terms of real-time detection when u...
Article
Full-text available
Attention mechanisms have been explored with CNNs across the spatial and channel dimensions. However, all the existing methods devote the attention modules to capture local interactions from a uni-scale. This paper tackles the following question: can one consolidate multi-scale aggregation while learning channel attention more efficiently? To this...
Article
Full-text available
Nowadays, an increasing number of researchers apply deep-learning-based object detection methods to implement visual defect detection in industrial manufacturing. However, large-scale variation in visual defect detection impedes the improvement of detection accuracy to be further explored. Therefore, we propose a hierarchical multi-scale block (HMS...
Article
Full-text available
Existing deep learning-based RGB-T salient object detection methods often struggle with effectively fusing RGB and thermal features. Therefore, obtaining high-quality features and fully integrating these two modalities are central research focuses. We developed an illumination prior-based coefficient predictor (MICP) to determine optimal interactio...