January 2025
·
3 Reads
IEEE Transactions on Instrumentation and Measurement
Colorectal cancer is a multifaceted disease, but it can be effectively prevented through colonoscopy for the detection of polyps. In clinical practice, the development of automatic polyp segmentation techniques for colonoscopy images can significantly enhance the efficiency and accuracy of polyp detection, and help clinicians to precisely localize the polyps. However, existing segmentation methods have several obvious limitations: (1) inadequate utilization of multi-level features extracted by feature encoders, (2) ineffective aggregation of high-level and low-level features, and (3) unclear delineation of polyp boundaries. To address these challenges while enhancing the clarity of polyp boundaries in segmentation, we propose a novel Multi-layer Information Fusion and Optimization Network (MIFONet) consisting of the following components: (1) Contextual and Fine Feature Processing (CFFP) module, employed to effectively extract both local and global contextual information, (2) Hierarchical Feature Integration Module (HFIM), added to facilitate efficient aggregation of processed high-level and low-level features and strengthen the association between contextual features, (3) Multi-Scale Contextual Attention (MSCA) module, used to deeply integrate aggregated high-level features with low-level features, and (4) a novel refinement module composed of an Adaptive Channel Attention Pyramid (ACAP) part and a Skip-Reverse Attention (SRA) part, with the ability of capturing fine-grained information and refining feature representation. We conducted extensive experiments and comparative analysis of our proposed model with 19 popular or state-of-the-art (SOTA) methods on five renowned polyp benchmark datasets. To further validate the model’s generalization performance, we also designed three cross-dataset experiments. Experimental results demonstrate that MIFONet consistently achieves excellent segmentation performance across most datasets. Especially, we achieve 94.6% mean Dice on CVC-ClinicDB dataset which obtains the superior performance compared with SOTA methods.