Academic Journal of Computing & Information Science, 2023, 6(5); doi: 10.25236/AJCIS.2023.060504.
Dong Wenkuan, Gong Shicai
School of Science, Zhejiang University of Science and Technology, Hangzhou, Zhejiang, 310000, China
Traditional semantic segmentation models lose accuracy on foggy images and have difficulty segmenting heavy-fog regions. To address this, we propose an improved segmentation model that combines image levels adjustment with attention mechanisms. First, the fog density of the image is estimated using an atmospheric scattering model, and the image levels are adjusted according to the estimated density to reveal information that was originally obscured by the fog. A dual-branch input (DB Input) is then constructed for the heavy-fog and light-fog areas, which strengthens the model's feature learning without destroying the original information in the foggy image. An RCCA module, a spatial-domain attention mechanism, is introduced at the end of the dual-branch input to enhance the region attention ability of the model in each branch. Experiments are conducted on the Foggy Cityscapes and Foggy UAVid datasets. The improved model achieves mIoU scores of 70.6% and 66.7%, respectively, improvements of 5.2% and 5.4% over the original model, indicating better segmentation results.
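To make the first step concrete, the following is a minimal sketch of how a fog density estimate can drive a levels adjustment. It is not the implementation from the paper: it assumes the standard atmospheric scattering model I = J * t + A * (1 - t), a pixel-wise dark-channel estimate of the density (1 - t), a single global density value, and a known airlight A. The function names `add_fog`, `estimate_density`, and `levels_adjust` are hypothetical and introduced only for illustration.

```python
import numpy as np


def add_fog(clear, t, airlight=1.0):
    """Atmospheric scattering model: I = J * t + A * (1 - t)."""
    return clear * t + airlight * (1.0 - t)


def estimate_density(foggy, airlight=1.0):
    """Global fog density (1 - t) from a pixel-wise dark channel.

    Assumption: every clear pixel has at least one near-zero colour
    channel, so min_c(I / A) is approximately 1 - t.
    """
    dark = (foggy / airlight).min(axis=-1)  # minimum over colour channels
    return float(np.clip(dark, 0.0, 1.0).mean())


def levels_adjust(foggy, density, airlight=1.0, gamma=1.0):
    """Levels adjustment whose black and white points follow the density.

    The black point is the airlight contribution A * (1 - t) and the
    input range has width t, so the stretch inverts the scattering model.
    """
    black = airlight * density
    span = max(1.0 - density, 1e-6)  # transmission t, guarded against zero
    stretched = np.clip((foggy - black) / span, 0.0, 1.0)
    return stretched ** (1.0 / gamma)


# Synthetic check: fog a known image, then undo the fog with levels.
rng = np.random.default_rng(0)
clear = rng.random((4, 4, 3))
clear[..., 2] = 0.0  # guarantee a zero dark channel for the toy image
foggy = add_fog(clear, t=0.4, airlight=0.9)
density = estimate_density(foggy, airlight=0.9)
restored = levels_adjust(foggy, density, airlight=0.9)
```

On this synthetic input the estimated density equals the true value 1 - t, and the adjusted image matches the clear one. Real foggy images have spatially varying density and unknown airlight, which is presumably why the model treats heavy-fog and light-fog areas in separate branches.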
fog density estimate; levels adaptive adjustment; attention mechanism; semantic segmentation; hazy images
Dong Wenkuan, Gong Shicai. Hazy Images Segmentation Method Based on Improved DeeplabV3+. Academic Journal of Computing & Information Science (2023), Vol. 6, Issue 5: 21-29. https://doi.org/10.25236/AJCIS.2023.060504.