Research Progress on Urban Built-up Area Identification in High-Resolution Remote Sensing Images Using Deep Learning
DOI:
https://doi.org/10.63313/JCSFT.9003Keywords:
Deep Learning, High-resolution Remote Sensing Image, Urban Built-up Area, Semantic Segmentation, Image Recognition StyleAbstract
Against the backdrop of rapid urbanization, the urban built-up area, as the core component of urban development, requires accurate identification for urban planning, land use monitoring, and sustainable development assessment. How-ever, the complex internal features, significant scale variations, and blurred boundaries within high-resolution remote sensing images often lead to insuffi-cient accuracy with traditional identification methods. Deep learning, leverag-ing its powerful capabilities for automatic feature learning and semantic extrac-tion, has become the core technology for identifying urban built-up areas in high-resolution remote sensing images. This paper provides a comprehensive review of research on high-resolution remote sensing image-based urban built-up area identification from a deep learning perspective: Firstly, it clarifies the connotation of urban built-up areas and summarizes their identification characteristics and challenges. Secondly, it categorizes the commonly used mul-ti-source data types for identification, including high-resolution remote sensing imagery, multispectral data, and LiDAR data. Thirdly, it summarizes methodo-logical advancements in built-up area identification using deep learning-based semantic segmentation models (e.g., CNN, FCN, U-Net, Transformer). Finally, it analyzes the current problems and challenges in the research and proposes prospects and suggestions for future development directions.
References
[1] Feng, Q., Chen, B., Li, G., et al. (2022). A review of research on remote sensing image sample datasets. Journal of Remote Sensing, 26(4), 589–605.
[2] Pei, H., Owari, T., Tsuyuki, S., & Zhong, Y. (2023). Application of a Novel Multiscale Global Graph Convolutional Neural Network to Improve the Accuracy of Forest Type Classifica-tion Using Aerial Photographs. Remote Sensing, 15(4), 1001.
[3] Shao, Z., Tang, P., Wang, Z., Saleem, N., Yam, S., & Sommai, C. (2020). BRRNet: A Fully Con-volutional Neural Network for Automatic Building Extraction From High-Resolution Re-mote Sensing Images. Remote Sensing, 12(6), 1050.
[4] Wang, Y., Yang, L., Liu, X., & Yan, P.. (2024). An improved semantic segmentation algorithm for high-resolution remote sensing images based on deeplabv3. Scientific Reports, 14(1).
[5] Ismael, S. F., Kayabol, K., & Aptoula, E.. (2024). Unsupervised domain adaptation for the semantic segmentation of remote sensing images via a class-aware fourier transform and a fine-grained discriminator. Digital Signal Processing, 151, 104551.
[6] Jia, P., Chen, C., & Zhang, Y. Z. L.. (2024). Semantic segmentation of deep learning remote sensing images based on band combination principle: application in urban planning and land use. Computer communications, 217(Mar.), 97-106.
[7] Feng, Q., Chen, B., Li, G., et al. (2022). A review for sample datasets of remote sensing im-agery. National Remote Sensing Bulletin. 26(4), 589–605.
[8] Wang, B., & Wang, Y. (2022). Research on the current status of remote sensing image land category information acquisition technology based on deep learning. Journal of Land and Resources, 19(4), 74–80.
[9] Zhang, W., Liu, W., Sun, X., et al. (2022). High resolution remote sensing image semantic segmentation using multi-source feature adaptive fusion network. Chinese Journal of Im-age and Graphics, 27(8), 2516–2526.
[10] Lou, M., Zhang, S., Zhou, H. Y., Yang, S., Wu, C., & Yu, Y. (2025). TransXNet: learning both global and local dynamics with a dual dynamic token mixer for visual recognition. IEEE Transactions on Neural Networks and Learning Systems.
[11] Pan, X., Jiao, C., Yang, B., Zhu, H., & Wu, J. (2025). Attribute-guided feature fusion network with knowledge-inspired attention mechanism for multi-source remote sensing classifica-tion. Neural Networks, 187, 107332.
[12] Sanchez, J., Deschaud, J. E., & Goulette, F.. (2023). Cola: coarse-label multi-source lidar se-mantic segmentation for autonomous driving. IEEE Transactions on Robotics, vol. 41, pp. 1742-1754, 2025.
[13] Honglin Jiang, Hesham Elmasry, Sangwon Lim, and Karim El-Basyouny. 2025. Utilizing deep learning models and LiDAR data for automated semantic segmentation of infrastruc-ture on multilane rural highways. Canadian Journal of Civil Engineering. 52(8): 1523-1543.
[14] Zhao, W., Du, S., Wang, Q., & Emery, W. J. (2017). Contextually guided very-high-resolution imagery classification with semantic segments. ISPRS journal of Photogrammetry and Remote Sensing, 132, 48-60.
[15] Kaiser, P., Wegner, J. D., Lucchi, A., Jaggi, M., Hofmann, T., & Schindler, K. (2017). Learning aerial image segmentation from online maps. IEEE Transactions on Geoscience and Re-mote Sensing, 55(11), 6054-6068.
[16] Wang, L., Qiao, Y., Tang, X., & Van Gool, L. (2016). Actionness estimation using hybrid fully convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2708-2717).
[17] Dai, J., Li, Y., He, K., & Sun, J. (2016). R-fcn: Object detection via region-based fully convolu-tional networks. Advances in neural information processing systems, 29.
[18] Qin, S., Wang, Q., Zeng, Q., Ye, M., Fu, A., & Chen, G. (2025). Automatic recognition of debris rock lithology based on unsupervised semantic segmentation. Computers & Geosciences, 196, 105790.
[19] Ding, T., & Wang, X. (2024). DEDU-Net: Dual-Encoder-Decoder-U-Net for road extraction from high-resolution remote sensing images. International Journal of Remote Sensing, 45(10), 3231-3247.
[20] Fang, W., Fu, Y., & Sheng, V. S. (2024). FPS-U2Net: Combining U2Net and multi-level ag-gregation architecture for fire point segmentation in remote sensing images. Computers & Geosciences, 189, 105628.
[21] Deng, J., & Yang, B. (2024). Hyperspectral and multispectral image fusion via residual se-lective kernel attention-based U-net. International Journal of Remote Sensing, 45(5), 1699-1726.
[22] Wang, J., Cheng, L., Zheng, Y., Cui, H., & Wang, Y. (2025). Extraction of Rural Buildings with Different Main Structure Types Based on a Revised U-Net Model. Journal of Computing in Civil Engineering, 39(2), 04024058.
[23] Li, Y., Chen, S., Xu, F., Tang, H., & Xie, W. (2023). AOHSR: attention mechanism-based opti-cal image-guided hyperspectral image super-resolution. International Journal of Remote Sensing, 44(13), 3939-3971.
[24] Tienin, B. W., Cui, G., Ukwuoma, C. C., Talla Nana, Y. A., Mba Esidang, R., & Moniz Moreira, E. Z. (2024). MS3Net: a deep ensemble learning approach for ship classification in heteroge-neous remote sensing data. International Journal of Remote Sensing, 45(3), 748-771.
[25] Xiao, H., Chen, X., Luo, L., & Lin, C. (2025). A dual-path feature reuse multi-scale network for remote sensing image super-resolution. The Journal of Supercomputing, 81(1), 17.
[26] Wang, S. B., Gao, Z. M., Jin, D. H., Gong, S. M., Peng, G. L., & Yang, Z. J. (2024). AMEA-YOLO: a lightweight remote sensing vehicle detection algorithm based on attention mechanism and efficient architecture. The Journal of Supercomputing, 80(8), 11241-11260.
[27] Fayaz, M., Dang, L. M., & Moon, H. (2025). DLAN: a dual attention network for effective land cover classification in remote sensing. Knowledge-Based Systems, 113620.
[28] Wang, L., Li, R., Duan, C., Zhang, C., Meng, X., & Fang, S. (2022). A novel transformer based semantic segmentation scheme for fine-resolution remote sensing images. IEEE Geosci-ence and Remote Sensing Letters, 19, 1-5.
[29] He, X., Zhou, Y., Zhao, J., Zhang, D., Yao, R., & Xue, Y. (2022). Swin transformer embedding UNet for remote sensing image semantic segmentation. IEEE transactions on geoscience and remote sensing, 60, 1-15.
[30] Nan, G., Li, H., Du, H., Liu, Z., Wang, M., & Xu, S. (2024). A semantic segmentation method based on AS-Unet++ for power remote sensing of images. Sensors, 24(1), 269.
[31] Zhou, T., He, H., Wang, Y., &Liao, Y.. (2025). Improved gated recurrent units together with fusion for semantic segmentation of remote sensing images based on parallel hybrid net-work. Multimedia Systems, 31(2).
[32] Wang, Y., Yang, L., Liu, X., & Yan, P. (2024). An improved semantic segmentation algorithm for high-resolution remote sensing images based on DeepLabv3+. Scientific reports, 14(1), 9716.
[33] Wu, Y., Zhang, X., Zhao, X., Sun, Y., & Li, T. (2025). segWCD: A new segmentation-based weak supervision neural network for building change detection. Applied Intelligence, 55(2), 147.
[34] Li, L., Lu, N., & Qin, J. (2025). Joint-task learning framework with scale adaptive and posi-tion guidance modules for improved household rooftop photovoltaic segmentation in re-mote sensing image. Applied Energy, 377, 124521.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 by author(s) and Erytis Publishing Limited.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.