Research Progress on Urban Built-up Area Identification in High-Resolution Remote Sensing Images Using Deep Learning

Chujun Li; Md Gapar Md Johar

doi:10.63313/JCSFT.9003

Authors

Chujun Li School of Engineering, Xi'an International University, Xi'an 710077, China Author
Md Gapar Md Johar Software Engineering and Digital Innovation Centre, Management and Science University, Shah Alam, Selangor, 40100, Malaysia Author

DOI:

https://doi.org/10.63313/JCSFT.9003

Keywords:

Deep Learning, High-resolution Remote Sensing Image, Urban Built-up Area, Semantic Segmentation, Image Recognition Style

Abstract

Against the backdrop of rapid urbanization, the urban built-up area, as the core component of urban development, requires accurate identification for urban planning, land use monitoring, and sustainable development assessment. How-ever, the complex internal features, significant scale variations, and blurred boundaries within high-resolution remote sensing images often lead to insuffi-cient accuracy with traditional identification methods. Deep learning, leverag-ing its powerful capabilities for automatic feature learning and semantic extrac-tion, has become the core technology for identifying urban built-up areas in high-resolution remote sensing images. This paper provides a comprehensive review of research on high-resolution remote sensing image-based urban built-up area identification from a deep learning perspective: Firstly, it clarifies the connotation of urban built-up areas and summarizes their identification characteristics and challenges. Secondly, it categorizes the commonly used mul-ti-source data types for identification, including high-resolution remote sensing imagery, multispectral data, and LiDAR data. Thirdly, it summarizes methodo-logical advancements in built-up area identification using deep learning-based semantic segmentation models (e.g., CNN, FCN, U-Net, Transformer). Finally, it analyzes the current problems and challenges in the research and proposes prospects and suggestions for future development directions.

References

[1] Feng, Q., Chen, B., Li, G., et al. (2022). A review of research on remote sensing image sample datasets. Journal of Remote Sensing, 26(4), 589–605.

[2] Pei, H., Owari, T., Tsuyuki, S., & Zhong, Y. (2023). Application of a Novel Multiscale Global Graph Convolutional Neural Network to Improve the Accuracy of Forest Type Classifica-tion Using Aerial Photographs. Remote Sensing, 15(4), 1001.

[3] Shao, Z., Tang, P., Wang, Z., Saleem, N., Yam, S., & Sommai, C. (2020). BRRNet: A Fully Con-volutional Neural Network for Automatic Building Extraction From High-Resolution Re-mote Sensing Images. Remote Sensing, 12(6), 1050.

[4] Wang, Y., Yang, L., Liu, X., & Yan, P.. (2024). An improved semantic segmentation algorithm for high-resolution remote sensing images based on deeplabv3. Scientific Reports, 14(1).

[5] Ismael, S. F., Kayabol, K., & Aptoula, E.. (2024). Unsupervised domain adaptation for the semantic segmentation of remote sensing images via a class-aware fourier transform and a fine-grained discriminator. Digital Signal Processing, 151, 104551.

[6] Jia, P., Chen, C., & Zhang, Y. Z. L.. (2024). Semantic segmentation of deep learning remote sensing images based on band combination principle: application in urban planning and land use. Computer communications, 217(Mar.), 97-106.

[7] Feng, Q., Chen, B., Li, G., et al. (2022). A review for sample datasets of remote sensing im-agery. National Remote Sensing Bulletin. 26(4), 589–605.

[8] Wang, B., & Wang, Y. (2022). Research on the current status of remote sensing image land category information acquisition technology based on deep learning. Journal of Land and Resources, 19(4), 74–80.

[9] Zhang, W., Liu, W., Sun, X., et al. (2022). High resolution remote sensing image semantic segmentation using multi-source feature adaptive fusion network. Chinese Journal of Im-age and Graphics, 27(8), 2516–2526.

[10] Lou, M., Zhang, S., Zhou, H. Y., Yang, S., Wu, C., & Yu, Y. (2025). TransXNet: learning both global and local dynamics with a dual dynamic token mixer for visual recognition. IEEE Transactions on Neural Networks and Learning Systems.

[11] Pan, X., Jiao, C., Yang, B., Zhu, H., & Wu, J. (2025). Attribute-guided feature fusion network with knowledge-inspired attention mechanism for multi-source remote sensing classifica-tion. Neural Networks, 187, 107332.

[12] Sanchez, J., Deschaud, J. E., & Goulette, F.. (2023). Cola: coarse-label multi-source lidar se-mantic segmentation for autonomous driving. IEEE Transactions on Robotics, vol. 41, pp. 1742-1754, 2025.

[13] Honglin Jiang, Hesham Elmasry, Sangwon Lim, and Karim El-Basyouny. 2025. Utilizing deep learning models and LiDAR data for automated semantic segmentation of infrastruc-ture on multilane rural highways. Canadian Journal of Civil Engineering. 52(8): 1523-1543.

[14] Zhao, W., Du, S., Wang, Q., & Emery, W. J. (2017). Contextually guided very-high-resolution imagery classification with semantic segments. ISPRS journal of Photogrammetry and Remote Sensing, 132, 48-60.

[15] Kaiser, P., Wegner, J. D., Lucchi, A., Jaggi, M., Hofmann, T., & Schindler, K. (2017). Learning aerial image segmentation from online maps. IEEE Transactions on Geoscience and Re-mote Sensing, 55(11), 6054-6068.

[16] Wang, L., Qiao, Y., Tang, X., & Van Gool, L. (2016). Actionness estimation using hybrid fully convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2708-2717).

[17] Dai, J., Li, Y., He, K., & Sun, J. (2016). R-fcn: Object detection via region-based fully convolu-tional networks. Advances in neural information processing systems, 29.

[18] Qin, S., Wang, Q., Zeng, Q., Ye, M., Fu, A., & Chen, G. (2025). Automatic recognition of debris rock lithology based on unsupervised semantic segmentation. Computers & Geosciences, 196, 105790.

[19] Ding, T., & Wang, X. (2024). DEDU-Net: Dual-Encoder-Decoder-U-Net for road extraction from high-resolution remote sensing images. International Journal of Remote Sensing, 45(10), 3231-3247.

[20] Fang, W., Fu, Y., & Sheng, V. S. (2024). FPS-U2Net: Combining U2Net and multi-level ag-gregation architecture for fire point segmentation in remote sensing images. Computers & Geosciences, 189, 105628.

[21] Deng, J., & Yang, B. (2024). Hyperspectral and multispectral image fusion via residual se-lective kernel attention-based U-net. International Journal of Remote Sensing, 45(5), 1699-1726.

[22] Wang, J., Cheng, L., Zheng, Y., Cui, H., & Wang, Y. (2025). Extraction of Rural Buildings with Different Main Structure Types Based on a Revised U-Net Model. Journal of Computing in Civil Engineering, 39(2), 04024058.

[23] Li, Y., Chen, S., Xu, F., Tang, H., & Xie, W. (2023). AOHSR: attention mechanism-based opti-cal image-guided hyperspectral image super-resolution. International Journal of Remote Sensing, 44(13), 3939-3971.

[24] Tienin, B. W., Cui, G., Ukwuoma, C. C., Talla Nana, Y. A., Mba Esidang, R., & Moniz Moreira, E. Z. (2024). MS3Net: a deep ensemble learning approach for ship classification in heteroge-neous remote sensing data. International Journal of Remote Sensing, 45(3), 748-771.

[25] Xiao, H., Chen, X., Luo, L., & Lin, C. (2025). A dual-path feature reuse multi-scale network for remote sensing image super-resolution. The Journal of Supercomputing, 81(1), 17.

[26] Wang, S. B., Gao, Z. M., Jin, D. H., Gong, S. M., Peng, G. L., & Yang, Z. J. (2024). AMEA-YOLO: a lightweight remote sensing vehicle detection algorithm based on attention mechanism and efficient architecture. The Journal of Supercomputing, 80(8), 11241-11260.

[27] Fayaz, M., Dang, L. M., & Moon, H. (2025). DLAN: a dual attention network for effective land cover classification in remote sensing. Knowledge-Based Systems, 113620.

[28] Wang, L., Li, R., Duan, C., Zhang, C., Meng, X., & Fang, S. (2022). A novel transformer based semantic segmentation scheme for fine-resolution remote sensing images. IEEE Geosci-ence and Remote Sensing Letters, 19, 1-5.

[29] He, X., Zhou, Y., Zhao, J., Zhang, D., Yao, R., & Xue, Y. (2022). Swin transformer embedding UNet for remote sensing image semantic segmentation. IEEE transactions on geoscience and remote sensing, 60, 1-15.

[30] Nan, G., Li, H., Du, H., Liu, Z., Wang, M., & Xu, S. (2024). A semantic segmentation method based on AS-Unet++ for power remote sensing of images. Sensors, 24(1), 269.

[31] Zhou, T., He, H., Wang, Y., &Liao, Y.. (2025). Improved gated recurrent units together with fusion for semantic segmentation of remote sensing images based on parallel hybrid net-work. Multimedia Systems, 31(2).

[32] Wang, Y., Yang, L., Liu, X., & Yan, P. (2024). An improved semantic segmentation algorithm for high-resolution remote sensing images based on DeepLabv3+. Scientific reports, 14(1), 9716.

[33] Wu, Y., Zhang, X., Zhao, X., Sun, Y., & Li, T. (2025). segWCD: A new segmentation-based weak supervision neural network for building change detection. Applied Intelligence, 55(2), 147.

[34] Li, L., Lu, N., & Qin, J. (2025). Joint-task learning framework with scale adaptive and posi-tion guidance modules for improved household rooftop photovoltaic segmentation in re-mote sensing image. Applied Energy, 377, 124521.

Research Progress on Urban Built-up Area Identification in High-Resolution Remote Sensing Images Using Deep Learning

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

INDEXING & ABSTRACTING