Kao Zhang (张考)
School of Artificial Intelligence/School of Future Technology (人工智能学院)
Nanjing University of Information Science and Technology (南京信息工程大学)
Address: Room A1911, Linjiang Building, No.219, Ningliu Road, Nanjing, China
Email : kaozhang@nuist.edu.cn; zhangkao@whu.edu.cn
GitHub : https://github.com/zhangkao

About Me (中文) (CV) (CV_ZH) (Google scholar)

Kao Zhang received his Ph.D at Lab. of Intelligent Information Processing (IIP) from Wuhan University, Wuhan, China, in 2020, under the supervision of Prof.Zhenzhong Chen. Formerly, he finished the B.Eng. and M.Eng degrees at Computer Vision & Remote Sensing Lab (CVRS) in 2014 and 2016 respectively, under the guidance of Prof.Jian Yao. He was a postdoctoral fellow at Wuhan University, working on visual saliency pridection, a researcher at Tencent, Shenzhen, working on video processing, and a visiting student at PERCEPT team of INRIA, Rennes, France working on UAV video saliency prediction. His current research interests include visual attention, image/video processing, remote sensing and metaverse.

We are looking for self-motivated undergraduate/graduate students. If you are interested in joining us, please feel free to contact me with your CV! [2025级硕士研究生名额若干!也欢迎感兴趣的本科生加入!]

News

  • 2024.05: One paper is accepted by Neurocomputing.
  • 2024.02: One paper is accepted by JVCI.
  • 2023.10: One paper is accepted by JVCI.
  • 2022.10: One paper is accepted by ISPRS JPRS.
  • 2022.10: One paper is accepted by IEEE VICP.
  • 2022.09: Supported by NSFC.
  • 2021.08: Supported by Postdoctoral Innovative Research Position Funding of Hubei Province of China.
  • 2021.06: Supported by China Postdoctoral Science Foundation.
  • 2020.10: One paper is accepted by IEEE TIP.

Research Interest

  • Visual attention: Video/Image/RGBD/VR/UAV saliency prediction.
  • Remote sensing video analysis: Object detection, tracking and recognition in satellite/UAV videos.
  • Metaverse: Multimodal (text, image, video, and sound) Emotion analysis, Virtual Reality technology.

Education and Experience

  • 2010.09-2014.06, School of Remote Sensing and Information Engineering, WHU, B.E.
  • 2014.09-2016.06, School of Remote Sensing and Information Engineering, WHU, M.E.
  • 2016.09-2020.06, School of Remote Sensing and Information Engineering, WHU, Ph.D.
  • 2020.07-2020.12, Intelligent Media Team, Media Lab, CSIG, Tencent, Visiting Researcher.
  • 2020.12-2023.02, School of Remote Sensing and Information Engineering, WHU, Postdoc.
  • 2023.03- Now     , School of Artificial Intelligence, NUIST, Lecturer.

Selected Publications

Journals

  • Kao Zhang, Zhenzhong Chen, Songnan Li, Shan Liu. An Efficient Saliency Prediction Model for Unmanned Aerial Vehicle Video. ISPRS Journal of Photogrammetry and Remote Sensing, vol. 194, pp. 152-166, 2022. [PDF] [Code]
  • Kao Zhang, Zhenzhong Chen, Shan Liu. A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction. IEEE Transactions on Image Processing (TIP), vol. 30, pp. 572-587, 2021. [PDF] [Code]
  • Kao Zhang, Zhenzhong Chen. Video Saliency Prediction Based on Spatial-Temporal Two-Stream Network. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 29, no. 12, pp. 3544-3557, 2019. [PDF] [Code]
  • Di Liu, Kao Zhang, Zhenzhong Chen. Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection. IEEE Transactions on Multimedia (TMM), vol. 23, pp. 967-981, 2021. [PDF]
  • Hao Cai*, Kao Zhang*, Zhao Chen, Chenxi Jiang, Zhenzhong Chen. Video saliency prediction for first-person view UAV videos: Dataset and benchmark. Neurocomputing, 2024: 127876. (co-first author) [PDF]
  • Zhao Chen*, Kao Zhang*, Hao Cai, Xiaoying Ding, Chenxi Jiang, Zhenzhong Chen. Audio-visual saliency prediction for movie viewing in immersive environments: Dataset and benchmarks. Journal of Visual Communication and Image Representation (JVCI), 2024:104095. (co-first author) [PDF]
  • Yang Li*, Kao Zhang*, Zhao Chen, Wanping Ouyang, Mingpeng Cui, Chenxi Jiang, Daiqin Yang and Zhenzhong Chen. Towards Object Tracking for Quadruped Robots. Journal of Visual Communication and Image Representation (JVCI), 2023, 97: 103958. (co-first author) [PDF]
  • Jing Ling, Kao Zhang, Yingxue Zhang, Daiqin Yang, Zhenzhong Chen. A saliency prediction model on 360 degree images using color dictionary based sparse representation. Signal Processing: Image Communication (SPIC), vol. 69, pp. 60-68, 2017.
  • Zhaopeng Hu, Daiqin Yang, Kao Zhang, Zhenzhong Chen. Object Tracking in Satellite Videos Based on Convolutional Regression Network with Appearance and Motion Features. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (JSTARS), vol. 13, no. 12, pp. 783-793, 2020.

Conferences

  • Kao Zhang, Yan Shang, Songnan Li, Shan Liu, Zhenzhong Chen. SalCrop: Spatio-temporal Saliency Based Video Cropping. in Proc. IEEE International Conference on Visual Communications and Image Processing (VCIP), 2022, (Demo, Oral, Poster).
  • Di Liu, Yaosi Hu, Kao Zhang, Zhenzhong Chen. Two-stream refinement network for RGB-D saliency detection. in Proc. IEEE International Conference on Image Processing (ICIP), 2019, pp. 3925-3929.
  • Xixi Li, Di Liu, Kao Zhang, Zhenzhong Chen. Layout-Driven Top-Down Saliency Detection for Webpage. in Proc. Pacific Rim Conference on Multimedia (PCM), 2017. 438-446.
  • Ruiqian Zhang, Jian Yao, Kao Zhang, Chen Feng and Jiadong Zhang. S-CNN-Based Ship Detection from High-Resolution Remote Sensing Images. in Proc. ISPRS-Int. Arch. Photogram. Remote Sens. Spatial Inf. Sci. (ISPRS), 2016, pp. 423-430. (Best Poster Award).
  • Yuan Liu, Kao Zhang, Jian Yao, Tong He, Yahui Liu, and Jinge Tu. An Efficient Method for Text Detection from Indoor Panorama Images Using Extremal Regions. in Proc. IEEE International Conference on Information and Automation (ICIA), 2015, (Oral).
  • Tong He, Jian Yao, Kao Zhang, Yaolin Hou, Shiyao Han. Accurate Multi-Scale License Plate Localization Via Image Saliency. in Proc. IEEE Conference on Intelligent Transportation Systems (ITSC), 2014, pp. 1567-1572, (Oral).

Patent

  • Zhenzhong Chen, Zhao Chen, Kao Zhang. A video saliency prediction method and system based on audio and video features. CN202310247030.1, 2023.
  • Zhenzhong Chen, Yang Li, Kao Zhang. An object tracking method for quadruped robots based on siamese network. CN202310399358.5, 2023.
  • Kao Zhang, Songnan Li. Image cropping methods, devices, computer equipment and storage media. CN202011644040.1, 2021.
  • Jian Yao, Kao Zhang, Tong He, and Sa Zhu. Accurate Multi-Scale License Plate Localization Based on Affine Rectification. CN201410077985.8, 2014.

Software Copyright

  • Jian Yao, Kao Zhang, Tong He, et al. Panorama Post-Processing Software. 2015R11S199708, 2015.

Selected Awards

  • 2021, Second-Class Prize of Graduate Academic Innovation Award, Wuhan University.
  • 2018, Grand Winner Prize on Images in ICME2018 Grand Challenge (GC) – Salient360!.
  • 2018, 1st place on track: Prediction of Head Saliency for Images in ICME2018 GC–Salient360!.
  • 2018, 1st place on track: Prediction of Head+Eye Saliency for Videos in ICME2018 GC–Salient360!.
  • 2017, Best Head Movement Prediction Student Prize in ICME2017 GC–Salient360!.
  • 2014, Second-Class Prize of the National Graduate Contest on Smart-City Technology and Creative Design, Video Challenge--Face Detection Section.
  • 2014, Excellent Bachelor’s Degree Thesis of Hubei Province.
  • 2014-2016, Excellent Graduate Students of Wuhan University.
  • 2010-2012, Excellent Undergraduate Students of Wuhan University.
  • 2014-2016, First Class Scholarship of Wuhan University.

Funding

  • 2024, 南京信息工程大学人才科研启动项目, 多源无人机视频显著性检测, 主持, 2024.1-2026.10
  • 2023, 国家社会科学基金青年项目, 增强网络意识形态风险防范的战略主动及其能力研究, 参与, 2024.1-2026.12
  • 2023, 国家级人工智能现代产业学院“产教融合型”教材(新编)揭榜挂帅项目, 虚拟现实技术, 主持, 2023.10-2025.8
  • 2023, 横向研究课题, 虚拟现实图像目标检测与场景理解技术研究, 主持, 2023.6-2024.5
  • 2022, 国家自然科学基金青年项目, 基于弱监督学习的高效视频显著性预测方法研究, 主持, 2023.1-2025.12
  • 2021, 中国博士后科学基金会面上项目, 遥感视频显著性预测关键技术研究, 主持, 2021.7-2023.2
  • 2021, 湖北省博士后创新研究岗位项目, 无人机视频显著性预测关键技术研究, 主持, 2021.7-2023.2
  • 2021, 测绘遥感信息工程国家重点实验室探索类课题, 遥感视频显著性目标检测, 主持, 2021.1-2021.12
  • 2018, 国家重点研发计划项目课题, 公共安全立体化协同监测关键技术, 参与, 2018.5-2021.4
  • 2017, 国家重点研发计划项目课题, 融合多通道语境信息的类人智能感知机制与方法, 项目骨干, 2017.10-2021.9

Teaching

  • Virtual Reality Technology, Digital Twin Technology, Digital Image Processing
  • Neural Network and Deep Learning, Introduction to Artificial Intelligence

Services

Journal Reviewer:

  • IEEE Transactions on Image Processing (TIP)
  • IEEE Transactions on Multimedia (TMM)
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  • IEEE Transactions on Geoscience and Remote Sensing (TGRS)
  • IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (JSTARS)
  • IEEE Geoscience and Remote Sensing Letters (GRSL)
  • International Journal of Applied Earth Observation and Geoinformation (JAG)

Conference Reviewer:

  • IEEE International Conference on Image Processing (ICIP)
  • IEEE International Conference on Multimedia and Expo (ICME)
  • IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Other:

  • 2023.5-2028.5, Metaverse Technology and Application Innovation Platform of China, CIUR, Deputy secretary-general.
  • 2022.9-2025.9, Integrated Research Platform for Aerospace Information Intelligent Services, Ministry of Education of China, Member.
  • 2023, International Conference on Graphics and Image Processing (ICGIP), Publicity Co-chairs.