Publications

Email:: lutong@nju.edu.cn
Address:: National Key Laboratory for Novel Software Technology
Department of Computer Science and Technology
Xianlin Campus Mailbox 603, Nanjing University; 163 Xianlin Avenue, Qixia District; Nanjing 210023, China
Phone:: 0086-25-8968-2398
Fax:: 0086-25-8968-2398
URL:: http://cs.nju.edu.cn/lutong/

Selected International Journal Papers

1. Guangchen Shi, Yirui Wu, Wei Zhu, Shivakumara Palaiahnakote, Shirong Zou, Yixuan Wang Tong Lu. Diffusion models with spatial control and attention fusion for incremental few-shot semantic segmentation. Pattern Recognition, to appear, 2026

2. Tao Wang, Kaihao Zhang, Jiankang Deng, Tong Lu, Wei Liu, Stefanos Zafeiriou. Deep face restoration: a survey. ACM Computing Survey, to appear, 2026

3. Guo Chen, Yifei Huang, Jilan Xu, Baoqi Pei, Jiahao Wang, Zhe Chen, Zhiqi Li, Tong Lu, Limin Wang. Video mamba suite: state space model as a versatile alternative for video understanding. International Journal of Computer Vision, to appear, 2026

4. Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Qiao Yu, Jifeng Dai. BEVFormer: learning bird's -eye-view representation from multi-camera images via spatiotemporal transformers. IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear, 2025 (corresponding author, SCI, 南京大学学科卓越列表期刊)

5. Lixin Yuan, Cheng Mei, Wenhai Wang, Tong Lu. Feature selection based on intrusive outliers rather than all instances. IEEE Transactions on Image Processing, 33:809-824, 2024 (SCI)

6. Ayan Banerjee, Palaiahnakote Shivakumara, Umapada Pal, Apostolos Antonacopoulos, Tong Lu, Josep Llados Canet. TTS: Hilbert transform-based generative adversarial network for tattoo and scene text spotting. IEEE Transactions on Multimedia, to appear, 2024

7. Zhe Chen, Weiyun Wang, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang. How far are we to GPT-4v? Closing the gap to commercial multimodal models with open-source suites. SCIENCE CHINA Information Sciences, to appear, 2024 (SCI)

8. Yangzhou Liu, Yue Cao, Zhangwei Gao, Weiyun Wang, Zhe Chen, Wenhai Wang, Hao Tian, Lewei Lu, Xizhou Zhu, Tong Lu, Yu Qiao, Jifeng Dai. MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity. SCIENCE CHINA Information Sciences, to appear, 2024 (SCI)

9. Tao Wang, Kaihao Zhang, Ziqian Shao, Wenhan Luo, Bjorn Stenger, Tong Lu, Tae-Kyun Kim, Wei Liu, Hongdong Li. GridFormer: residual dense transformer with grid structure for image restoration in adverse weather conditions. International Journal of Computer Vision, to appear, 2024 (SCI)

10. Tao Wang, Guangpin Tao, Wanglong Lu, Kaihao Zhang, Wenhan Luo, Xiaoqin Zhang, Tong Lu*. Restoring vision in hazy weather with hierarchical contrastive learning. Pattern Recognition, to appear, 2024 (corresponding author, SCI)

11. Palaiahnakote Shivakumara, Maryam Asadzadeh Kaljahi, Swati Kanchan, Umapada Pal, Daniel Lopresti, Tong Lu. A robust script independent handwriting system for gender identification. Experts Systems with Applications, to appear, 2024 (corresponding author, SCI)

12. Palaiahnakote Shivakumara, Ayan Banerjee, Lokesh Nandanwar, Umapada Pal, Apostolos Antonacopoulos, Tong Lu, Michael Blumenstein. A new deep CNN for 3D text localization in the wild through shadow removal. Computer Vision and Image Understading, to appear, 2023 (corresponding author, SCI)

13. Palaiahnakote Shivakumara, Ayan Banerjee, Umapada Pal, Lokesh Nandanwar, Tong Lu, Cheng-Lin Liu. A new language Independent deep CNN for scene text detection and style transfer in social media images. IEEE Transactions on Image Processing, to appear, 2023

14. Min Yang, Guo Chen, Yindong Zheng, Tong Lu, Limin Wang. BasicTAD: an astounding RGB-only baseline for temporal action detection. Computer Vision and Image Understanding, to appear, 2023

15. Ruoze Liu, Yangjie Shen, Yang Yu, Tong Lu. Revisiting of AlphaStar. IEEE Transactions on Games, to appear, 2023

16. Wenhai Wang, Enze Xie, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu*, Chunhua Shen. PAN++: towards efficient and accurate arbitrary-shaped text spotting. IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear, 2022 (corresponding author, SCI, 南京大学学科卓越列表期刊)

17. Haoran Zhou, Honghua Chen, Yingkui Zhang, Mingqiang Wei, Haoran Xie, Jun Wang, Tong Lu*, Jing Qin, Xiao-ping Zhang. Rfine-Net: normal refinement neural network for noisy point clouds. IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear, 2022 (co-corresponding author, SCI, 南京大学学科卓越列表期刊)

18. Lixin Yuan, Guoqiang Yang, Qian Xu, Tong Lu*. Discriminative feature selection with directional outliers correcting for data classification. Pattern Recognition, to appear, 2022 (corresponding author, SCI)

19. Minglei Yuan, Chunhao Cai, Tong Lu*, Yirui Wu, Qian Xu, Shijie Zhou. A novel forget-update module for few-shot domain generation. Pattern Recognition, to appear, 2022 (corresponding author, SCI)

20. Lokesh Nandanwar, Palaiahnakote Shivakumara, Hamid A. Jalab, Rabha W. Ibrahim, Ramachandra Raghavendra, Umapada Pal, Tong Lu, Michael Blumenstein. A conformable moments-based deep learning system for forged handwriting detection. IEEE Transactions on Neural Networks and Learning Systems, to appear, 2022 (corresponding author, SCI)

21. Yanwen Lu, Wenliang Ma, Xiang Dong, Mackenzie Brown, Tong Lu*, Weidong Gan*. Differentiate Xp11.2 translocation renal cell carcinoma from computed tomography images and clinical data with ResNet-18 CNN and XGBoost. Computer Modeling in Engineering & Sciences, to appear, 2022 (corresponding author, SCI)

22. Zhiheng Huang, Palaiahnakote Shivakumara, Maryam Asadzadeh Kaljahi, Ahlad Kumar, Umapada Pal, Tong Lu*, Michael Blumenstein. Writer age estimation through handwriting. Multimedia Tools and Applications, to appear, 2022 (corresponding author, SCI)

23. Ruoze Liu, Zhenjia Pang, Zhouyu Meng, Wenhai Wang, Yang Yu, Tong Lu. On efficient reinforcement learning for full-length game of StarCraft II. Journal of Artificial Intelligence Research, to appear, 2022 (corresponding author, SCI)

24. Palaiahnakote Shivakumara, Tanmay Jain, Umapada Pal, Nitish Surana, Apostolos Antonacopoulos, Tong Lu*. Text line segmentation from struck-out handwritten document images. Experts Systems with Applications, to appear, 2022 (corresponding author, SCI)

25. Wenhai Wang, Enze Xie, Xiang Li, Deng-ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao. Improved baselines with pyramid vision transformer. Computational Visual Media, to appear, 2022 (SCI)

26. Lokesh Nandanwar, Palaiahnakote Shivakumara, Divya Krishnani, Raghavendra Ramachandra, Tong Lu, Umapada Pal, Mohan Kankanhalli. A new foreground and background based method for behavior-oriented social media images classification. ACM Transactions on Multimedia Computing Communications and Applications, to appear, 2021 (SCI)

27. Lokesh Nandanwar, Palaiahnakote Shivakumara, Raghavendra Ramachandra, Tong Lu, Umapada Pal, Apostolos Antonacopoulos, Yue Lv. A new deep wavefront based model for text localization in 3D video. IEEE Transactions on Circuits and Systems for Video Technology, to appear, 2021 (SCI)

28. Lokesh Nandanwar, Palaiahnakote Shivakumara, Divya Krishnani, Raghavendra Ramachandra, Tong Lu, Umapada Pal, Mohan Kankanhalli. An episodic learning network for text detectioin on human bodies in sports images. IEEE Transactions on Circuits and Systems for Video Technology, to appear, 2021 (SCI)

29. 王文海, 李志琦, 路通*. 基于网格切分的单阶段实例分割方法. 软件学报, to appear, 2021 (SCI)

30. Pinaki Nath Chowdhury, Palaiahnakote Shivakumara, Lokesh Nandanwar, Faizal Samiron, Umapada Pal, Tong Lu. Oil palm tree counting in drone images. Pattern Recognition Letters, to appear, 2021 (SCI)

31. Ayush Mittal, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein. A new method for detection and prediction of occluded text in natural scene images. Signal processing: image communication, to appear, 2021 (SCI)

32. Abhra Chaudhuri, Palaiahnakote Shivakumara, Pinaki Nath Chowdhury, Umapada Pal, Tong Lu, Daniel Lopresti, G. Hemantha Kumar. Deep action-oriented video image classification system for text detection and recognition. SN Appliced Sciences, to appear, 2021 (SCI)

33. Ruoze Liu, Haifeng Guo, Xiaozhong Ji, Yang Yu*, Zenjia Pang, Zitai Xiao, Yuzhou Wu, Tong Lu*. Efficient reinforcement learning for StarCraft by abstract forward models and transfer learning. IEEE Transactions on Games, to appear, 2021 (SCI)

34. Tapan Karnik, Palaiahnakote Shivakumara, Pinaki Nath Chowdhury, Umapada Pal, Tong Lu*, Nor Badrul Anuar. A new deep model for faimily and non-family photo identification. Multimedia Tools and Applications, to appear, 2021 (SCI)

35. Lokesh Nandanwar, Palaiahnakote Shivakumara, Prabi Mondal, K. S. Raghunandan, Umapada Pal, Tong Lu, Daniel Lopresti. Forged text detection in video, scene and document images. IET Image Processing, to appear, 2021 (SCI)

36. Zhiheng huang, Palaiahnakote Shivakumara, Tong Lu*, Umapada Pal, Michael Blumenstein, Bhaarat Chetty, G. H. Kumar. Improved ring radius transform-based reconstruction for video character recognition. International Journal of Pattern Recognition and Artificial Intelligence, to appear, 2021 (SCI)

37. Abhra Chaudhuri, Palaiahnakote Shivakumara, Pinaki Nath Chowdhury, Umapada Pal, Tong Lu, Daniel Lopresti, G. Hemantha Kumar. Deep action-oriented video images classification system for text detection and recognition. SN Applied Sciences, to appear, 2021 (SCI)

38. Yin-Dong Zheng, Zhaoyang Liu, Tong Lu, Limin Wang. Dynamic sampling networks for efficient action recognition in videos. IEEE Transactions on Image Processing, to appear, 2020 (SCI)

39. Minglong Xue, Palaiahnakote Shivakumara, Chao Zhang, Yao Xiao, Tong Lu*, Umapada Pal, Daniel Lopresti. Arbitrarily-oriented text detection in low light natural scene images. IEEE Transactions on Multimedia, to appear, 2020 (corresponding author)

40. Divya Krishnani, Palaiahnakote Shivakumara, Tong Lu, Umapada Pal, Daniel Lopresti, Govindaraju Hemantha Kumar. A new context-based feature for classification of emotions in photographs. Multimedia Tools and Applications, to appear, 2020

41. Lokesh Nandanwar, Palaiahnakote Shivakumara, Swati Kanchan, V. Basavaraja, D.S. Guru, Umapada Pal, Tong Lu, Michael Blumenstein. DCT-phase statistics for forged IMEI numbers and air ticket detection. Experts Systems with Applications, to appear, 2020 (corresponding author, SCI impact factor: 5.452)

42. Pinaki Nath Chowdhury, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, , Michael Blumenstein. A new augmentation-based method for text detection in night and day license plate images. Multimedia Tools and Applications, 79(43-44):1-28, 2020 (corresponding author)

43. Pinaki Nath Chowdhury, Palaiahnakote Shivakumara, Hamid A. Jalab, Rabha W. Ibrahim, Umapada Pal, Tong Lu. A new fractal series expansion based enhancement model for license plate recognition. Signal Processing: Image Communication, to appear

44. Shengkai Yue, Minglei Yuan, Tong Lu, Palaiahnakote Shivakumara, Michael Blumenstein, Jie Shi, G. Hemantha Kumar. Rotation invariant angle-density based features for an ice image classification system. Experts Systems with Applications, to appear, 2020 (corresponding author, SCI impact factor: 5.452)

45. Sauradip Nag, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu*, Michael Blumenstein. A new unified method for detecting text from Marathon runners and sports players in video. Pattern Recognition, to appear, 2020 (corresponding author, SCI)

46. Soumyadip Roy, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu*, Govindaraj Hemantha Kumar. Delaynay triangulation based text detection from multi-view images of natural scene. Pattern Recognition Letters, 129:92-100, 2019 (corresponding author, SCI)

47. Minglong Xue, Palaiahnakote Shivakumara, Chao Zhang, Tong Lu*, Umapada Pal. Curved text detection in blurred/non-blurred video/scene images. Multimedia Tools and Applications, 78(18):25629-25653, 2019 (corresponding author, SCI)

48. Maryam Asadzadeh Kaljahi, Palaiahnakote Shivakumara, Tiangping Hu, Hamid A. Jalab, Rabha W. Ibrahim, Michael Blumenstein, Tong Lu, Mohamad Nizam Bin Ayub. A geometric and fractional entropy-based method for family photo classification. Expert Systems with Applications, Online publication, 2019 (SCI impact factor: 5.452)

49. Yirui Wu, Yuechao He, Palaiahnakote Shivakumara, Ziming Li, Hongxin Guo, Tong Lu. Channel-wise attention model based fire and rating level detection in video. CAAI Transactions on Intelligence Technology, 4(2):117-121, 2019

50. Vijeta Khare, Palaiahnakote Shivakumara, Chee Seng Chan, Tong Lu*, Liang Kim Meng, Hon Kock Woon, Michael Blumenstein. A novel character segmentation-reconstruction approach for license plate recognition. Expert Systems with Applications, 131:219-239, 2019 (corresponding author, SCI impact factor: 5.452)

51. Maryam A. Kaljahi, Palaiahnakote Shivakumara, Mohd Y.I. Idris, Mohammad H. Anisi, Tong Lu*, Michael Blumenstein, Noorzaily Mohamed Noor. An automatic zone detection system for safe landing of UAVs. Expert Systems with Applications, 122:319-333, 2019 (SCI impact factor: 5.452)

52. Palaiahnakote Shivakumara, Dongqi Tang, Maryam Asadzadehkaljahi, Tong Lu*, Umapada Pal, Mohammad Hossein Anisi. A CNN-RNN based method for license plate recognition. CAAI Transactions on Intelligence Technology, 3(3):169-175, 2018

53. K.S. Raghunandan, Palaiahnakote Shivakumara, Lolika Padmanabhan, G. Hemantha Kumar, Tong Lu*, Umapada Pal. New symmetry features for license plate classification. CAAI Transactions on Intelligence Technology, 3(3):176-183, 2018

54. K. S. Raghunandan, Palaiahnakote Shivakumara, Sangheeta Roy, G. Hemantha Kumar, Umapada Pal, Tong Lu*. Multi-script-oriented text detection and recognition in video/scene/born digital images. IEEE Transactions on Circuits and Systems for Video Technology, 29(4):1145-1162, 2018 (corresponding author, SCI)

55. Sangheeta Roy, Palaiahnakote Shivakumara, Namita Jain, Vijeta Khare, Anjan Dutta, Umapada Pal, Tong Lu*. Rough-fuzzy based scene categorization for text detection and recognition in video. Pattern Recognition, 80:64-82, 2018 (corresponding author, SCI)

56. Yirui Wu, Zhouyu Meng, Palaiahnakote Shivakumara, Tong Lu. Compressive sensing based convolutional neural network for object detection. Malaysian Journal of Computer Science, 33(1):78-89, 2018

57. K. S Raghunandan, Palaiahnakote Shivakumara, Hamid A. Jalab, Rabha W. Ibrahim, G. Hemantha Kumar, Umapada Pal, Tong Lu*. Riesz fractional based model for enhancing license plate detection and recognition. IEEE Transactions on Circuits and Systems for Video Technology, 28(9):2276-2288, 2018 (corresponding author, SCI)

58. Zehuan Yuan, Tong Lu*, Chew Lim Tan. Learning discriminated and correlated patches for multi-view object detection using sparse coding. Pattern Recognition, 69:26-38, 2017 (corresponding author, SCI)

59. Palaiahnakote Shivakumara, Liang Wu, Tong Lu, Chew Lim Tan, Michael Blumenstein, Basavaraj S. Anami. A fractal-based multi-oriented text detection system for recognition in mobile video images. Pattern Recognition, 68:158-174, 2017 (SCI)

60. Yirui Wu, Tong Lu*, Zehuan Yuan, Hao Wang. FreeScup: a novel platform for assisting sculpture pose design. IEEE Transactions on Multimedia, 19(1):183-195, 2017 (corresponding author, SCI)

61. Aladhahalli Shivegowda Kavitha, Palaiahnakote Shivakumara, Govindaraj, Hemantha Kumar, Tong Lu. A new watershed model based system for character segmentation in degraded text lines. International Journal of Electronics and Communications, 71: 45-52, 2016 (SCI)

62. Sounka Dey, Palaiahnakote Shivakumara, K. S. Raghunandan, Umapada Pal, Tong Lu*, G. Hemantha Kumar, Chee Chen Chan. Script Independent appraoch for multi-oriented text detection in scene image. Neurocomputing, 242:96-112, 2016 (corresponding author, SCI)

63. Yirui Wu, Palaiahnakote Shivakumara, Tong Lu*, Chew Lim Tan, Michael Blumenstein, G. Hemantha Kumar. Contour restoration of text components for recognition in video/scene images. IEEE Transactions on Image Processing, 25(12):5622-5634, 2016 (corresponding author, SCI)

64. Palaiahnakote Shivakumara, R. Raghavendra, Longfei Qin, Kiran. B. Raja, Tong Lu*, Umapada Pal. A new multi-modal approach to bib number/text detection and recognition in Marathon images. Pattern Recognition, 61:479-491, 2016 (corresponding author, SCI)

65. Zehuan Yuan, Hao Wang, Limin Wang, Tong Lu*, Palaiahnakote Shivakumara, Chew Lim Tan. Modeling spatial layout for scene image understanding via a novel multiscale sum-product network. Expert Systems with Applications, 63:231-240, 2016 (impact factor 5.452, corresponding author, SCI)

66. A.S. Kavitha, Palaiahnakote Shivakumara, G. H. Kumar, Tong Lu. Text segmentation in degraded historical document images. Egyptian Informatics Journal, 17(2):189-197, 2016

67. Hao Wang, Tong Lu*, Yiming Wang, Palaiahnakote Shivakumara, Chew Lim Tan. Weakly-supervised region annotation for understanding scene images. Multimedia Tools and Applications, 75(6):3027-3051, 2016 (corresponding author, SCI)

68. Guozhu Liang, Palaiahnakote Shivakumara, Tong Lu*, Chew Lim Tan. Multi-spectral fusion based approach for arbitrarily-oriented scene text detection in video images. IEEE Transactions on Image Processing, 24(11):4488-4501, 2015 (corresponding author, SCI)

69. Liang Wu, Palaiahnakote Shivakumara, Tong Lu*, Chew Lim Tan. A new technique for multi-oriented scene text lines detection and tracking in video. IEEE Transactions on Multimedia, 17(8):1137-1152, 2015 (corresponding author, SCI)

70. Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu*, Chew Lim Tan, Seiichi Uchida. A new method for multi oriented graphics-scene-3D text classification in video. Pattern Recognition, 49(1):19-42, 2015 (corresponding author, SCI)

71. Sangheeta Roy, Palaiahnakote Shivakumara, Hamid A. Jalab, Rabha W. Ibrahim, Umapada Pal, Tong Lu. Fractional poisson enhancement model for text detection and recognition in video images. Pattern Recognition, 52:433-447, 2015 (corresponding author, SCI)

72. Yirui Wu, Oscar Kin-Chung Au, Chiew-Lan Tai, Tong Lu*. HIRM: a handle-independent reduced model for incremental mesh editing. Computer-Aided Geometric Design, 35(36):56-68, 2015 (corresponding author, SCI)

73. Palaiahnakote Shivakumara, Zehuan Yuan, Danni Zhao, Tong Lu, Chew Lim Tan. New gradient-Spatial-Structural features for video script identification. Computer Vision and Image Understanding, 130:35-53, 2015 (SCI)

74. Yirui Wu, Palaiahnakote Shivakumara, Wei Wang, Tong Lu*, Umapada Pal. A new ring radius transform based thinning method for multi-oriented video characters. International Journal on Document Analysis and Recognition, 18(2):137-151, 2015 (corresponding author, SCI)

75. Sangheeta Roy, Palaiahnakote Shivakumara, Partha Pratim Roy, Umapada Pal, Chew Lim Tan, Tong Lu. Bayesian classifier for multi-oriented video text recognition system. Expert Systems with Applications, 42(13):5554-5566, 2015 (impact factor 5.452, SCI)

76. Shangxuan Tian, Palaiahnakote Shivakumara, Trung Quy Phan, Tong Lu, Chew Lim Tan. Character shape restoration system through media axis points in video. Neurocomputing, 161(5):183-198, 2015 (SCI)

77. Tong Lu, Gongyou Wang, Feng Su. Context-based environmental audio event recognition for scene understanding. Multimedia Systems, 21(5):507-524, 2014 (corresponding author, SCI)

78. Tong Lu*, Yukang Jin, Feng Su, Palaiahnakote Shivakumara, Chew Lim Tan. Content-oriented multimedia document understanding through cross-media correlation. Multimedia Tools and Applications, 74(18):8105-8135, 2014 (corresponding author, SCI)

79. Hao Wang, Tong Lu*, Oscar Kin-Chung Au, Chiew-Lan Tai. Spectral 3D mesh segmentation with a novel single segmentation field descriptor. Graphical Models, 76(5):440-456, 2014 (corresponding author, SCI)

80. Zehuan Yuan, Tong Lu. Incremental 3D reconstruction using Bayesian learning. Applied Intelligence, 39(4):761-771, 2013 (SCI)

81. Wenyin Liu, Tong Lu*, Yajie Yu, Liang Shuang, Rui Zhang. Online stroke segmentation by quick penalty-based dynamic programming. IET Computer Vision, 7(5):311-319, 2013 (corresponding author, SCI)

82. Tong Lu, Chiew-Lan Tai, Huafei Yang, Shijie Cai. A novel knowledge-based system for interpreting complex engineering drawings: theory, representation and implementation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(8):1444-1457, 2009 (corresponding author, SCI, impact factor: 9.455)

83. Tong Lu, Huafei Yang, Ruoyu Yang, Shijie Cai. Automatic analysis and integration of architectural drawings. International Journal on Document Analysis and Recognition, 9(1):31-47, 2007 (corresponding author, SCI)

84. Tong Lu, Chiew-Lan Tai, Feng Su, Shijie Cai. A new recognition model for electronic architectural drawings. Computer-Aided Design, 37(10):1053-1069, 2005 (corresponding author, SCI)

85. Tong Lu, Chiew-Lan Tai, Li Bao, Feng Su, Shijie Cai. 3D reconstruction of detailed buildings from architectural drawings. Computer-Aided Design and Applications, 2(1-4):527-536, 2005 (corresponding author)

Books

86. Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Tapabrata Chakraborti, Michael Blumenstein. A new roadmap for evaluating descriptive handwritten answer type. Advances in Networks, Security and Commnuications: Reviews, vol. 2, Book Series (Book Chapter), IFSA Publishing, Barcelona, Spain, to appear, 2018

87. Palaiahnakote Shivakumara, Umapada Pal, Sangheeta Roy, Tong Lu, Michael Blumenstein. Identification of superimposed and scene text in video frames. Book Chapter, to appear, 2017

88. Tong Lu, Palaiahnakote Shivakumara, Chew Lim Tan, Wenyin Liu. Developments of computer vision and pattern recognition: video text detection. Springer London, 2014 (ISBN 978-1-4471-6514-9, corresponding author)

89. Tong Lu, Wenyin Liu. Handbook of document image processing and recognition. Book Chapter. Springer New York, 2014 (ISBN 0857298607, corresponding author)

Selected International Conference Papers

2026

90. Guangchen Shi, Yirui Wu, Zhu Wei, Tao Wang, Hao Zhang, Bo Li, Tong Lu. Bayesian decomposition and sematic completion for few-shot semantic segmentation. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'26), June 3 - 7, Denver, USA, 2026

91. Lidong Lu, Guo Chen, Zhu Wei, Zhiqi Li, Yicheng Liu, Tong Lu. AV-Reasoner: improving and benchmarking clue-grounded audio-visual counting for MLLMs. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'26), June 3 - 7, Denver, USA, 2026

92. Zhi Zhu, Yaoqi Fan, Zhe Chen, Yue Cao, Yangzhou Liu, Tong Lu. Will multimodal models be dazzled by multi-image visual puzzles? IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'26), June 3 - 7, Denver, USA, 2026

93. Kanghua Pan, Guo Chen, Wei Zhu, Huaidan Zhao, Tong Lu. HAM-SAM2: enhancing SAM2 for visual object tracking with adaptive motion modeling and hierarchical memory bank. 2026 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'26), May 4-8, Barcelona, Spain, 2026

94. Guangchen Shi, Shuang Liu, Wei Zhu, Zhi Zhu, Yirui Wu, Tong Lu. PRCBench: evaluating text-rich visual comprehension in multimodel large language models. 2026 IEEE International Conference on Multimedia (ICME'26), July 6-10, Bangkok, Thailand, 2026

95. Xinyi Mao, Liangrui Dong, Wei Zhu, Tong Lu. Dual-chain agent with adaptive exploration for long videl understanding. 2026 IEEE International Conference on Multimedia (ICME'26), July 6-10, Bangkok, Thailand, 2026

2025

96. Yuping He, Yifei Huang, Guo Chen, Baoqi Pei, Jilan Xu, Jiangmiao Pang, Tong Lu. EgoExoBench: a benchmark for first- and third- person view video understanding in MLLMs. The 39th Annual Conference on Neural Information Processing Systems (NIPS'25), Dec 2 - 7, San Diego, USA

97. Guo Chen, Zhiqi Li, Shihao Wang, Jindong Jiang, Yicheng Liu, Lidong Lu, DSe-an Huang, Wonmin Byeon, Matthieu Le, Max Enrlich, Tong Lu, Liming Wang, Bryan Catanzaro, Jan Kautz, Andew Tao. Eagle 2.5: boosting long-context post-training for frontier vision-language models. The 39th Annual Conference on Neural Information Processing Systems (NIPS'25), Dec 2 - 7, San Diego, USA

98. Guo Chen, Yifei Huang, Yin-Dong Zheng, Yicheng Liu, Jiahao Wang, Tong Lu. Egocentric object-interaction anticipation with retentive and predictive learning. The Thirty-Fourth International Joint Conference on Artificial Intelligence (IJCAI'25), Aug 16 - 22, Montreal, USA

99. Yirui Wu, Yuhang Xia, Hao Li, Lixin Yuan, Junyang Chen, Jun Liu, Tong Lu, Shaohua Wan. Deconfound semantic shift and incompleteness in incremental few-shot semantic segmentation. The 39th AAAI Conference on Artificial Intelligence (AAAI'25), Philadelphia, Pennsylvania, USA, Feb 25 - March 4, 2025

100. Guo Chen, Yicheng Liu, Yifei Huang, Baoqi Pei, Jilan Xu, Yuping He, Tong Lu, Yali Wang, Limin Wang. CG-Bench: clue-grounded question answering benchmark for long video understanding. The 13th International Conference on Learning Representations (ICLR'25), Singapore, April 24 - 28, 2025

101. Yuchen Duan, Weiyun Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Hongsheng Li, Jifeng Dai, Wenhai Wang. Vision-RWKV: efficient and scalable visual perception with RWKV-like architectures. The 13th International Conference on Learning Representations (ICLR'25), Singapore, April 24 - 28, 2025

102. Qingyun Li, Zhe Chen, Weiyun Wang, Wenhai Wang, Shenglong Ye, Zhenjiang Jin, Guanzhou Chen, Yinan He, Zhangwei Gao, Erfei Cui, Jiashou Yu, Hao Tian, Jiasheng Zhou, Chao Xu, Bin Wang. OmniCorpus: a unified multimodal corpus of 10 billion-level images interleaved with text. The 13th International Conference on Learning Representations (ICLR'25), Singapore, April 24 - 28, 2025

103. Qingyun Li, Zhe Chen, Weiyun Wang, Wenhai Wang, Shenglong Ye, Zhenjiang Jin, Guanzhou Chen, Yinan He, Zhangwei Gao, Erfei Cui, Jiashou Yu, Hao Tian, Jiasheng Zhou, Chao Xu, Bin Wang. OmniCorpus: a unified multimodal corpus of 10 billion-level images interleaved with text. The 13th International Conference on Learning Representations (ICLR'25), Singapore, April 24 - 28, 2025

104. Tao Wang, Peiwen Xia, Bo Li, Peng-tao Jiang, Zhe Kong, Kaihao Zhang, Tong Lu, Wenhan Luo. MOREL: when mixture-of-experts meet reinforcement learning for adverse weather image restoration. The 13th International Conference on Computer Vision (ICCV'25), Honolulu, Hawaii, Oct 19 - 28, 2025

105. Zhiqian Shao, Tao Wang, Kaihao Zhang, Danhuai Zhao, Tong Lu. LLFA: fusing global illumination and local priors for low-light face image enhancement with adaptor. 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'25), April 6-11, Hyderabad, India, 2025

106. Xiaoge Song, Danhuai Zhao, Wei Zhu, Kang Zheng, Tong Lu. Conditional convolutions for end-to-end single-stage video text detection. 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'25), April 6-11, Hyderabad, India, 2025

107. Peiwen Xia, Tangfei Liao, Wei Zhu, Danhuai Zhao, Jianjun Ke, Kaihao Zhang, Tong Lu , Tao Wang. CorrMoE: mixture of experts with de-stylization learning for cross-scene and corss-domain correspondence pruning. The 28th European Conference on Artificial Intelligence (ECAI'25), Oct 25-30 Bologna, Italy, 2025

2024

108. Zhiqi Li, Zhiding Yu, Shiyi Lan, Jiahan Li, Jan Kautz, Tong Lu, Jose M. Alvarez. Is ego status all you need for open-loop end-to-end autonomous driving? IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'24), Seattle, WA, June 17-21, 2024

109. Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Zhong Muyan, Qing-long Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai. InternVL: scaling up vision foundation models and aligning for generic visual-linguistic tasks. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'24), Seattle, WA, June 17-21, 2024

110. Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai. Efficient deformable convNets: rethinking dynamic and sparse operator for vision applications. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'24), Seattle, WA, June 17-21, 2024

111. Yi Rong, Haoran Zhou, Kang Xia, Cheng Mei, Jiahao Wang, Tong Lu*. RepKPU: point cloud upsampling with kernel point representation and deformation. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'24), Seattle, WA, June 17-21, 2024

112. Jiannan Wu, Muyan Zhong, Sen Xing, Zeqiang Lai, Zhaoyang Liu, Wenhai Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Ping Luo, Yu Qiao, Jifeng Dai. VisionLLM v2: an end-to-end generalist multimodal large language model for hundreds of vision-language tasks. Thirty-Eighth Annual Conference on Neural Information Processing Systems (NIPS'24), Vancouver, Canada, Dec 10-15, 2024

113. Yi Rong, Haoran Zhou, Lixin Yuan, Cheng Mei, Jiahao Wang, Tong Lu*. CRA-PCN: point cloud completion with intra- and inter-level cross-resolution transformers. The 38th AAAI Conference on Artificial Intelligence (AAAI'24), Vancouver, Canada, Feb 20-Feb 27, 2024

114. Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu*. AVSegFormer: audio-visual segmentation with transformer. The 38th AAAI Conference on Artificial Intelligence (AAAI'24), Vancouver, Canada, Feb 20-Feb 27, 2024

115. Guo Chen, Yicheng Liu, Yifei Huang, Yuping He, Baoqi Pei, Jian Xu, Yali Wang, Tong Lu, Limin Wang. CG-Bench: clue-grounded question answering benchmark for long video understanding. The Thirteen International Conference on Learning Representations (ICLR'24), 2024

116. Guangchen Shi, Wei Zhu, Yirui Wu, Huaidan Zhao, Kang Zheng, Tong Lu*. Few-shot semantic segmentation via perceptual attention and spatial control. ACM Multimedia 2024 (ACM MM'24), Melbourne, Australia, Oct 28-Nov 1, 2024

117. Xuanxi Chen, Tong Lu*. SVT: spectral video transformer for video restoration in under-display camera. The 2024 IEEE International Conference on Multimedia and Expo (ICME'24), Niagra Falls, Canada, July 15-19, 2024

118. Wei Zhu, Yicheng Liu, Yuping He, Tangfei Liao, Kang Zheng, Xiaoqiu Xu, Tao Wang, Tong Lu. CorrAdaptor adaptive local context learning for correspondence pruning. The 27th European Conference on Artificial Intelligence (ECAI'24), Santiago de Compostela, Oct 19-24, 2024

2023

119. Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai. VisionLLM: large language model is also an open-ended decoder for vision-centric tasks. The Thirty-seventh Conference on Neural Information Processing Systems (NIPS'23), New Orleans, USA, Dec 12-16, 2023

120. Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao. InterIMAGE: exploring large-scale vision fundamental models with deformable convolutions. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'23), Vancouver, Canada, June 18-June 22, 2023 (Highlight)

121. Zhiqi Li, Zhiding Yu, Wenhai Wang, Animashree Anandkumar, Tong Lu, Jose Alvarez. FB-BEV: BEV representation from forward-backward view transformations. International Conference on Computer Vision (ICCV'23), Paris, France, Oct 2-6, 2023

122. Jiahao Wang, Guo Chen, Yifei Huang, Limin Wang, Tong Lu. Mmeory-and-anticipation transformer. International Conference on Computer Vision (ICCV'23), Paris, France, Oct 2-6, 2023

123. Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo. DDP: diffusion model for dense visual prediction. International Conference on Computer Vision (ICCV'23), Paris, France, Oct 2-6, 2023

124. Tao Wang, Kaihao Zhang, Tianrun Shen, Wenhan Luo, Bjorn Stenger, Tong Lu*. Ultra-high-definition low-light image enhancement: a benchmark and transformer-based method. The 37th AAAI Conference on Artificial Intelligence (AAAI'23), Washington, DC, USA, Feb 7-Feb 14, 2023

125. Zhe Chen, Hao Tan, Tao Wang, Tianrun Shen, Tong Lu*, Qiuying Peng, Cheng Cheng, Yue Qi. Graph propagation tranformer for graph representation learning. The 32th International Joint Conference on Artificial Intelligence (IJCAI'23), Macao, S.A.R., Aug 19-25, 2023

126. Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu*, Jifeng Dai, Yu Qiao. Vision transformer adapter for dense predictions. The 11th International Conference on Learning Representations (ICLR'23), Kigali, Rwanda, May 1-5, 2023 (Spotlight paper)

127. Yindong Zheng, Guo Chen, Minglei Yuan, Tong Lu*. MRSN: multi-relation support network for video action detection. The 2023 IEEE International Conference on Multimedia and Expo (ICME'23), July, 10-14, Brisbane, Australia, 2023

128. Guo Chen, Yindong Zheng, Zhe Chen, Jiahao Wang, Tong Lu*. ELAN: enhancing temporal action detection with location awareness. The 2023 IEEE International Conference on Multimedia and Expo (ICME'23), July, 10-14, Brisbane, Australia, 2023

2022

129. Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Animashree Anandkumar, Jose M. Alvarez, Tong Lu*, Ping Luo. Panoptic segformer: delving deeper into panoptic segmentation with transformers. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'22), New Orleans, Louisiana, June 21-June 24, 2022

130. Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai. BEVFormer: learning bird-eye-view representations from multi-view images via spatiotemporal transformer. European Conference on Computer Vision (ECCV'22), 2022

131. Guangchen Shi, Shuang Liu, Wei Zhu, Zhi Zhu, Yirui Wu, Tong Lu. PRCBench: evaluating text-rich visual comprehension in multimodel large language models. 2026 IEEE International Conference on Multimedia (ICME'26), July 6-10, Bangkok, Thailand, 2026

132. Guo Chen, Yindong Zheng, Limin Wang, Tong Lu*. DCAN: improving temporal action detection via dual context aggregation. The 36th AAAI Conference on Artificial Intelligence (AAAI'22), Lisbon, Portugal, Oct 10-14, 2022

133. Guangchen Shi, Yirui Wu, Jun Liu, Wenhai Wang, Tong Lu. Incremental few-shot semantic segmentation via embedding adaptive-update and hyper-class representation. ACM Multimedia (ACM MM'22), Vancouver, BC, Canada, Feb 22-March 1, 2022

134. Haoran Zhou, Yun Cao, Wenqing Chu, Junwei Zhu, Tong Lu*, Ying Tai, Chengjie Wang. SeedFormer: patch seeds based point cloud completion with upsample transformer. European Conference on Computer Vision (ECCV'22), 2022

135. Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu*, Ping Luo. Towards ultra-resolution neural style transfer via thumbnail instance normalization. The 36th AAAI Conference on Artificial Intelligence (AAAI'22), Vancouver, BC, Canada, Feb 22-March 1, 2022

2021

136. Guangping Tao, Xiaozhong Ji, Wenzhuo Wang, Shuo Chen, Chuming Lin, Yun Cao, Tong Lu*, Donghao Luo, Ying Tai. Spectrum-to-kernel translation for accurate blind image super-resolution. Thirty-fifth Conference on Nerual Information Processing Systems (NIPS'21), Dec 6-14, 2021

137. Wenhai Wang, Enze Xie, Xiang Li, Dengping Fan, Ding Liang, Tong Lu*, Ping Luo, Ling Shao. Pyramid vision transformer: a versatile backbone for dense prediction without convolutions. International Conference on Computer Vision (ICCV'21), Oct 11-17, 2021 (oral, top-10 Most Influential ICCV 2021 Papers)

138. Haoran Zhou, Yidan Feng, Mingsheng Fang, Mingqiang Wei, Jing Qin, Tong Lu*. Adaptive graph convolution for point cloud analysis. International Conference on Computer Vision (ICCV'21), Oct 11-17, 2021

139. Zhaoyang Liu, Limin Wang, Wayne Wu, Chen Qian, Tong Lu. TAM: temporal adaptive module for video recognition. International Conference on Computer Vision (ICCV'21), Oct 11-17, 2021

140. Xiaozhong Ji, Guangpin Tao, Yun Cao, Ying Tai, Tong Lu*, Chengjie Wang, Jilin Li, Feiyue Huang. Frequency consistent adaptation for real world super resolution. The 35th AAAI Conference on Artificial Intelligence (AAAI'21), Feb 2-9, 2021

141. Minglong Xue, Ruoze Liu, Tong Lu*. A novel attention enhanced residual-in-residual dense network for text image super-resolution. The 2021 IEEE International Conference on Multimedia and Expo (ICME'21), July, 5-9, Shenzhen, China, 2021

142. Guangcheng Shi, Yirui Wu, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu. ARNet: active-reference network for few-shot image semantic segmentation. The 2021 IEEE International Conference on Multimedia and Expo (ICME'21), July, 5-9, Shenzhen, China, 2021

143. Palaiahnakote Shivakumara, Tanmay Jain, Nitish Surana, Umapada Pal, Tong Lu, Michael Blumenstein. Connected component based deep learning model for multi-type-sized struck-out component classification. The 16th International Conference on Document Analysis and Recognition (ICDAR'21), Sep, 5-10, Lausanne, Switzerland, 2021

2020

144. Wenhai Wang, Xuebo Liu, Xiaozhong Ji, Enze Xie, Ding Liang, Zhibo Yang, Tong Lu*, Chunhua Shen, Ping Luo. AE TextSpotter: learning visual and linguistic representation for ambiguous text spotting. The 16th European Conference on Computer Vision (ECCV'20), Aug 23-28, 2020

145. Zhaoyang Liu, Donghao Luo, Yabiao Wang, Limin Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Tong Lu. Temporal enhancement-and-interaction networks for action recognition. The 34th AAAI Conference on Artificial Intelligence (AAAI'20), Hilton New York Midtown, New York, USA, Feb 7-12, 2020

146. Dongqi Tang, Hao Kong, Xi Meng, Ruoze Liu, Tong Lu*. SEE-LPR: A semantic segmentation based end-to-end system for unconstrained license plate detection and recognition. The 26th International Conference on Multimedia Modeling (MMM'20), Daejeon, Korea, Jan 5-8, 2020 (corresponding author)

147. Pengfei Chen, Minglei Yuan, Tong Lu*. Multi-scale comparsion network for few-shot learning. The 26th International Conference on Multimedia Modeling (MMM'20), Daejeon, Korea, Jan 5-8, 2020 (corresponding author)

148. Xiaozhong Ji, Yirui Wu, Tong Lu*. Context-aware residual network with promotion gates for single image super-resolution. The 26th International Conference on Multimedia Modeling (MMM'20), Daejeon, Korea, Jan 5-8, 2020 (corresponding author)

149. Xiaoge Song, Yirui Wu, Wenhai Wang, Tong Lu*. TK-Text: multi-shaped scene text detection via instance segmentation. The 26th International Conference on Multimedia Modeling (MMM'20), Daejeon, Korea, Jan 5-8, 2020 (corresponding author)

150. Ayush Mittal, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein, Daniel Lopresti. A new context based method for restoring occluded text in natural scene images. The 14th IAPR International Workshop on Document Analysis Systems (DAS'20), July, 2020

151. Lokesh Nandanwar, Palaiahnakote Shivakumara, Suvojit Manna, Umapada Pal, Tong Lu, Michael Blumenstein. A new DCT-FFT fusion based method for caption and scene text classification in action video images. The 2th International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI'20), Zhongshan, 2020

152. Lokesh Nandanwar, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Daniel Lopresti, Bhagesh Seraogi, Bidyut. B. Chaudhuri. A new method for detecting altered text in pdf document images. The 2th International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI'20), Zhongshan, 2020

153. Yuntao Ma, Yirui Wu, Tong Lu*. Multi-scale relational reasoning with regional attention for visual question answering. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

154. Haifeng Guo, Yirui Wu, Tong Lu*. Dyanmic low-light image enhancement for object detection via end-to-end training. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

155. Chunhao Cai, Minglei Yuan, Tong Lu*. IFSM: an iterative feature selection mechanism for few-shot image classification. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

156. Lokesh Nandanwar, Palaiahnakote Shivakumara, Sayani Kundu, Umapada Pal, Tong Lu, Daniel Lopresti. Chebyshev-Harmonic-Fourier-Moments and deep CNNs for detecting forged handwriting. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

157. Lokesh Nandanwar, Palaiahnakote Shivakumara, Raghavendra Ramachandra, Tong Lu, Umapada Pal, Daniel Lopresti, Nor Badrul Anuar. Local gradient difference based mass features for classification of 2D-3D natural scene text images. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

2019

158. Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu*, Gang Yu, Chunhua Shen. Shape robust text detection with progressive scale expansion network. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'19), Long Beach, CA, June 16-20, 2019 (corresponding author)

159. Wenhai Wang, Enze Xie, Yuhang Zang, Wenjia Wang, Tong Lu*, Gang Yu, Chunhua Shen. Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. International Conference on Computer Vision (ICCV'19), Seoul, Korea, Oct. 27- Nov. 2, 2019 (corresponding author)

160. Zhenjia Pang, Ruoze Liu, Zhouyu Meng, Yi Zhang, Yang Yu, Tong Lu. On reinforcement learning for full-length game of StarCraft. The 33th AAAI Conference on Artificial Intelligence (AAAI'19), Honolulu, Hawaii, Jan27-Feb 1, 2019

161. Xi Meng, Hao Kong, Dongqi Tang, Tong Lu*. Multimodal image captioning through combining reinforced cross loss and stochastic deprecation. IEEE International Conference on Multimedia and Expo (ICME'19), Shanghai, China, July 8-12, 2019 (corresponding author)

162. Yisheng Yue, Palaiahnakote Shivakumara, Yirui Wu, Liping Zhu, Tong Lu*, Umapada Pal. An automatic system for generating artificial fake character images. The 25th International Conference on Multimedia Modeling (MMM'19), Thessaloniki, Greece, Jan 8-11, 2019 (corresponding author)

163. Yirui Wu, Weigang Xu, Qinghan Yu, Jun Feng, Tong Lu. Hierarchical Bayesian network based incremental model for flood prediction. The 25th International Conference on Multimedia Modeling (MMM'19), Thessaloniki, Greece, Jan 8-11, 2019 (corresponding author)

164. Wenbo Hou, Wenhai Wang, Ruoze Liu, Tong Lu*. Cropout: a general mechanism for reducing overfitting on convolutional neural networks. 2019 International Joint Conference on Neural Network (IJCNN'19), Budapest, Hungary, July 14-19, 2019 (corresponding author)

165. Minglei Yuan, Xiaozhong Ji, Tong Lu*, Pengfei Chen, Hualu Zhang. A novel two-factor attention encoder-decoder network through combining temporal and prior knowledge for weather forecasting. 2019 International Joint Conference on Neural Network (IJCNN'19), Budapest, Hungary, July 14-19, 2019 (corresponding author)

166. Yindong Zheng, Yuntao Ma, Ruoze Liu, Tong Lu*. A novel group-aware pruning method for few-shot learning. 2019 International Joint Conference on Neural Network (IJCNN'19), Budapest, Hungary, July 14-19, 2019 (corresponding author)

167. Hao Kong, Dongqi Tang, Xi Meng, Tong Lu*. GARN: a novel generative adversarial recognition network for end-to-end scene character recognition. The 15th International Conference on Document Analysis and Recognition (ICDAR'19), Sydney, Australia, Sep 20-25, 2019 (corresponding author)

168. Yao Xiao, Minglong Xue, Tong Lu*, Yirui Wu, Shivakumara Palaiahnakote. A text-content-aware CNN network for multi-oriented and multi-language scene text detection. The 15th International Conference on Document Analysis and Recognition (ICDAR'19), Sydney, Australia, Sep 20-25, 2019 (corresponding author)

169. V. Basavaraja, Shivakumara Palaiahnakote, D.S. Guru, Umapada Pal, Tong Lu, Michael Blumenstein. Age estimation using disconnectedness features in handwriting. The 15th International Conference on Document Analysis and Recognition (ICDAR'19), Sydney, Australia, Sep 20-25, 2019 (corresponding author)

170. Sauadip Nag, R. Raghavendra, Shivakumara Palaiahnakote, Umapada Pal, Tong Lu, Mohan Kankanhalli. CRNN based jersey-bib number/text recognition in sports and Marathon images. The 15th International Conference on Document Analysis and Recognition (ICDAR'19), Sydney, Australia, Sep 20-25, 2019

171. Divya Krishnani, Shivakumara Palaiahnakote, Tong Lu, Umapada Pal, Raghavendra Ramachadra. Structure function based transform features for person behaviour-oriented social media image classification. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

172. Anaica Grouver, Shivakumara Palaiahnakote, Maryam Asadzadeh Kaljahi, Bhaarat Chetty, Umapada Pal, Tong Lu, G. Hemantha Kumar. A spatial density and phase angle based correlation for forged family photo identification. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

173. Sayani Kundu, Shivakumara Palaiahnakote, Anaica Grouver, Umapada Pal, Tong Lu, Michael Blumenstein. A new forged handwritting detection method based on Fourier spectural density and variation. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

174. Soumyadip Roy, Shivakumara Palaiahnakote, Umapada Pal, Tong Lu, Michael Blumenstein. New moments based fuzzy similarity measure for text detection in distorted social media images. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

175. Pinaki Nath Chowdhury, Shivakumara Palaiahnakote, Raghavendra Ramachandra, Umapada Pal, Tong Lu, Michael Blumenstein. A new U-net based enhancement model for license plate detection in night and day images. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

2018

176. Wenhai Wang, Jian Yang, Tong Lu*, Xiang Li. Mixed link networks. The 27th International Joint Conference on Artificial Intelligence (IJCAI'18), Stockholm, Sweden, July 13-19, 2018 (corresponding author)

177. Lianglei Wei, Yirui Wu, Wenhai Wang, Tong Lu*. A novel 3D human action recognition framework for video content analysis. The 24th International Conference on Multimedia Modeling (MMM'18), Bangkok, Thailand, Feb 5-8, 2018 (corresponding author)

178. Wenhai Wang, Yirui Wu, Palaiahnakote Shivakumara, Tong Lu*. Cloud of line distribution and random forest text detection from natural/video images. The 24th International Conference on Multimedia Modeling (MMM'18), Bangkok, Thailand, Feb 5-8, 2018 (corresponding author)

179. Yirui Wu, Weigang Xu, Jun Feng, Palaiahnakote Shivakumara, Tong Lu. Local and global Bayesian network based model for flood prediction. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018

180. Xuerong Wu, Palaiahnakote Shivakumara, Liping Zhu, Tong Lu*, Umapada Pal, Michael Blumenstein. Fourier transform based features for clean and polluted water image classification. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018 (corresponding author)

181. Vijeta Khare, Palaiahnakote Shivakumara, B. J Navya, G. C Swetha, D. S Guru, Umapada Pal, Tong Lu*. Weighted-gradient features for handwritten line segmentation. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018 (corresponding author)

182. Yirui Wu, Zhikai Li, Palaiahnakote Shivakumara, Tong Lu*. Em-SLAM: a fast and robust monocular SLAM method for embedded systems. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018 (corresponding author)

183. B. J Navya, G. C Swetha, Palaiahnakote Shivakumara, Sangheeta Roy, D. S Guru, Umapada Pal, Tong Lu*. Multi-gradient directional features for gender identification. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018 (corresponding author)

184. Yirui Wu, Zhaoyang Liu, Weigang Xu, Jun Feng, Palaiahnakote Shivakumara, Tong Lu. Context-aware attention LSTM network for flood prediction. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018

185. Minglei Yuan, Palaiahnakote Shivakumara, Hao Kong, Tong Lu*, Umapada Pal. Text component reconstruction for tracking in video. The 19th Pacific-Rim Conference on Multimedia (PCM'18), Hefei, China, Sep 21-22, 2018 (corresponding author)

186. Zhaoyang Liu, Yirui Wu, Yukai Ding, Jun Feng, Tong Lu*. Context and temporal aware attention model for flood prediction. The 19th Pacific-Rim Conference on Multimedia (PCM'18), Hefei, China, Sep 21-22, 2018 (corresponding author)

187. Chao Zhang, Palaiahnakote Shivakumara, Minglong Xue, Liping Zhu, Tong Lu*, Umapada Pal. New fusion based enhancement for text detection in night video footage. The 19th Pacific-Rim Conference on Multimedia (PCM'18), Hefei, China, Sep 21-22, 2018 (corresponding author)

188. Tianping Hu, Wenhai Wang, Tong Lu*. Hand pose estimation with attention-and-sequence network. The 19th Pacific-Rim Conference on Multimedia (PCM'18), Hefei, China, Sep 21-22, 2018 (corresponding author)

189. Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Tapabrata Chakraborti, Michael Blumenstein. A new roadmap for evaluating descriptive handwritten answer type. The 30th International Conference on Pattern Recognition & Artificial Intelligence (ICPRAI'18), Quebec, Canada, 2018

190. Yirui Wu, Yisheng Yue, Xiao Tan, Wei Wang, Tong Lu. End-to-end chromosome karyotyping with data augmentation using GAN. The 25th International Conference on Image Processing (ICIP'18), Athens, Greece, Oct 7-10, 2018

191. Palaiahnakote Shivakumara, V. Basavaraja, Harsha S. Gowda, D. S. Guru, Umapada Pal, Tong Lu*. New GRB based fusion for forged IMEI number detection in mobile images. The 16th International Conference on Frontiers in Handwriting Recognition (ICFHR'18), Niagara Falls, USA, Aug 5-8, 2018

192. B. J. Navya, Palaiahnakote Shivakumara, G. C Shwetha, Sangheeta Roy, D. S. Guru, Umapada Pal, Tong Lu*. Adaptive multi-gradient kernels for gender identification. The 16th International Conference on Frontiers in Handwriting Recognition (ICFHR'18), Niagara Falls, USA, Aug 5-8, 2018

193. Sauradip Nag, Palaiahnakote Shivakumara, Yirui Wu, Umapada Pal, Tong Lu*. New COLD feature based handwriting analysis for enthnicity/nationality identification. The 16th International Conference on Frontiers in Handwriting Recognition (ICFHR'18), Niagara Falls, USA, Aug 5-8, 2018

194. Wencan Zong, Alex Noel Joseph Raj, Palaiahnakote Shivakumara, Zhemin Zhuang, Tong Lu, Umapada Pal. A new shadow detection and depth removal method for 3D text recognition in scene images. The 2th International Conference on Computer Science and Artificial Intellignce (CSAI'18), Shenzhen, China, Dec 8-10, 2018

2017

195. Zehuan Yuan, Jonathan Stroud, Tong Lu*, Jia Deng. Temporal action localization by structured maximal sums. Computer Vision and Pattern Recognition 2017 (CVPR'17), Honolulu, Hawaii, July 22-25, pp. 3215-3223, 2017 (corresponding author)

196. Zehuan Yuan, Tong Lu*, Yirui Wu. Deep-dense conditional random fields for object co-segmentation. The 26th International Joint Conference on Artificial Intelligence (IJCAI'17), Melbourne, Australia, Aug 19-25, pp. 3371-3377, 2017 (corresponding author)

197. Yiyang Zhou, Wenhai Wang, Wenjie Guan, Yirui Wu, Heng Lai, Tong Lu*, Min Cai. Visual robotic object grasping through combining RGB-D data and 3D mesh. The 23th International Conference on Multimedia Modeling (MMM'17), Reykjavik, Iceland, January 4-6, pp. 404-415, 2017 (corresponding author)

198. Ruoze Liu, Xin Sun, Hailiang Xu, Palaiahnakote Shivakumara, Feng Su, Tong Lu*, Ruoyu Yang. Robust scene text detection for multi-script languages using deep learning. The 23th International Conference on Multimedia Modeling (MMM'17), Reykjavik, Iceland, January 4-6, pp. 329-340, 2017 (corresponding author)

199. Zhen Wang, Palaiahnakote Shivakumara, Tong Lu*, Mahadevappa Basavanna, Umapada Pal, Michael Blumenstein. Fourier-residual for printer identification. The 14th International Conference on Document Analysis and Recognition (ICDAR'17), Kyoto, Japan, Nov 9-15, 2017 (corresponding author)

200. Yirui Wu, Wenhai Wang, Palaiahnakote Shivakumara, Tong Lu*. A robust symmetry-based method for scene/video text detection through convolutional network. The 14th International Conference on Document Analysis and Recognition (ICDAR'17), Kyoto, Japan, Nov 9-15, 2017 (corresponding author)

201. Sangheeta Roy, Palaiahnakote Shivakumara, Namita Jain, Vijeta Khare, Umapada Pal, Tong Lu. New fuzz-mass features for video type categorization. The 14th International Conference on Document Analysis and Recognition (ICDAR'17), Kyoto, Japan, Nov 9-15, 2017 (corresponding author)

202. Sangheeta Roy, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Wahid Bin Abdul Wahab. Temporal integration for word-wise image type classification. The 14th International Conference on Document Analysis and Recognition (ICDAR'17), Kyoto, Japan, Nov 9-15, 2017 (corresponding author)

203. Wenhai Wang, Yirui Wu, Palaiahnakote Shivakumara, Tong Lu, Jun Liu. Cloud of line distribution for arbitrary text detection in scene/video/license plate images. The 18th Pacific-Rim Conference on Multimedia (PCM'17), Harbin, China, Sep 28-29, 2017

204. Yirui Wu, Zhouyu Meng, Palaiahnakote Shivakumara, Tong Lu. Compressing YOLO network by compressive sensing. The 4th Asian Conference on Pattern Recognition (ACPR'17), Nanjing, China, Nov 26-29, 2017

205. Palaiahnakote Shivakumara, Aishik Konwer, Abir Bhowmick, Vijeta Khare, Umapada Pal, Tong Lu. A new GVF arrow pattern for character segmentation from double line license plate images. The 4th Asian Conference on Pattern Recognition (ACPR'17), Nanjing, China, Nov 26-29, 2017

206. K. S. Raghunandan, Palaiahnakote Shivakumara, G. Hemantha Kumar, Umapada Pal, Tong Lu. Sharpness and contrast based features for word-wise video type classification. The 4th Asian Conference on Pattern Recognition (ACPR'17), Nanjing, China, Nov 26-29, 2017

2016

207. Yirui Wu, Xianli Zhou, Tong Lu*, Guo Mei, Linbi Sun. EvaToon: a novel graph matching system for evaluating cartoon drawings. The 23th International Conference on Pattern Recognition (ICPR'16), Cancun, Mexico, December 4-8, pp. 1119-1124, 2016 (corresponding author)

208. Vijeta Khare, Palaiahnakote Shivakumara, Ahald Kumar, Chee Seng Chan, Tong Lu*, Michael Blumenstien. A quad tree based method for blurred and non-blurred video text frames classification through quality metrics. The 23th International Conference on Pattern Recognition (ICPR'16), Cancun, Mexico, December 4-8, pp. 4023-4028, 2016 (corresponding author)

209. Longfei Qin, Palaiahnakote Shivakumara, Tong Lu*, Umapada Pal, Chew Lim Tan. Video scene text frames categorization for text detection and recognition. The 23th International Conference on Pattern Recognition (ICPR'16), Cancun, Mexico, December 4-8, pp. 3875-3880, 2016 (corresponding author)

210. Sangheeta Roy, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Chew Lim Tan. New tampered features for scene and caption text classification in video frame. The 15th International Conference on Frontiers in Handwriting Recognition (ICFHR'16), Shenzhen, China, October 23-26, pp. 36-41, 2016

211. K.S. Raghunandan, Palaiahnakote Shivakumara, B.J. Navya, G. Pooja, Navya Prakash, G. Hemantha Kumar, Umapada Pal, Tong Lu. Fourier coefficients for fraud handwritten document classification through age analysis. The 15th International Conference on Frontiers in Handwriting Recognition (ICFHR'16), Shenzhen, China, October 23-26, pp. 25-30, 2016

212. K.S. Raghunandan, Palaiahnakote Shivakumara, G. Hemantha Kumar, Umapada Pal, Tong Lu. New sharpness features for image type classification based on textual information. The 12th IAPR International Workshop on Document Analysis Systems (DAS'16), Santorini, Greece, April 11-14, pp. 204-209, 2016

2015

213. Yirui Wu, Tong Lu, Zehuan Yuan, Hao Wang. FreeScup: a novel platform for assisting sculpture pose design. International Conference on Multimedia and Expo (ICME'15), Torino, Italy, June 29-July 3, pp. 1-6, 2015 (corresponding author, oral, acceptance rate 15%)

214. Yirui Wu, Oscar Kin-Chung Au, Chiew-Lan Tai, Tong Lu. HIRM: a handle-independent reduced model for incremental mesh editing. The 9th International Conference on Geometric Modeling and Processing (GMP'15), Lugano, Switzerland, June 1-3, pp. 56-68, 2015 (corresponding author) Published time: May, 2015

215. Yu Zhang, Tong Lu. A fast color barcode detection method through cross identification on mobile platforms. The 13th International Conference on Document Analysis and Recognition (ICDAR'15), Nancy, France, August 23-26, pp. 416-420, 2015 (corresponding author, oral)

216. Xiaolong Liu, Tong Lu. Natural scene character recognition using Markov Random Field. The 13th International Conference on Document Analysis and Recognition (ICDAR'15), Nancy, France, August 23-26, pp. 396-400, 2015 (corresponding author)

217. Qisu Li, Tong Lu, Palaiahnakote Shivakumara, Umapada Pal, Chew Lim Tan. A new method based on bag of filters for character recognition in scene images by learning. The 13th International Conference on Document Analysis and Recognition (ICDAR'15), Nancy, France, August 23-26, pp. 391-395, 2015 (corresponding author)

218. Guozhu Liang, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan. A new Wavelet-Laplacian method for arbitrarily-oriented character segmentation in video text lines. The 13th International Conference on Document Analysis and Recognition (ICDAR'15), Nancy, France, August 23-26, pp. 9262-930, 2015 (corresponding author)

219. Yangbing Weng, Palaiahnakote Shivakumara, Tong Lu, Liang Kim Meng, Hon Hock Woon. A new multi-spectral fusion method for degraded video text frame enhancement. The 16th Pacific Rim Conference on Multimedia (PCM'15), Gwangju, South Korea, September 16-18, pp. 495-506, 2015 (corresponding author) Published time: December 30, 2015

220. Sangheeta Roy, Palaiahnakote Shivakumara, Prabir Mondal, R. Raghavendra, Umapada Pal, Tong Lu. A new multi-modal technique for bib number/text detection in natural images. The 16th Pacific Rim Conference on Multimedia (PCM'15), Gwangju, South Korea, September 16-18, pp. 483-494, 2015. Published time: December 30, 2015

221. Palaiahnakote Shivakumara, Guozhu Liang, Sangheeta Roy, Umapada Pal, Tong Lu. New texture-spatial features for keyword spotting in video images. The 3rd IAPR Asian Conference on Pattern Recognition (ACPR'15), Kuala Lumpur, Malaysia, November 3-6, pp. 391-395, 2015

222. Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan, Michael Blumenstein. Text detection in born-digital images by mass estimation. The 3rd IAPR Asian Conference on Pattern Recognition (ACPR'15), Kuala Lumpur, Malaysia, November 3-6, pp. 690-694, 2015 (corresponding author)

2014

223. Zehuan Yuan, Tong Lu, Palaiahnakote Shivakumara. A novel topic-level random walk framework for scene image co-segmentation. European Conference on Computer Vision (ECCV'14), Zurich, Switzerland, September 6-12, pp. 695-709, 2014 (corresponding author) Published time: April 29, 2015

Palaiahnakote Shivakumara

224. Zehuan Yuan, Tong Lu. A novel context-aware topic model for category discovery in natural scenes. The 12th Asian Conference on Computer Vision (ACCV'14), Singapore, Singapore, November 1-5, pp. 158-171, 2014 (corresponding author) Published time: April 29, 2015

225. Palaiahnakote Shivakumara, Mohamed Lubani, KokSheik Wong, Tong Lu. Optical flow based dynamic curved video text detection. The 21th IEEE International Conference on Image Processing (ICIP'14), Paris, France, October 27-30, pp. 1668-1672, 2014

226. Liang Wu, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan. Text detection using Delaunay triangulation in video sequence. The 11th IAPR International Workshop on Document Analysis Systems (DAS'14), Loire Valley, France, April 7-10, pp. 41-45, 2014 (corresponding author, anomination of Best Student Paper Award)

227. Hao Wang, Tong Lu, Oscar Kin-Chung Au, Chiew-Lan Tai. Spectral 3D mesh segmentation with a novel single segmentation field descriptor. The 8th International Conference on Geometric Modeling and Processing (GMP'14), Singapore, Jun4 29-July 1, pp. 440-456, 2014 (corresponding author) Published time : September, 2014

228. Tong Lu, Liang Wu, Xiaolin Ma, Palaiahnakote Shivakumara, Chew Lim Tan. Anomaly detection through spatio-temporal context modeling in crowded scenes. The 21th International Conference on Pattern Recognition (ICPR'14), Stockholm, Sweden, August 24-28, pp. 2203-2208, accepted, 2014 (corresponding author)

229. Tong Lu, Gongyou Wang, Yangbing Weng. Auditory movie summarization by detecting sound events and scene changes. The 21th International Conference on Pattern Recognition (ICPR'14), Stockholm, Sweden, August 24-28, pp. 756-760, accepted, 2014 (corresponding author)

230. Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu, Trung Quy Phan, Chew Lim Tan. Graphics and scene text classification in video. The 21th International Conference on Pattern Recognition (ICPR'14), Stockholm, Sweden, August 24-28, pp. 4714-4719, accepted, 2014 (corresponding author)

231. Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan. 2D and 3D video scene text classification. The 21th International Conference on Pattern Recognition (ICPR'14), Stockholm, Sweden, August 24-28, pp. 2932-2937, accepted, 2014 (corresponding author)

2013

232. Weichong Yin, Tong Lu, Feng Su. A novel multi-view object class detection framework for document image content analysis. The 12th International Conference on Document Analysis and Recognition (ICDAR'13), Washington, DC, USA, August 25-28, pp. 1095-1099, 2013 (corresponding author)

233. Trung Quay Phan, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan. Recognition of video text through temporal integration. The 12th International Conference on Document Analysis and Recognition (ICDAR'13), Washington, DC, USA, August 25-28, pp. 589-593, 2013

234. Feng Su, Tong Lu. Discriminative weighting and subspace learning for ensemble symbol recognition. The 12th International Conference on Document Analysis and Recognition (ICDAR'13), Washington, DC, USA, August 25-28, pp. 1088-1092, 2013

235. Yirui Wu, Tong Lu, Jiqiang Song. A real-time animation framework using Kinect. The 13th Pacific-Rim Conference on Multimedia (PCM'13), Nanjing, China, December 13-16, pp. 245-256, 2013 (corresponding author) Published time: November 5, 2013

236. Feiming Xu, Tong Lu, Yirui Wu. Robust object tracking using motion context in crowded scenes. The 13th Pacific-Rim Conference on Multimedia (PCM'13), Nanjing, China, December 13-16, pp. 550-560, 2013 (corresponding author) Published time: November 5, 2013

232. Hao Wang, Tong Lu, et al. Recognition and reconstruction from complex line drawings. The 10th IAPR International Workshop on Graphics Recognition (GREC'13), Bethlehem, PA, USA, August 20-21, 2013 (corresponding author)

2012

237. Wanxia Lin, Tong Lu, Feng Su. A novel multi-view integration and propagation model for cross-media information retrieval. The 18th International Conference on Multimedia Modeling (MMM'12), Klagenfurt, Austria, January 4-6, pp. 740-749, 2012 (corresponding author) Published time: December 21, 2011

238. Xiaolin Ma, Tong Lu, Feiming Xu, Feng Su. Anomaly detection with spatio-temporal context using depth images. The 21th International Conference on Pattern Recognition (ICPR'12), Turkuba, Japan, November 11-15, pp. 2590-2593, 2012 (corresponding author)

239. Feng Su, Yang Li, Tong Lu. Ensemble symbol recognition with Hough forest. The 21th International Conference on Pattern Recognition (ICPR'12), Turkuba, Japan, November 11-15, pp. 1659-1663, 2012

240. Zehuan Yuan, Tong Lu, Haojuan Zhou, Bin Chen, Jianing Li. Incremental 3D reconstruction using Bayesian learning. The 25th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE'12), Dalian, China, June 9-12, pp. 754-763, 2012 (corresponding author, Best Paper Award) Published time: July 7, 2012

241. Yukang Jin, Tong Lu, Feng Su. Movie keyframe retrieval based on cross-media correlation detection and context model. The 25th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE'12), Dalian, China, June 9-12, pp. 816-825, 2012 (corresponding author) Published time: July 7, 2012

2011

242. Limin Wang, Yirui Wu, Tong Lu, Kang Chen. Multiclass object detection by combining local appearances and context. The 19th International Conference on Multimedia (ACM Multimedia'11), Scottsdale, AZ, USA, November 28-December 1, pp. 1161-1164, 2011 (corresponding author)

243. Feng Su, Yang Li, Tong Lu, Gongyou Wang. Environmental sound classification for scene recognition using local discriminant bases and HMM. The 19th International Conference on Multimedia (ACM Multimedia'11), Scottsdale, AZ, USA, November 28-December 1, pp. 1389-1392, 2011

244. Yang Zhao, Tong Lu, Wujun Liao. A robust color-independent text detection method from complex videos. The 11th International Conference on Document Analysis and Recognition (ICDAR'11), Beijing, China, September 18-21, pp. 374-378, 2011 (corresponding author)

245. Feng Su, Tong Lu, Ruoyu Yang. Symbol recognition by multiresolution shape context matching. The 11th International Conference on Document Analysis and Recognition (ICDAR'11), Beijing, China, September 18-21, pp. 1319-1323, 2011

246. Yan Zhao, Zhaokang Wang, Tong Lu, et al. Real-time video caption detection. The 9th IAPR International Workshop on Graphics Recognition (GREC'11), Seoul, Korea, September 15-16, pp. 150-153, 2011 (corresponding author)

2010

247. Yimin Wang, Tong Lu, Rongjun Gao, Wenyin Liu. 3D model comparison through kernel density matching. The 20th International Conference on Pattern Recognition (ICPR'10), Istanbul, Turkey, August 23-26, pp. 3159-3162, 2010 (corresponding author)

248. Feng Su, Tong Lu, Ruoyu Yang. Symbol recognition combining vectorial and pixel-level features for line drawings. The 20th International Conference on Pattern Recognition (ICPR'10), Istanbul, Turkey, August 23-26, pp. 1892-1895, 2010

249. Tong Lu, Rongjun Gao, Tuantuan Wang, Yubin Yang. 3D similarity search using a weighted structural histogram representation. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 348-356, 2010 (corresponding author) Published time: November 4, 2010

250. Tuantuan Wang, Tong Lu, Wenyin Liu. Robust shape retrieval through a novel statistical descriptor. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 330-337, 2010 (corresponding author) Published time: November 4, 2010

251. Zengyu Zhang, Tong Lu, Feng Su, Ruoyu Yang. A new text detection algorithm for content-oriented line drawing image retrieval. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 338-347, 2010 (corresponding author) Published time: November 4, 2010

252. Limin Wang, Yirui Wu, Ziyuan Tian, Zailiang Sun, Tong Lu. A novel approach for robust surveillance video content abstraction. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 660-671, 2010 (corresponding author) Published time: November 4, 2010

253. Feng Su, Tong Lu, Ruoyu Yang. A new shape descriptor for object recognition and retrieval. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 493-502, 2010. Published time: November 4, 2010

254. Ruoyu Yang, Feng Su, Tong Lu. Research of the structural-learning-based symbol recognition mechanism for engineering drawings. The 6th International Conference on Digital Content, Multimedia Technology and its Applications (IDC'10), Seoul Korea, August 16-18, pp. 346-349, 2010

2009

255. Tong Lu, Yubing Yang, Feng Su, Zengxin Sun. Semi-automatic roof reconstruction. The 10th International Conference on Document Analysis and Recognition (ICDAR'09), Barcelona, Spain, July 26-29, pp. 723-727, 2009 (corresponding author)

256. Feng Su, Tong Lu, Ruoyu Yang, Shijie Cai, Yubing Yang. A character segmentation method for engineering drawings based on holistic and contextual constraints. The 8th IAPR International Workshop on Graphics Recognition (GREC'09), La Rochelle, France, July 22-13, pp. 280-287, 2009

257. Yubing Yang, Wei Wei, Tong Lu, Yang Gao. 3D scene analysis using UIMA framework. The 22th International Conference on Industrial, Engineering and Other Applied Intelligent Systems (IEA/AIE'09), Taiwan, China, June 24-27, pp. 369-378, 2009. Published time: July 14,2009

258. Yubing Yang, Jinjie Lin, Tong Lu. Saliency regions for 3D mesh abstraction. The 9th Pacific-Rim Conference on Multimedia (PCM'09), Bangkok, Thailand, December 15-18, pp. 292-299, 2009. Published time: January 13, 2010

259. Feng Su, Tong Lu, Yubing Yang, Shijie Cai. Text separation from engineering drawings. IAPR International Workshop on Graphics Recognition (GREC'09), La Rochelle, France, July 22-13, pp. 280-287, 2009