Email:
lutong@nju.edu.cn
Address:
National Key Laboratory for Novel Software Technology
Department of Computer Science and Technology
Xianlin Campus Mailbox 603, Nanjing University
163 Xianlin Avenue, Qixia District
Nanjing 210023, China
Phone:
0086-25-8968-2398
Fax:
0086-25-8968-2398
URL:
http://cs.nju.edu.cn/lutong/

Selected International Journal Papers


1. Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai. BEVFormer: learning bird's-eye-view representation from multi-camera images via spatiotemporal transformers. IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear, 2025 (corresponding author, SCI, journal on Nanjing University's disciplinary excellence list)

2. Yangzhou Liu, Yue Cao, Zhangwei Gao, Weiyun Wang, Zhe Chen, Wenhai Wang, Hao Tian, Lewei Lu, Xizhou Zhu, Tong Lu, Yu Qiao, Jifeng Dai. MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity. SCIENCE CHINA Information Sciences, to appear, 2024 (SCI)

3. Tao Wang, Kaihao Zhang, Ziqian Shao, Wenhan Luo, Bjorn Stenger, Tong Lu, Tae-Kyun Kim, Wei Liu, Hongdong Li. GridFormer: residual dense transformer with grid structure for image restoration in adverse weather conditions. International Journal of Computer Vision, to appear, 2024 (SCI)

4. Tao Wang, Guangpin Tao, Wanglong Lu, Kaihao Zhang, Wenhan Luo, Xiaoqin Zhang, Tong Lu*. Restoring vision in hazy weather with hierarchical contrastive learning. Pattern Recognition, to appear, 2024 (corresponding author, SCI)

5. Palaiahnakote Shivakumara, Maryam Asadzadeh Kaljahi, Swati Kanchan, Umapada Pal, Daniel Lopresti, Tong Lu. A robust script independent handwriting system for gender identification. Expert Systems with Applications, to appear, 2024 (corresponding author, SCI)

6. Palaiahnakote Shivakumara, Ayan Banerjee, Lokesh Nandanwar, Umapada Pal, Apostolos Antonacopoulos, Tong Lu, Michael Blumenstein. A new deep CNN for 3D text localization in the wild through shadow removal. Computer Vision and Image Understanding, to appear, 2023 (corresponding author, SCI)

7. Palaiahnakote Shivakumara, Ayan Banerjee, Umapada Pal, Lokesh Nandanwar, Tong Lu, Cheng-Lin Liu. A new language-independent deep CNN for scene text detection and style transfer in social media images. IEEE Transactions on Image Processing, to appear, 2023

8. Min Yang, Guo Chen, Yindong Zheng, Tong Lu, Limin Wang. BasicTAD: an astounding RGB-only baseline for temporal action detection. Computer Vision and Image Understanding, to appear, 2023

9. Ruoze Liu, Yangjie Shen, Yang Yu, Tong Lu. Revisiting of AlphaStar. IEEE Transactions on Games, to appear, 2023

10. Wenhai Wang, Enze Xie, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu*, Chunhua Shen. PAN++: towards efficient and accurate arbitrary-shaped text spotting. IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear, 2022 (corresponding author, SCI, journal on Nanjing University's disciplinary excellence list)

11. Haoran Zhou, Honghua Chen, Yingkui Zhang, Mingqiang Wei, Haoran Xie, Jun Wang, Tong Lu*, Jing Qin, Xiao-ping Zhang. Refine-Net: normal refinement neural network for noisy point clouds. IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear, 2022 (co-corresponding author, SCI, journal on Nanjing University's disciplinary excellence list)


12. Lixin Yuan, Guoqiang Yang, Qian Xu, Tong Lu*. Discriminative feature selection with directional outliers correcting for data classification. Pattern Recognition, to appear, 2022 (corresponding author, SCI)

13. Minglei Yuan, Chunhao Cai, Tong Lu*, Yirui Wu, Qian Xu, Shijie Zhou. A novel forget-update module for few-shot domain generalization. Pattern Recognition, to appear, 2022 (corresponding author, SCI)

14. Lokesh Nandanwar, Palaiahnakote Shivakumara, Hamid A. Jalab, Rabha W. Ibrahim, Ramachandra Raghavendra, Umapada Pal, Tong Lu, Michael Blumenstein. A conformable moments-based deep learning system for forged handwriting detection. IEEE Transactions on Neural Networks and Learning Systems, to appear, 2022 (corresponding author, SCI)

15. Yanwen Lu, Wenliang Ma, Xiang Dong, Mackenzie Brown, Tong Lu*, Weidong Gan*. Differentiate Xp11.2 translocation renal cell carcinoma from computed tomography images and clinical data with ResNet-18 CNN and XGBoost. Computer Modeling in Engineering & Sciences, to appear, 2022 (corresponding author, SCI)

16. Zhiheng Huang, Palaiahnakote Shivakumara, Maryam Asadzadeh Kaljahi, Ahlad Kumar, Umapada Pal, Tong Lu*, Michael Blumenstein. Writer age estimation through handwriting. Multimedia Tools and Applications, to appear, 2022 (corresponding author, SCI)

17. Ruoze Liu, Zhenjia Pang, Zhouyu Meng, Wenhai Wang, Yang Yu, Tong Lu. On efficient reinforcement learning for full-length game of StarCraft II. Journal of Artificial Intelligence Research, to appear, 2022 (corresponding author, SCI)

18. Palaiahnakote Shivakumara, Tanmay Jain, Umapada Pal, Nitish Surana, Apostolos Antonacopoulos, Tong Lu*. Text line segmentation from struck-out handwritten document images. Expert Systems with Applications, to appear, 2022 (corresponding author, SCI)

19. Wenhai Wang, Enze Xie, Xiang Li, Deng-ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao. Improved baselines with pyramid vision transformer. Computational Visual Media, to appear, 2022 (SCI)

20. Lokesh Nandanwar, Palaiahnakote Shivakumara, Divya Krishnani, Raghavendra Ramachandra, Tong Lu, Umapada Pal, Mohan Kankanhalli. A new foreground and background based method for behavior-oriented social media images classification. ACM Transactions on Multimedia Computing Communications and Applications, to appear, 2021 (SCI)

21. Lokesh Nandanwar, Palaiahnakote Shivakumara, Raghavendra Ramachandra, Tong Lu, Umapada Pal, Apostolos Antonacopoulos, Yue Lv. A new deep wavefront based model for text localization in 3D video. IEEE Transactions on Circuits and Systems for Video Technology, to appear, 2021 (SCI)

22. Lokesh Nandanwar, Palaiahnakote Shivakumara, Divya Krishnani, Raghavendra Ramachandra, Tong Lu, Umapada Pal, Mohan Kankanhalli. An episodic learning network for text detection on human bodies in sports images. IEEE Transactions on Circuits and Systems for Video Technology, to appear, 2021 (SCI)

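23. Wenhai Wang, Zhiqi Li, Tong Lu*. A single-stage instance segmentation method based on grid partitioning (in Chinese). Journal of Software, to appear, 2021 (SCI)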

24. Pinaki Nath Chowdhury, Palaiahnakote Shivakumara, Lokesh Nandanwar, Faizal Samiron, Umapada Pal, Tong Lu. Oil palm tree counting in drone images. Pattern Recognition Letters, to appear, 2021 (SCI)

25. Ayush Mittal, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein. A new method for detection and prediction of occluded text in natural scene images. Signal Processing: Image Communication, to appear, 2021 (SCI)

26. Abhra Chaudhuri, Palaiahnakote Shivakumara, Pinaki Nath Chowdhury, Umapada Pal, Tong Lu, Daniel Lopresti, G. Hemantha Kumar. Deep action-oriented video image classification system for text detection and recognition. SN Applied Sciences, to appear, 2021 (SCI)

27. Ruoze Liu, Haifeng Guo, Xiaozhong Ji, Yang Yu*, Zhenjia Pang, Zitai Xiao, Yuzhou Wu, Tong Lu*. Efficient reinforcement learning for StarCraft by abstract forward models and transfer learning. IEEE Transactions on Games, to appear, 2021 (SCI)

28. Tapan Karnik, Palaiahnakote Shivakumara, Pinaki Nath Chowdhury, Umapada Pal, Tong Lu*, Nor Badrul Anuar. A new deep model for family and non-family photo identification. Multimedia Tools and Applications, to appear, 2021 (SCI)

29. Lokesh Nandanwar, Palaiahnakote Shivakumara, Prabi Mondal, K. S. Raghunandan, Umapada Pal, Tong Lu, Daniel Lopresti. Forged text detection in video, scene and document images. IET Image Processing, to appear, 2021 (SCI)

30. Zhiheng Huang, Palaiahnakote Shivakumara, Tong Lu*, Umapada Pal, Michael Blumenstein, Bhaarat Chetty, G. H. Kumar. Improved ring radius transform-based reconstruction for video character recognition. International Journal of Pattern Recognition and Artificial Intelligence, to appear, 2021 (SCI)

31. Abhra Chaudhuri, Palaiahnakote Shivakumara, Pinaki Nath Chowdhury, Umapada Pal, Tong Lu, Daniel Lopresti, G. Hemantha Kumar. Deep action-oriented video images classification system for text detection and recognition. SN Applied Sciences, to appear, 2021 (SCI)

32. Yin-Dong Zheng, Zhaoyang Liu, Tong Lu, Limin Wang. Dynamic sampling networks for efficient action recognition in videos. IEEE Transactions on Image Processing, to appear, 2020 (SCI)

33. Minglong Xue, Palaiahnakote Shivakumara, Chao Zhang, Yao Xiao, Tong Lu*, Umapada Pal, Daniel Lopresti. Arbitrarily-oriented text detection in low light natural scene images. IEEE Transactions on Multimedia, to appear, 2020 (corresponding author)

34. Divya Krishnani, Palaiahnakote Shivakumara, Tong Lu, Umapada Pal, Daniel Lopresti, Govindaraju Hemantha Kumar. A new context-based feature for classification of emotions in photographs. Multimedia Tools and Applications, to appear, 2020

35. Lokesh Nandanwar, Palaiahnakote Shivakumara, Swati Kanchan, V. Basavaraja, D.S. Guru, Umapada Pal, Tong Lu, Michael Blumenstein. DCT-phase statistics for forged IMEI numbers and air ticket detection. Expert Systems with Applications, to appear, 2020 (corresponding author, SCI impact factor: 5.452)

36. Pinaki Nath Chowdhury, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein. A new augmentation-based method for text detection in night and day license plate images. Multimedia Tools and Applications, 79(43-44):1-28, 2020 (corresponding author)

37. Pinaki Nath Chowdhury, Palaiahnakote Shivakumara, Hamid A. Jalab, Rabha W. Ibrahim, Umapada Pal, Tong Lu. A new fractal series expansion based enhancement model for license plate recognition. Signal Processing: Image Communication, to appear

38. Shengkai Yue, Minglei Yuan, Tong Lu, Palaiahnakote Shivakumara, Michael Blumenstein, Jie Shi, G. Hemantha Kumar. Rotation invariant angle-density based features for an ice image classification system. Expert Systems with Applications, to appear, 2020 (corresponding author, SCI impact factor: 5.452)

39. Sauradip Nag, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu*, Michael Blumenstein. A new unified method for detecting text from Marathon runners and sports players in video. Pattern Recognition, to appear, 2020 (corresponding author, SCI)

40. Soumyadip Roy, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu*, Govindaraj Hemantha Kumar. Delaunay triangulation based text detection from multi-view images of natural scene. Pattern Recognition Letters, 129:92-100, 2019 (corresponding author, SCI)

41. Minglong Xue, Palaiahnakote Shivakumara, Chao Zhang, Tong Lu*, Umapada Pal. Curved text detection in blurred/non-blurred video/scene images. Multimedia Tools and Applications, 78(18):25629-25653, 2019 (corresponding author, SCI)

42. Maryam Asadzadeh Kaljahi, Palaiahnakote Shivakumara, Tiangping Hu, Hamid A. Jalab, Rabha W. Ibrahim, Michael Blumenstein, Tong Lu, Mohamad Nizam Bin Ayub. A geometric and fractional entropy-based method for family photo classification. Expert Systems with Applications, Online publication, 2019 (SCI impact factor: 5.452)

43. Yirui Wu, Yuechao He, Palaiahnakote Shivakumara, Ziming Li, Hongxin Guo, Tong Lu. Channel-wise attention model based fire and rating level detection in video. CAAI Transactions on Intelligence Technology, 4(2):117-121, 2019

44. Vijeta Khare, Palaiahnakote Shivakumara, Chee Seng Chan, Tong Lu*, Liang Kim Meng, Hon Kock Woon, Michael Blumenstein. A novel character segmentation-reconstruction approach for license plate recognition. Expert Systems with Applications, 131:219-239, 2019 (corresponding author, SCI impact factor: 5.452)

45. Maryam A. Kaljahi, Palaiahnakote Shivakumara, Mohd Y.I. Idris, Mohammad H. Anisi, Tong Lu*, Michael Blumenstein, Noorzaily Mohamed Noor. An automatic zone detection system for safe landing of UAVs. Expert Systems with Applications, 122:319-333, 2019 (SCI impact factor: 5.452)

46. Palaiahnakote Shivakumara, Dongqi Tang, Maryam Asadzadehkaljahi, Tong Lu*, Umapada Pal, Mohammad Hossein Anisi. A CNN-RNN based method for license plate recognition. CAAI Transactions on Intelligence Technology, 3(3):169-175, 2018

47. K.S. Raghunandan, Palaiahnakote Shivakumara, Lolika Padmanabhan, G. Hemantha Kumar, Tong Lu*, Umapada Pal. New symmetry features for license plate classification. CAAI Transactions on Intelligence Technology, 3(3):176-183, 2018

48. K. S. Raghunandan, Palaiahnakote Shivakumara, Sangheeta Roy, G. Hemantha Kumar, Umapada Pal, Tong Lu*. Multi-script-oriented text detection and recognition in video/scene/born digital images. IEEE Transactions on Circuits and Systems for Video Technology, 29(4):1145-1162, 2018 (corresponding author, SCI)

49. Sangheeta Roy, Palaiahnakote Shivakumara, Namita Jain, Vijeta Khare, Anjan Dutta, Umapada Pal, Tong Lu*. Rough-fuzzy based scene categorization for text detection and recognition in video. Pattern Recognition, 80:64-82, 2018 (corresponding author, SCI)

50. Yirui Wu, Zhouyu Meng, Palaiahnakote Shivakumara, Tong Lu. Compressive sensing based convolutional neural network for object detection. Malaysian Journal of Computer Science, 33(1):78-89, 2018

51. K. S Raghunandan, Palaiahnakote Shivakumara, Hamid A. Jalab, Rabha W. Ibrahim, G. Hemantha Kumar, Umapada Pal, Tong Lu*. Riesz fractional based model for enhancing license plate detection and recognition. IEEE Transactions on Circuits and Systems for Video Technology, 28(9):2276-2288, 2018 (corresponding author, SCI)

52. Zehuan Yuan, Tong Lu*, Chew Lim Tan. Learning discriminated and correlated patches for multi-view object detection using sparse coding. Pattern Recognition, 69:26-38, 2017 (corresponding author, SCI)

53. Palaiahnakote Shivakumara, Liang Wu, Tong Lu, Chew Lim Tan, Michael Blumenstein, Basavaraj S. Anami. A fractal-based multi-oriented text detection system for recognition in mobile video images. Pattern Recognition, 68:158-174, 2017 (SCI)

54. Yirui Wu, Tong Lu*, Zehuan Yuan, Hao Wang. FreeScup: a novel platform for assisting sculpture pose design. IEEE Transactions on Multimedia, 19(1):183-195, 2017 (corresponding author, SCI)


55. Aladhahalli Shivegowda Kavitha, Palaiahnakote Shivakumara, Govindaraj Hemantha Kumar, Tong Lu. A new watershed model based system for character segmentation in degraded text lines. International Journal of Electronics and Communications, 71:45-52, 2016 (SCI)

56. Sounka Dey, Palaiahnakote Shivakumara, K. S. Raghunandan, Umapada Pal, Tong Lu*, G. Hemantha Kumar, Chee Chen Chan. Script independent approach for multi-oriented text detection in scene image. Neurocomputing, 242:96-112, 2016 (corresponding author, SCI)

57. Yirui Wu, Palaiahnakote Shivakumara, Tong Lu*, Chew Lim Tan, Michael Blumenstein, G. Hemantha Kumar. Contour restoration of text components for recognition in video/scene images. IEEE Transactions on Image Processing, 25(12):5622-5634, 2016 (corresponding author, SCI) 

58. Palaiahnakote Shivakumara, R. Raghavendra, Longfei Qin, Kiran. B. Raja, Tong Lu*, Umapada Pal. A new multi-modal approach to bib number/text detection and recognition in Marathon images. Pattern Recognition, 61:479-491, 2016 (corresponding author, SCI) 

59. Zehuan Yuan, Hao Wang, Limin Wang, Tong Lu*, Palaiahnakote Shivakumara, Chew Lim Tan. Modeling spatial layout for scene image understanding via a novel multiscale sum-product network. Expert Systems with Applications, 63:231-240, 2016 (impact factor 5.452, corresponding author, SCI)

60. A.S. Kavitha, Palaiahnakote Shivakumara, G. H. Kumar, Tong Lu. Text segmentation in degraded historical document images. Egyptian Informatics Journal, 17(2):189-197, 2016

61. Hao Wang, Tong Lu*, Yiming Wang, Palaiahnakote Shivakumara, Chew Lim Tan. Weakly-supervised region annotation for understanding scene images. Multimedia Tools and Applications, 75(6):3027-3051, 2016 (corresponding author, SCI)

62. Guozhu Liang, Palaiahnakote Shivakumara, Tong Lu*, Chew Lim Tan. Multi-spectral fusion based approach for arbitrarily-oriented scene text detection in video images. IEEE Transactions on Image Processing, 24(11):4488-4501, 2015 (corresponding author, SCI)


63. Liang Wu, Palaiahnakote Shivakumara, Tong Lu*, Chew Lim Tan. A new technique for multi-oriented scene text lines detection and tracking in video. IEEE Transactions on Multimedia, 17(8):1137-1152, 2015 (corresponding author, SCI)

64. Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu*, Chew Lim Tan, Seiichi Uchida. A new method for multi oriented graphics-scene-3D text classification in video. Pattern Recognition, 49(1):19-42, 2015 (corresponding author, SCI)

65. Sangheeta Roy, Palaiahnakote Shivakumara, Hamid A. Jalab, Rabha W. Ibrahim, Umapada Pal, Tong Lu. Fractional poisson enhancement model for text detection and recognition in video images. Pattern Recognition, 52:433-447, 2015 (corresponding author, SCI)

66. Yirui Wu, Oscar Kin-Chung Au, Chiew-Lan Tai, Tong Lu*. HIRM: a handle-independent reduced model for incremental mesh editing. Computer-Aided Geometric Design, 35(36):56-68, 2015 (corresponding author, SCI)

67. Palaiahnakote Shivakumara, Zehuan Yuan, Danni Zhao, Tong Lu, Chew Lim Tan. New gradient-spatial-structural features for video script identification. Computer Vision and Image Understanding, 130:35-53, 2015 (SCI)

68. Yirui Wu, Palaiahnakote Shivakumara, Wei Wang, Tong Lu*, Umapada Pal. A new ring radius transform based thinning method for multi-oriented video characters. International Journal on Document Analysis and Recognition, 18(2):137-151, 2015 (corresponding author, SCI)

69. Sangheeta Roy, Palaiahnakote Shivakumara, Partha Pratim Roy, Umapada Pal, Chew Lim Tan, Tong Lu. Bayesian classifier for multi-oriented video text recognition system. Expert Systems with Applications, 42(13):5554-5566, 2015 (impact factor 5.452, SCI)

70. Shangxuan Tian, Palaiahnakote Shivakumara, Trung Quy Phan, Tong Lu, Chew Lim Tan. Character shape restoration system through medial axis points in video. Neurocomputing, 161(5):183-198, 2015 (SCI)

71. Tong Lu, Gongyou Wang, Feng Su. Context-based environmental audio event recognition for scene understanding. Multimedia Systems, 21(5):507-524, 2014 (corresponding author, SCI)

72. Tong Lu*, Yukang Jin, Feng Su, Palaiahnakote Shivakumara, Chew Lim Tan. Content-oriented multimedia document understanding through cross-media correlation. Multimedia Tools and Applications, 74(18):8105-8135, 2014 (corresponding author, SCI)

73. Hao Wang, Tong Lu*, Oscar Kin-Chung Au, Chiew-Lan Tai. Spectral 3D mesh segmentation with a novel single segmentation field descriptor. Graphical Models, 76(5):440-456, 2014 (corresponding author, SCI)


74. Zehuan Yuan, Tong Lu. Incremental 3D reconstruction using Bayesian learning. Applied Intelligence, 39(4):761-771, 2013 (SCI)


75. Wenyin Liu, Tong Lu*, Yajie Yu, Liang Shuang, Rui Zhang. Online stroke segmentation by quick penalty-based dynamic programming. IET Computer Vision, 7(5):311-319, 2013 (corresponding author, SCI)

76. Tong Lu, Chiew-Lan Tai, Huafei Yang, Shijie Cai. A novel knowledge-based system for interpreting complex engineering drawings: theory, representation and implementation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(8):1444-1457, 2009 (corresponding author, SCI, impact factor: 9.455)

77. Tong Lu, Huafei Yang, Ruoyu Yang, Shijie Cai. Automatic analysis and integration of architectural drawings. International Journal on Document Analysis and Recognition, 9(1):31-47, 2007 (corresponding author, SCI)

78. Tong Lu, Chiew-Lan Tai, Feng Su, Shijie Cai. A new recognition model for electronic architectural drawings. Computer-Aided Design, 37(10):1053-1069, 2005 (corresponding author, SCI)

79. Tong Lu, Chiew-Lan Tai, Li Bao, Feng Su, Shijie Cai. 3D reconstruction of detailed buildings from architectural drawings. Computer-Aided Design and Applications, 2(1-4):527-536, 2005 (corresponding author)  


Books

80. Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Tapabrata Chakraborti, Michael Blumenstein. A new roadmap for evaluating descriptive handwritten answer type. Advances in Networks, Security and Communications: Reviews, vol. 2, Book Series (Book Chapter), IFSA Publishing, Barcelona, Spain, to appear, 2018

81. Palaiahnakote Shivakumara, Umapada Pal, Sangheeta Roy, Tong Lu, Michael Blumenstein. Identification of superimposed and scene text in video frames. Book Chapter, to appear, 2017

82. Tong Lu, Palaiahnakote Shivakumara, Chew Lim Tan, Wenyin Liu. Developments of computer vision and pattern recognition: video text detection. Springer London, 2014 (ISBN 978-1-4471-6514-9, corresponding author)

83. Tong Lu, Wenyin Liu. Handbook of document image processing and recognition. Book Chapter. Springer New York, 2014 (ISBN 0857298607, corresponding author)


Selected International Conference Papers


  2025


84. Yirui Wu, Yuhang Xia, Hao Li, Lixin Yuan, Junyang Chen, Jun Liu, Tong Lu, Shaohua Wan. Deconfound semantic shift and incompleteness in incremental few-shot semantic segmentation. The 39th AAAI Conference on Artificial Intelligence (AAAI'25), Philadelphia, Pennsylvania, USA, Feb 25 - March 4, 2025


  2024

85. Zhiqi Li, Zhiding Yu, Shiyi Lan, Jiahan Li, Jan Kautz, Tong Lu, Jose M. Alvarez. Is ego status all you need for open-loop end-to-end autonomous driving? IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'24), Seattle, WA, June 17-21, 2024

86. Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Zhong Muyan, Qing-long Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai. InternVL: scaling up vision foundation models and aligning for generic visual-linguistic tasks. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'24), Seattle, WA, June 17-21, 2024

87. Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai. Efficient deformable ConvNets: rethinking dynamic and sparse operator for vision applications. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'24), Seattle, WA, June 17-21, 2024

88. Yi Rong, Haoran Zhou, Kang Xia, Cheng Mei, Jiahao Wang, Tong Lu*. RepKPU: point cloud upsampling with kernel point representation and deformation. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'24), Seattle, WA, June 17-21, 2024

89. Jiannan Wu, Muyan Zhong, Sen Xing, Zeqiang Lai, Zhaoyang Liu, Wenhai Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Ping Luo, Yu Qiao, Jifeng Dai. VisionLLM v2: an end-to-end generalist multimodal large language model for hundreds of vision-language tasks. Thirty-Eighth Annual Conference on Neural Information Processing Systems (NIPS'24), Vancouver, Canada, Dec 10-15, 2024

90. Yi Rong, Haoran Zhou, Lixin Yuan, Cheng Mei, Jiahao Wang, Tong Lu*. CRA-PCN: point cloud completion with intra- and inter-level cross-resolution transformers. The 38th AAAI Conference on Artificial Intelligence (AAAI'24), Vancouver, Canada, Feb 20-Feb 27, 2024

91. Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu*. AVSegFormer: audio-visual segmentation with transformer. The 38th AAAI Conference on Artificial Intelligence (AAAI'24), Vancouver, Canada, Feb 20-Feb 27, 2024

92. Guangchen Shi, Wei Zhu, Yirui Wu, Huaidan Zhao, Kang Zheng, Tong Lu*. Few-shot semantic segmentation via perceptual attention and spatial control. ACM Multimedia 2024 (ACM MM'24), Melbourne, Australia, Oct 28-Nov 1, 2024

93. Xuanxi Chen, Tong Lu*. SVT: spectral video transformer for video restoration in under-display camera. The 2024 IEEE International Conference on Multimedia and Expo (ICME'24), Niagara Falls, Canada, July 15-19, 2024

94. Wei Zhu, Yicheng Liu, Yuping He, Tangfei Liao, Kang Zheng, Xiaoqiu Xu, Tao Wang, Tong Lu. CorrAdaptor: adaptive local context learning for correspondence pruning. The 27th European Conference on Artificial Intelligence (ECAI'24), Santiago de Compostela, Oct 19-24, 2024

  2023
  

95. Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai. VisionLLM: large language model is also an open-ended decoder for vision-centric tasks. The Thirty-seventh Conference on Neural Information Processing Systems (NIPS'23), New Orleans, USA, Dec 12-16, 2023

96. Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao. InternImage: exploring large-scale vision foundation models with deformable convolutions. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'23), Vancouver, Canada, June 18-June 22, 2023 (Highlight)

97. Zhiqi Li, Zhiding Yu, Wenhai Wang, Animashree Anandkumar, Tong Lu, Jose Alvarez. FB-BEV: BEV representation from forward-backward view transformations. International Conference on Computer Vision (ICCV'23), Paris, France, Oct 2-6, 2023

98. Jiahao Wang, Guo Chen, Yifei Huang, Limin Wang, Tong Lu. Memory-and-anticipation transformer. International Conference on Computer Vision (ICCV'23), Paris, France, Oct 2-6, 2023

99. Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo. DDP: diffusion model for dense visual prediction. International Conference on Computer Vision (ICCV'23), Paris, France, Oct 2-6, 2023

100. Tao Wang, Kaihao Zhang, Tianrun Shen, Wenhan Luo, Bjorn Stenger, Tong Lu*. Ultra-high-definition low-light image enhancement: a benchmark and transformer-based method. The 37th AAAI Conference on Artificial Intelligence (AAAI'23), Washington, DC, USA, Feb 7-Feb 14, 2023

101. Zhe Chen, Hao Tan, Tao Wang, Tianrun Shen, Tong Lu*, Qiuying Peng, Cheng Cheng, Yue Qi. Graph propagation transformer for graph representation learning. The 32nd International Joint Conference on Artificial Intelligence (IJCAI'23), Macao, S.A.R., Aug 19-25, 2023

102. Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu*, Jifeng Dai, Yu Qiao. Vision transformer adapter for dense predictions. The 11th International Conference on Learning Representations (ICLR'23), Kigali, Rwanda, May 1-5, 2023 (Spotlight paper)

103. Yindong Zheng, Guo Chen, Minglei Yuan, Tong Lu*. MRSN: multi-relation support network for video action detection. The 2023 IEEE International Conference on Multimedia and Expo (ICME'23), Brisbane, Australia, July 10-14, 2023

104. Guo Chen, Yindong Zheng, Zhe Chen, Jiahao Wang, Tong Lu*. ELAN: enhancing temporal action detection with location awareness. The 2023 IEEE International Conference on Multimedia and Expo (ICME'23), Brisbane, Australia, July 10-14, 2023


   2022

  

105. Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Animashree Anandkumar, Jose M. Alvarez, Tong Lu*, Ping Luo. Panoptic SegFormer: delving deeper into panoptic segmentation with transformers. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'22), New Orleans, Louisiana, June 21-June 24, 2022

106. Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai. BEVFormer: learning bird-eye-view representations from multi-view images via spatiotemporal transformer. European Conference on Computer Vision (ECCV'22), 2022


107. Guo Chen, Yindong Zheng, Limin Wang, Tong Lu*. DCAN: improving temporal action detection via dual context aggregation. The 36th AAAI Conference on Artificial Intelligence (AAAI'22), Vancouver, BC, Canada, Feb 22-March 1, 2022

108. Guangchen Shi, Yirui Wu, Jun Liu, Wenhai Wang, Tong Lu. Incremental few-shot semantic segmentation via embedding adaptive-update and hyper-class representation. ACM Multimedia (ACM MM'22), Lisbon, Portugal, Oct 10-14, 2022

109. Haoran Zhou, Yun Cao, Wenqing Chu, Junwei Zhu, Tong Lu*, Ying Tai, Chengjie Wang. SeedFormer: patch seeds based point cloud completion with upsample transformer. European Conference on Computer Vision (ECCV'22), 2022

110. Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu*, Ping Luo. Towards ultra-resolution neural style transfer via thumbnail instance normalization. The 36th AAAI Conference on Artificial Intelligence (AAAI'22), Vancouver, BC, Canada, Feb 22-March 1, 2022



   2021

111. Guangpin Tao, Xiaozhong Ji, Wenzhuo Wang, Shuo Chen, Chuming Lin, Yun Cao, Tong Lu*, Donghao Luo, Ying Tai. Spectrum-to-kernel translation for accurate blind image super-resolution. Thirty-fifth Conference on Neural Information Processing Systems (NIPS'21), Dec 6-14, 2021

112. Wenhai Wang, Enze Xie, Xiang Li, Dengping Fan, Ding Liang, Tong Lu*, Ping Luo, Ling Shao. Pyramid vision transformer: a versatile backbone for dense prediction without convolutions. International Conference on Computer Vision (ICCV'21), Oct 11-17, 2021 (oral, top-10 Most Influential ICCV 2021 Papers)


113. Haoran Zhou, Yidan Feng, Mingsheng Fang, Mingqiang Wei, Jing Qin, Tong Lu*. Adaptive graph convolution for point cloud analysis. International Conference on Computer Vision (ICCV'21), Oct 11-17, 2021


114. Zhaoyang Liu, Limin Wang, Wayne Wu, Chen Qian, Tong Lu. TAM: temporal adaptive module for video recognition. International Conference on Computer Vision (ICCV'21), Oct 11-17, 2021

115. Xiaozhong Ji, Guangpin Tao, Yun Cao, Ying Tai, Tong Lu*, Chengjie Wang, Jilin Li, Feiyue Huang. Frequency consistent adaptation for real world super resolution. The 35th AAAI Conference on Artificial Intelligence (AAAI'21), Feb 2-9, 2021

116. Minglong Xue, Ruoze Liu, Tong Lu*. A novel attention enhanced residual-in-residual dense network for text image super-resolution. The 2021 IEEE International Conference on Multimedia and Expo (ICME'21), Shenzhen, China, July 5-9, 2021

117. Guangchen Shi, Yirui Wu, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu. ARNet: active-reference network for few-shot image semantic segmentation. The 2021 IEEE International Conference on Multimedia and Expo (ICME'21), Shenzhen, China, July 5-9, 2021

118. Palaiahnakote Shivakumara, Tanmay Jain, Nitish Surana, Umapada Pal, Tong Lu, Michael Blumenstein. Connected component based deep learning model for multi-type-sized struck-out component classification. The 16th International Conference on Document Analysis and Recognition (ICDAR'21), Lausanne, Switzerland, Sep 5-10, 2021


   2020

119. Wenhai Wang, Xuebo Liu, Xiaozhong Ji, Enze Xie, Ding Liang, Zhibo Yang, Tong Lu*, Chunhua Shen, Ping Luo. AE TextSpotter: learning visual and linguistic representation for ambiguous text spotting. The 16th European Conference on Computer Vision (ECCV'20), Aug 23-28, 2020

120. Zhaoyang Liu, Donghao Luo, Yabiao Wang, Limin Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Tong Lu. Temporal enhancement-and-interaction networks for action recognition. The 34th AAAI Conference on Artificial Intelligence (AAAI'20), Hilton New York Midtown, New York, USA, Feb 7-12, 2020

121. Dongqi Tang, Hao Kong, Xi Meng, Ruoze Liu, Tong Lu*. SEE-LPR: A semantic segmentation based end-to-end system for unconstrained license plate detection and recognition. The 26th International Conference on Multimedia Modeling (MMM'20), Daejeon, Korea, Jan 5-8, 2020 (corresponding author)

122. Pengfei Chen, Minglei Yuan, Tong Lu*. Multi-scale comparison network for few-shot learning. The 26th International Conference on Multimedia Modeling (MMM'20), Daejeon, Korea, Jan 5-8, 2020 (corresponding author)

123. Xiaozhong Ji, Yirui Wu, Tong Lu*. Context-aware residual network with promotion gates for single image super-resolution. The 26th International Conference on Multimedia Modeling (MMM'20), Daejeon, Korea, Jan 5-8, 2020 (corresponding author)

124. Xiaoge Song, Yirui Wu, Wenhai Wang, Tong Lu*. TK-Text: multi-shaped scene text detection via instance segmentation. The 26th International Conference on Multimedia Modeling (MMM'20), Daejeon, Korea, Jan 5-8, 2020 (corresponding author)

125. Ayush Mittal, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein, Daniel Lopresti. A new context based method for restoring occluded text in natural scene images. The 14th IAPR International Workshop on Document Analysis Systems (DAS'20), July, 2020

126. Lokesh Nandanwar, Palaiahnakote Shivakumara, Suvojit Manna, Umapada Pal, Tong Lu, Michael Blumenstein. A new DCT-FFT fusion based method for caption and scene text classification in action video images. The 2nd International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI'20), Zhongshan, 2020

127. Lokesh Nandanwar, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Daniel Lopresti, Bhagesh Seraogi, Bidyut. B. Chaudhuri. A new method for detecting altered text in pdf document images. The 2nd International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI'20), Zhongshan, 2020

128. Yuntao Ma, Yirui Wu, Tong Lu*. Multi-scale relational reasoning with regional attention for visual question answering. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

129. Haifeng Guo, Yirui Wu, Tong Lu*. Dynamic low-light image enhancement for object detection via end-to-end training. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

130. Chunhao Cai, Minglei Yuan, Tong Lu*. IFSM: an iterative feature selection mechanism for few-shot image classification. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

131. Lokesh Nandanwar, Palaiahnakote Shivakumara, Sayani Kundu, Umapada Pal, Tong Lu, Daniel Lopresti. Chebyshev-Harmonic-Fourier-Moments and deep CNNs for detecting forged handwriting. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020

132. Lokesh Nandanwar, Palaiahnakote Shivakumara, Raghavendra Ramachandra, Tong Lu, Umapada Pal, Daniel Lopresti, Nor Badrul Anuar. Local gradient difference based mass features for classification of 2D-3D natural scene text images. The 25th International Conference on Pattern Recognition (ICPR'20), Milan, Italy, Jan 10-15, 2020


2019

133. Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu*, Gang Yu, Chunhua Shen. Shape robust text detection with progressive scale expansion network. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'19), Long Beach, CA,  June 16-20, 2019 (corresponding author)

134. Wenhai Wang, Enze Xie, Yuhang Zang, Wenjia Wang, Tong Lu*, Gang Yu, Chunhua Shen. Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. International Conference on Computer Vision (ICCV'19), Seoul, Korea, Oct. 27-Nov. 2, 2019 (corresponding author)

135. Zhenjia Pang, Ruoze Liu, Zhouyu Meng, Yi Zhang, Yang Yu, Tong Lu. On reinforcement learning for full-length game of StarCraft. The 33rd AAAI Conference on Artificial Intelligence (AAAI'19), Honolulu, Hawaii, Jan 27-Feb 1, 2019

136. Xi Meng, Hao Kong, Dongqi Tang, Tong Lu*. Multimodal image captioning through combining reinforced cross loss and stochastic deprecation. IEEE International Conference on Multimedia and Expo (ICME'19), Shanghai, China, July 8-12, 2019 (corresponding author)


137. Yisheng Yue, Palaiahnakote Shivakumara, Yirui Wu, Liping Zhu, Tong Lu*, Umapada Pal. An automatic system for generating artificial fake character images. The 25th International Conference on Multimedia Modeling (MMM'19), Thessaloniki, Greece, Jan 8-11, 2019 (corresponding author)

138. Yirui Wu, Weigang Xu, Qinghan Yu, Jun Feng, Tong Lu. Hierarchical Bayesian network based incremental model for flood prediction. The 25th International Conference on Multimedia Modeling (MMM'19), Thessaloniki, Greece, Jan 8-11, 2019 (corresponding author)

139. Wenbo Hou, Wenhai Wang, Ruoze Liu, Tong Lu*. Cropout: a general mechanism for reducing overfitting on convolutional neural networks. 2019 International Joint Conference on Neural Networks (IJCNN'19), Budapest, Hungary, July 14-19, 2019 (corresponding author)

140. Minglei Yuan, Xiaozhong Ji, Tong Lu*, Pengfei Chen, Hualu Zhang. A novel two-factor attention encoder-decoder network through combining temporal and prior knowledge for weather forecasting. 2019 International Joint Conference on Neural Networks (IJCNN'19), Budapest, Hungary, July 14-19, 2019 (corresponding author)

141. Yindong Zheng, Yuntao Ma, Ruoze Liu, Tong Lu*. A novel group-aware pruning method for few-shot learning. 2019 International Joint Conference on Neural Networks (IJCNN'19), Budapest, Hungary, July 14-19, 2019 (corresponding author)

142. Hao Kong, Dongqi Tang, Xi Meng, Tong Lu*. GARN: a novel generative adversarial recognition network for end-to-end scene character recognition. The 15th International Conference on Document Analysis and Recognition (ICDAR'19), Sydney, Australia, Sep 20-25, 2019 (corresponding author)

143. Yao Xiao, Minglong Xue, Tong Lu*, Yirui Wu, Shivakumara Palaiahnakote. A text-content-aware CNN network for multi-oriented and multi-language scene text detection. The 15th International Conference on Document Analysis and Recognition (ICDAR'19), Sydney, Australia, Sep 20-25, 2019 (corresponding author)

144. V. Basavaraja, Shivakumara Palaiahnakote, D.S. Guru, Umapada Pal, Tong Lu, Michael Blumenstein. Age estimation using disconnectedness features in handwriting. The 15th International Conference on Document Analysis and Recognition (ICDAR'19), Sydney, Australia, Sep 20-25, 2019 (corresponding author)

145. Sauradip Nag, R. Raghavendra, Shivakumara Palaiahnakote, Umapada Pal, Tong Lu, Mohan Kankanhalli. CRNN based jersey-bib number/text recognition in sports and Marathon images. The 15th International Conference on Document Analysis and Recognition (ICDAR'19), Sydney, Australia, Sep 20-25, 2019

146. Divya Krishnani, Shivakumara Palaiahnakote, Tong Lu, Umapada Pal, Raghavendra Ramachandra. Structure function based transform features for person behaviour-oriented social media image classification. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

147. Anaica Grouver, Shivakumara Palaiahnakote, Maryam Asadzadeh Kaljahi, Bhaarat Chetty, Umapada Pal, Tong Lu, G. Hemantha Kumar. A spatial density and phase angle based correlation for forged family photo identification. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

148. Sayani Kundu, Shivakumara Palaiahnakote, Anaica Grouver, Umapada Pal, Tong Lu, Michael Blumenstein. A new forged handwriting detection method based on Fourier spectral density and variation. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

149. Soumyadip Roy, Shivakumara Palaiahnakote, Umapada Pal, Tong Lu, Michael Blumenstein. New moments based fuzzy similarity measure for text detection in distorted social media images. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

150. Pinaki Nath Chowdhury, Shivakumara Palaiahnakote, Raghavendra Ramachandra, Umapada Pal, Tong Lu, Michael Blumenstein. A new U-net based enhancement model for license plate detection in night and day images. The 5th Asian Conference on Pattern Recognition (ACPR'19), Auckland, New Zealand, Nov 26-29, 2019

  2018

151. Wenhai Wang, Jian Yang, Tong Lu*, Xiang Li. Mixed link networks. The 27th International Joint Conference on Artificial Intelligence (IJCAI'18), Stockholm, Sweden, July 13-19, 2018 (corresponding author)



152. Lianglei Wei, Yirui Wu, Wenhai Wang, Tong Lu*. A novel 3D human action recognition framework for video content analysis. The 24th International Conference on Multimedia Modeling (MMM'18), Bangkok, Thailand, Feb 5-8,  2018 (corresponding author)


153. Wenhai Wang, Yirui Wu, Palaiahnakote Shivakumara, Tong Lu*. Cloud of line distribution and random forest text detection from natural/video images. The 24th International Conference on Multimedia Modeling (MMM'18), Bangkok, Thailand, Feb 5-8, 2018 (corresponding author)

154. Yirui Wu, Weigang Xu, Jun Feng, Palaiahnakote Shivakumara, Tong Lu. Local and global Bayesian network based model for flood prediction. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018

155. Xuerong Wu, Palaiahnakote Shivakumara, Liping Zhu, Tong Lu*, Umapada Pal, Michael Blumenstein. Fourier transform based features for clean and polluted water image classification. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018 (corresponding author)

156. Vijeta Khare, Palaiahnakote Shivakumara, B. J Navya, G. C Swetha, D. S Guru, Umapada Pal, Tong Lu*. Weighted-gradient features for handwritten line segmentation. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018 (corresponding author)

157. Yirui Wu, Zhikai Li, Palaiahnakote Shivakumara, Tong Lu*. Em-SLAM: a fast and robust monocular SLAM method for embedded systems. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018 (corresponding author)


158. B. J Navya, G. C Swetha, Palaiahnakote Shivakumara, Sangheeta Roy, D. S Guru, Umapada Pal, Tong Lu*. Multi-gradient directional features for gender identification. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018 (corresponding author)

159. Yirui Wu, Zhaoyang Liu, Weigang Xu, Jun Feng, Palaiahnakote Shivakumara, Tong Lu. Context-aware attention LSTM network for flood prediction. The 24th International Conference on Pattern Recognition (ICPR'18), Beijing, China, Aug 20-24, 2018

160. Minglei Yuan, Palaiahnakote Shivakumara, Hao Kong, Tong Lu*, Umapada Pal. Text component reconstruction for tracking in video. The 19th Pacific-Rim Conference on Multimedia (PCM'18), Hefei, China, Sep 21-22, 2018 (corresponding author)

161. Zhaoyang Liu, Yirui Wu, Yukai Ding, Jun Feng, Tong Lu*. Context and temporal aware attention model for flood prediction. The 19th Pacific-Rim Conference on Multimedia (PCM'18), Hefei, China, Sep 21-22, 2018 (corresponding author)

162. Chao Zhang, Palaiahnakote Shivakumara, Minglong Xue, Liping Zhu, Tong Lu*, Umapada Pal. New fusion based enhancement for text detection in night video footage. The 19th Pacific-Rim Conference on Multimedia (PCM'18), Hefei, China, Sep 21-22, 2018 (corresponding author)

163. Tianping Hu, Wenhai Wang, Tong Lu*. Hand pose estimation with attention-and-sequence network. The 19th Pacific-Rim Conference on Multimedia (PCM'18), Hefei, China, Sep 21-22, 2018 (corresponding author)


164. Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Tapabrata Chakraborti, Michael Blumenstein. A new roadmap for evaluating descriptive handwritten answer type. The 1st International Conference on Pattern Recognition & Artificial Intelligence (ICPRAI'18), Quebec, Canada, 2018

165. Yirui Wu, Yisheng Yue, Xiao Tan, Wei Wang, Tong Lu. End-to-end chromosome karyotyping with data augmentation using GAN. The 25th International Conference on Image Processing (ICIP'18), Athens, Greece, Oct 7-10, 2018

166. Palaiahnakote Shivakumara, V. Basavaraja, Harsha S. Gowda, D. S. Guru, Umapada Pal, Tong Lu*. New RGB based fusion for forged IMEI number detection in mobile images. The 16th International Conference on Frontiers in Handwriting Recognition (ICFHR'18), Niagara Falls, USA, Aug 5-8, 2018

167. B. J. Navya, Palaiahnakote Shivakumara, G. C Shwetha, Sangheeta Roy, D. S. Guru, Umapada Pal, Tong Lu*. Adaptive multi-gradient kernels for gender identification. The 16th International Conference on Frontiers in Handwriting Recognition (ICFHR'18), Niagara Falls, USA, Aug 5-8, 2018

168. Sauradip Nag, Palaiahnakote Shivakumara, Yirui Wu, Umapada Pal, Tong Lu*. New COLD feature based handwriting analysis for ethnicity/nationality identification. The 16th International Conference on Frontiers in Handwriting Recognition (ICFHR'18), Niagara Falls, USA, Aug 5-8, 2018

169. Wencan Zong, Alex Noel Joseph Raj, Palaiahnakote Shivakumara, Zhemin Zhuang, Tong Lu, Umapada Pal. A new shadow detection and depth removal method for 3D text recognition in scene images. The 2nd International Conference on Computer Science and Artificial Intelligence (CSAI'18), Shenzhen, China, Dec 8-10, 2018

 2017

170. Zehuan Yuan, Jonathan Stroud, Tong Lu*, Jia Deng. Temporal action localization by structured maximal sums. Computer Vision and Pattern Recognition 2017 (CVPR'17), Honolulu, Hawaii, July 22-25, pp. 3215-3223, 2017 (corresponding author)

171. Zehuan Yuan, Tong Lu*, Yirui Wu. Deep-dense conditional random fields for object co-segmentation. The 26th International Joint Conference on Artificial Intelligence (IJCAI'17), Melbourne, Australia, Aug 19-25, pp. 3371-3377, 2017 (corresponding author)

172. Yiyang Zhou, Wenhai Wang, Wenjie Guan, Yirui Wu, Heng Lai, Tong Lu*, Min Cai. Visual robotic object grasping through combining RGB-D data and 3D mesh. The 23rd International Conference on Multimedia Modeling (MMM'17), Reykjavik, Iceland, January 4-6, pp. 404-415, 2017 (corresponding author)

173. Ruoze Liu, Xin Sun, Hailiang Xu, Palaiahnakote Shivakumara, Feng Su, Tong Lu*, Ruoyu Yang. Robust scene text detection for multi-script languages using deep learning. The 23rd International Conference on Multimedia Modeling (MMM'17), Reykjavik, Iceland, January 4-6, pp. 329-340, 2017 (corresponding author)

174. Zhen Wang, Palaiahnakote Shivakumara, Tong Lu*, Mahadevappa Basavanna, Umapada Pal, Michael Blumenstein. Fourier-residual for printer identification. The 14th International Conference on Document Analysis and Recognition (ICDAR'17), Kyoto, Japan, Nov 9-15, 2017 (corresponding author)

175. Yirui Wu, Wenhai Wang, Palaiahnakote Shivakumara, Tong Lu*. A robust symmetry-based method for scene/video text detection through convolutional network. The 14th International Conference on Document Analysis and Recognition (ICDAR'17), Kyoto, Japan, Nov 9-15, 2017 (corresponding author)

176. Sangheeta Roy, Palaiahnakote Shivakumara, Namita Jain, Vijeta Khare, Umapada Pal, Tong Lu. New fuzz-mass features for video type categorization. The 14th International Conference on Document Analysis and Recognition (ICDAR'17), Kyoto, Japan, Nov 9-15, 2017 (corresponding author)

177. Sangheeta Roy, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Wahid Bin Abdul Wahab. Temporal integration for word-wise image type classification. The 14th International Conference on Document Analysis and Recognition (ICDAR'17), Kyoto, Japan, Nov 9-15, 2017 (corresponding author)

178. Wenhai Wang, Yirui Wu, Palaiahnakote Shivakumara, Tong Lu, Jun Liu. Cloud of line distribution for arbitrary text detection in scene/video/license plate images. The 18th Pacific-Rim Conference on Multimedia (PCM'17), Harbin, China, Sep 28-29, 2017

179. Yirui Wu, Zhouyu Meng, Palaiahnakote Shivakumara, Tong Lu. Compressing YOLO network by compressive sensing. The 4th Asian Conference on Pattern Recognition (ACPR'17), Nanjing, China, Nov 26-29, 2017

180. Palaiahnakote Shivakumara, Aishik Konwer, Abir Bhowmick, Vijeta Khare, Umapada Pal, Tong Lu. A new GVF arrow pattern for character segmentation from double line license plate images. The 4th Asian Conference on Pattern Recognition (ACPR'17), Nanjing, China, Nov 26-29, 2017

181. K. S. Raghunandan, Palaiahnakote Shivakumara, G. Hemantha  Kumar, Umapada Pal, Tong Lu. Sharpness and contrast based features for word-wise video type classification. The 4th Asian Conference on Pattern Recognition (ACPR'17), Nanjing, China, Nov 26-29, 2017


 2016

182. Yirui Wu, Xianli Zhou, Tong Lu*, Guo Mei, Linbi Sun. EvaToon: a novel graph matching system for evaluating cartoon drawings. The 23rd International Conference on Pattern Recognition (ICPR'16), Cancun, Mexico, December 4-8, pp. 1119-1124, 2016 (corresponding author)

183. Vijeta Khare, Palaiahnakote Shivakumara, Ahald Kumar, Chee Seng Chan, Tong Lu*, Michael Blumenstein. A quad tree based method for blurred and non-blurred video text frames classification through quality metrics. The 23rd International Conference on Pattern Recognition (ICPR'16), Cancun, Mexico, December 4-8, pp. 4023-4028, 2016 (corresponding author)

184. Longfei Qin, Palaiahnakote Shivakumara, Tong Lu*, Umapada Pal, Chew Lim Tan. Video scene text frames categorization for text detection and recognition. The 23rd International Conference on Pattern Recognition (ICPR'16), Cancun, Mexico, December 4-8, pp. 3875-3880, 2016 (corresponding author)

185. Sangheeta Roy, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Chew Lim Tan. New tampered features for scene and caption text classification in video frame. The 15th International Conference on Frontiers in Handwriting Recognition (ICFHR'16), Shenzhen, China, October 23-26, pp. 36-41, 2016

186. K.S. Raghunandan, Palaiahnakote Shivakumara, B.J. Navya, G. Pooja, Navya Prakash, G. Hemantha Kumar, Umapada Pal, Tong Lu. Fourier coefficients for fraud handwritten document classification through age analysis. The 15th International Conference on Frontiers in Handwriting Recognition (ICFHR'16), Shenzhen, China, October 23-26, pp. 25-30, 2016 

187. K.S. Raghunandan, Palaiahnakote Shivakumara, G. Hemantha Kumar, Umapada Pal, Tong Lu. New sharpness features for image type classification based on textual information. The 12th IAPR International Workshop on Document Analysis Systems (DAS'16), Santorini, Greece, April 11-14, pp. 204-209,  2016


 2015

188. Yirui Wu, Tong Lu, Zehuan Yuan, Hao Wang. FreeScup: a novel platform for assisting sculpture pose design. International Conference on Multimedia and Expo (ICME'15), Torino, Italy, June 29-July 3, pp. 1-6, 2015 (corresponding author, oral, acceptance rate 15%) 


189. Yirui Wu, Oscar Kin-Chung Au, Chiew-Lan Tai, Tong Lu. HIRM: a handle-independent reduced model for incremental mesh editing. The 9th International Conference on Geometric Modeling and Processing (GMP'15), Lugano, Switzerland, June 1-3, pp. 56-68, 2015 (corresponding author) Published time: May, 2015

190. Yu Zhang, Tong Lu. A fast color barcode detection method through cross identification on mobile platforms. The 13th International Conference on Document Analysis and Recognition (ICDAR'15), Nancy, France, August 23-26, pp. 416-420, 2015 (corresponding author, oral)


191. Xiaolong Liu, Tong Lu. Natural scene character recognition using Markov Random Field. The 13th International Conference on Document Analysis and Recognition (ICDAR'15), Nancy, France, August 23-26, pp. 396-400, 2015 (corresponding author)

192. Qisu Li, Tong Lu, Palaiahnakote Shivakumara, Umapada Pal, Chew Lim Tan. A new method based on bag of filters for character recognition in scene images by learning. The 13th International Conference on Document Analysis and Recognition (ICDAR'15), Nancy, France, August 23-26, pp. 391-395, 2015 (corresponding author)

193. Guozhu Liang, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan. A new Wavelet-Laplacian method for arbitrarily-oriented character segmentation in video text lines. The 13th International Conference on Document Analysis and Recognition (ICDAR'15), Nancy, France, August 23-26, pp. 926-930, 2015 (corresponding author)

194. Yangbing Weng, Palaiahnakote Shivakumara, Tong Lu, Liang Kim Meng, Hon Hock Woon. A new multi-spectral fusion method for degraded video text frame enhancement. The 16th Pacific Rim Conference on Multimedia (PCM'15), Gwangju, South Korea, September 16-18, pp. 495-506, 2015  (corresponding author) Published time: December 30, 2015

195. Sangheeta Roy, Palaiahnakote Shivakumara, Prabir Mondal, R. Raghavendra, Umapada Pal, Tong Lu. A new multi-modal technique for bib number/text detection in natural images. The 16th Pacific Rim Conference on Multimedia (PCM'15), Gwangju, South Korea, September 16-18, pp. 483-494, 2015. Published time: December 30, 2015

196. Palaiahnakote Shivakumara, Guozhu Liang, Sangheeta Roy, Umapada Pal, Tong Lu. New texture-spatial features for keyword spotting in video images. The 3rd IAPR Asian Conference on Pattern Recognition (ACPR'15), Kuala Lumpur, Malaysia, November 3-6, pp. 391-395, 2015

197. Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan, Michael Blumenstein. Text detection in born-digital images by mass estimation. The 3rd IAPR Asian Conference on Pattern Recognition (ACPR'15), Kuala Lumpur, Malaysia, November 3-6, pp. 690-694, 2015 (corresponding author)


2014

198. Zehuan Yuan, Tong Lu, Palaiahnakote Shivakumara. A novel topic-level random walk framework for scene image co-segmentation. European Conference on Computer Vision (ECCV'14), Zurich, Switzerland, September 6-12, pp. 695-709, 2014 (corresponding author) Published time: April 29, 2015

199. Zehuan Yuan, Tong Lu. A novel context-aware topic model for category discovery in natural scenes. The 12th Asian Conference on Computer Vision (ACCV'14), Singapore, Singapore, November 1-5, pp. 158-171, 2014 (corresponding author) Published time: April 29, 2015

200. Palaiahnakote Shivakumara, Mohamed Lubani, KokSheik Wong, Tong Lu. Optical flow based dynamic curved video text detection. The 21st IEEE International Conference on Image Processing (ICIP'14), Paris, France, October 27-30, pp. 1668-1672, 2014

201. Liang Wu, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan. Text detection using Delaunay triangulation in video sequence. The 11th IAPR International Workshop on Document Analysis Systems (DAS'14), Loire Valley, France, April 7-10, pp. 41-45, 2014 (corresponding author, nominated for Best Student Paper Award)


202. Hao Wang, Tong Lu, Oscar Kin-Chung Au, Chiew-Lan Tai. Spectral 3D mesh segmentation with a novel single segmentation field descriptor. The 8th International Conference on Geometric Modeling and Processing (GMP'14), Singapore, June 29-July 1, pp. 440-456, 2014 (corresponding author) Published time: September, 2014

203. Tong Lu, Liang Wu, Xiaolin Ma, Palaiahnakote Shivakumara, Chew Lim Tan. Anomaly detection through spatio-temporal context modeling in crowded scenes. The 22nd International Conference on Pattern Recognition (ICPR'14), Stockholm, Sweden, August 24-28, pp. 2203-2208, accepted, 2014 (corresponding author)

204. Tong Lu, Gongyou Wang, Yangbing Weng. Auditory movie summarization by detecting sound events and scene changes. The 22nd International Conference on Pattern Recognition (ICPR'14), Stockholm, Sweden, August 24-28, pp. 756-760, accepted, 2014 (corresponding author)

205. Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu, Trung Quy Phan, Chew Lim Tan. Graphics and scene text classification in video. The 22nd International Conference on Pattern Recognition (ICPR'14), Stockholm, Sweden, August 24-28, pp. 4714-4719, accepted, 2014 (corresponding author)

206. Jiamin Xu, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan. 2D and 3D video scene text classification. The 22nd International Conference on Pattern Recognition (ICPR'14), Stockholm, Sweden, August 24-28, pp. 2932-2937, accepted, 2014 (corresponding author)



 2013

207. Weichong Yin, Tong Lu, Feng Su. A novel multi-view object class detection framework for document image content analysis. The 12th International Conference on Document Analysis and Recognition (ICDAR'13), Washington, DC, USA, August 25-28, pp. 1095-1099, 2013 (corresponding author)

208. Trung Quy Phan, Palaiahnakote Shivakumara, Tong Lu, Chew Lim Tan. Recognition of video text through temporal integration. The 12th International Conference on Document Analysis and Recognition (ICDAR'13), Washington, DC, USA, August 25-28, pp. 589-593, 2013

209. Feng Su, Tong Lu. Discriminative weighting and subspace learning for ensemble symbol recognition. The 12th International Conference on Document Analysis and Recognition (ICDAR'13), Washington, DC, USA, August 25-28, pp. 1088-1092, 2013

210. Yirui Wu, Tong Lu, Jiqiang Song. A real-time animation framework using Kinect. The 13th Pacific-Rim Conference on Multimedia (PCM'13), Nanjing, China, December 13-16, pp. 245-256, 2013 (corresponding author) Published time: November 5, 2013


211. Feiming Xu, Tong Lu, Yirui Wu. Robust object tracking using motion context in crowded scenes. The 13th Pacific-Rim Conference on Multimedia (PCM'13), Nanjing, China, December 13-16, pp. 550-560, 2013 (corresponding author) Published time: November 5, 2013


212. Hao Wang, Tong Lu, et al. Recognition and reconstruction from complex line drawings. The 10th IAPR International Workshop on Graphics Recognition (GREC'13), Bethlehem, PA, USA, August 20-21, 2013 (corresponding author)


 2012

213. Wanxia Lin, Tong Lu, Feng Su. A novel multi-view integration and propagation model for cross-media information retrieval. The 18th International Conference on Multimedia Modeling (MMM'12), Klagenfurt, Austria, January 4-6, pp. 740-749, 2012 (corresponding author) Published time: December 21, 2011

214. Xiaolin Ma, Tong Lu, Feiming Xu, Feng Su. Anomaly detection with spatio-temporal context using depth images. The 21st International Conference on Pattern Recognition (ICPR'12), Tsukuba, Japan, November 11-15, pp. 2590-2593, 2012 (corresponding author)

215. Feng Su, Yang Li, Tong Lu. Ensemble symbol recognition with Hough forest. The 21st International Conference on Pattern Recognition (ICPR'12), Tsukuba, Japan, November 11-15, pp. 1659-1663, 2012

216. Zehuan Yuan, Tong Lu, Haojuan Zhou, Bin Chen, Jianing Li. Incremental 3D reconstruction using Bayesian learning. The 25th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE'12), Dalian, China, June 9-12, pp. 754-763, 2012 (corresponding author, Best Paper Award) Published time: July 7, 2012

217. Yukang Jin, Tong Lu, Feng Su. Movie keyframe retrieval based on cross-media correlation detection and context model. The 25th International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA/AIE'12), Dalian, China, June 9-12, pp. 816-825, 2012 (corresponding author) Published time: July 7, 2012


 2011

218. Limin Wang, Yirui Wu, Tong Lu, Kang Chen. Multiclass object detection by combining local appearances and context. The 19th International Conference on Multimedia (ACM Multimedia'11), Scottsdale, AZ, USA, November 28-December 1, pp. 1161-1164, 2011 (corresponding author)


219. Feng Su, Yang Li, Tong Lu, Gongyou Wang. Environmental sound classification for scene recognition using local discriminant bases and HMM. The 19th International Conference on Multimedia (ACM Multimedia'11), Scottsdale, AZ, USA, November 28-December 1, pp. 1389-1392, 2011

220. Yang Zhao, Tong Lu, Wujun Liao. A robust color-independent text detection method from complex videos. The 11th International Conference on Document Analysis and Recognition (ICDAR'11), Beijing, China, September 18-21, pp. 374-378, 2011 (corresponding author)

221. Feng Su, Tong Lu, Ruoyu Yang. Symbol recognition by multiresolution shape context matching. The 11th International Conference on Document Analysis and Recognition (ICDAR'11), Beijing, China, September 18-21, pp. 1319-1323, 2011

222. Yan Zhao, Zhaokang Wang, Tong Lu, et al. Real-time video caption detection. The 9th IAPR International Workshop on Graphics Recognition (GREC'11), Seoul, Korea, September 15-16, pp. 150-153, 2011 (corresponding author)


 2010

223. Yimin Wang, Tong Lu, Rongjun Gao, Wenyin Liu. 3D model comparison through kernel density matching. The 20th International Conference on Pattern Recognition (ICPR'10), Istanbul, Turkey, August 23-26, pp. 3159-3162, 2010 (corresponding author)

224. Feng Su, Tong Lu, Ruoyu Yang. Symbol recognition combining vectorial and pixel-level features for line drawings. The 20th International Conference on Pattern Recognition (ICPR'10), Istanbul, Turkey, August 23-26, pp. 1892-1895, 2010

225. Tong Lu, Rongjun Gao, Tuantuan Wang, Yubin Yang. 3D similarity search using a weighted structural histogram representation. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 348-356, 2010 (corresponding author) Published time: November 4, 2010

226. Tuantuan Wang, Tong Lu, Wenyin Liu. Robust shape retrieval through a novel statistical descriptor. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 330-337, 2010 (corresponding author) Published time: November 4, 2010

227. Zengyu Zhang, Tong Lu, Feng Su, Ruoyu Yang. A new text detection algorithm for content-oriented line drawing image retrieval. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 338-347, 2010 (corresponding author) Published time: November 4, 2010

228. Limin Wang, Yirui Wu, Ziyuan Tian, Zailiang Sun, Tong Lu. A novel approach for robust surveillance video content abstraction. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 660-671, 2010 (corresponding author) Published time: November 4, 2010

229. Feng Su, Tong Lu, Ruoyu Yang. A new shape descriptor for object recognition and retrieval. The 10th Pacific-Rim Conference on Multimedia (PCM'10), Shanghai, China, September 21-24, pp. 493-502, 2010. Published time: November 4, 2010

230. Ruoyu Yang, Feng Su, Tong Lu. Research of the structural-learning-based symbol recognition mechanism for engineering drawings. The 6th International Conference on Digital Content, Multimedia Technology and its Applications (IDC'10), Seoul, Korea, August 16-18, pp. 346-349, 2010


 2009

231. Tong Lu, Yubing Yang, Feng Su, Zengxin Sun. Semi-automatic roof reconstruction. The 10th International Conference on Document Analysis and Recognition (ICDAR'09), Barcelona, Spain, July 26-29, pp. 723-727, 2009 (corresponding author)

232. Feng Su, Tong Lu, Ruoyu Yang, Shijie Cai, Yubing Yang. A character segmentation method for engineering drawings based on holistic and contextual constraints. The 8th IAPR International Workshop on Graphics Recognition (GREC'09), La Rochelle, France, July 22-23, pp. 280-287, 2009

233. Yubing Yang, Wei Wei, Tong Lu, Yang Gao. 3D scene analysis using UIMA framework. The 22nd International Conference on Industrial, Engineering and Other Applied Intelligent Systems (IEA/AIE'09), Taiwan, China, June 24-27, pp. 369-378, 2009. Published time: July 14, 2009

234. Yubing Yang, Jinjie Lin, Tong Lu. Saliency regions for 3D mesh abstraction. The 9th Pacific-Rim Conference on Multimedia (PCM'09), Bangkok, Thailand, December 15-18, pp. 292-299, 2009. Published time: January 13, 2010

235. Feng Su, Tong Lu, Yubing Yang, Shijie Cai. Text separation from engineering drawings. IAPR International Workshop on Graphics Recognition (GREC'09), La Rochelle, France, July 22-23, pp. 280-287, 2009