Learning to Hash
Paper, Code and Dataset

Table of Contents

  • Introduction
  • Tutorial Slides
  • Data-Independent Method
  • Learning to Hash Method (Data-Dependent Method)
  • Unsupervised Hashing
  • Supervised Hashing
  • Ranking-Based Hashing
  • Multi-Modal Hashing
  • Deep Hashing
  • Online Hashing
  • Quantization for Hashing
  • Distributed Hashing
  • Dataset

    Introduction

    This homepage lists some representative papers about hashing, especially Learning to Hash, for big data applications. If you have any questions, feel free to contact Dr. Wu-Jun LI.

     
    Tutorial Slides

    The tutorial slides about Learning to Hash, provided by Dr. Wu-Jun LI, can be downloaded from http://cs.nju.edu.cn/lwj/slides/L2H.pdf

     
    Data-Independent Method

  • Similarity Search in High Dimensions via Hashing [paper]
        Aristides Gionis, Piotr Indyk and Rajeev Motwani. [VLDB], 1999.
  • Locality-Sensitive Hashing Scheme Based on p-Stable Distributions [paper]
        Mayur Datar, Nicole Immorlica, Piotr Indyk, Vahab S. Mirrokni. [SCG], 2004.
  • Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions. [paper]
        Alexandr Andoni and Piotr Indyk. [Communications of the ACM], 2008.
  • Kernelized Locality-Sensitive Hashing for Scalable Image Search [paper]
        Brian Kulis and Kristen Grauman. [ICCV], 2009.
  • Fast Similarity Search for Learned Metrics [paper]
        Prateek Jain, Brian Kulis and Kristen Grauman. [TPAMI], 2009.
  • Locality-Sensitive Binary Codes from Shift-Invariant Kernels [paper]
        Maxim Raginsky and Svetlana Lazebnik. [NIPS], 2009.
  • Hashing Algorithms for Large Scale Learning [paper]
        Ping Li, Anshumali Shrivastava, Joshua Moore and Christian Konig. [NIPS], 2011.
  • Densifying One Permutation Hashing via Rotation for Fast Near Neighbor Search. [paper]
        Anshumali Shrivastava and Ping Li. [ICML], 2014
  • Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product Search (MIPS). [paper]
        Anshumali Shrivastava and Ping Li. [NIPS], 2014.
  • 0-Bit Consistent Weighted Sampling. [paper]
        Ping Li. [KDD], 2015
  • Practical and Optimal LSH for Angular Distance. [paper]
        Alexandr Andoni, Piotr Indyk and Thijs Laarhoven. [NIPS], 2015


    Learning to Hash Method (Data-Dependent Method)
  • Unsupervised Hashing
  • Spectral Hashing [paper][code]
        Yair Weiss, Antonio Torralba and Rob Fergus. [NIPS], 2008
  • Self-Taught Hashing for Fast Similarity Search [paper][code]
        Dell Zhang, Jun Wang, Deng Cai and Jinsong Lu. [SIGIR], 2010
  • Hashing with Graphs [paper][code]
        Wei Liu, Jun Wang, Sanjiv Kumar and Shih-Fu Chang. [ICML], 2011
  • Iterative Quantization: A Procrustean Approach to Learning Binary Codes [paper][code]
        Yunchao Gong and Svetlana Lazebnik. [CVPR], 2011
  • Isotropic Hashing [paper][code]
        Weihao Kong and Wu-Jun Li. [NIPS], 2012
  • Spherical Hashing [paper][C++ code][matlab code][slide]
        Jae-Pil Heo, Youngwoon Lee, Junfeng He, Shih-Fu Chang and Sung-Eui Yoon. [CVPR], 2012
  • Harmonious Hashing [paper]
        Bin Xu, Jiajun Bu, Yue Lin, Chun Chen, Xiaofei He and Deng Cai. [IJCAI], 2013
  • Complementary Projection Hashing [paper]
        Zhongming Jin, Yao Hu, Yue Lin, Debing Zhang, Shiding Lin, Deng Cai and Xuelong Li. [ICCV], 2013
  • Inductive Hashing on Manifolds [paper][supplementary][code][poster]
        Fumin Shen, Chunhua Shen, Qinfeng Shi, Anton van den Hengel and Zhenmin Tang. [CVPR], 2013
  • Learning Binary Codes for High-Dimensional Data Using Bilinear Projections [paper][code]
        Yunchao Gong, Sanjiv Kumar, Henry A. Rowley and Svetlana Lazebnik. [CVPR], 2013
  • Discrete Graph Hashing [paper]
        Wei Liu, Cun Mu, Sanjiv Kumar and Shih-Fu Chang. [NIPS], 2014
  • Circulant Binary Embedding [paper][code][slide]
        Felix X. Yu, Sanjiv Kumar, Yunchao Gong and Shih-Fu Chang. [ICML], 2014
  • Locally Linear Hashing for Extracting Non-Linear Manifolds. [paper][supplementary]
        Go Irie, Zhenguo Li, Xiao-Ming Wu and Shih-Fu Chang. [CVPR], 2014
  • Large-Scale Unsupervised Hashing with Shared Structure Learning [paper]
        Xianglong Liu, Yadong Mu, Danchen Zhang, Bo Lang, Xuelong Li. [IEEE Transactions on Cybernetics], 2015
  • Learning Binary Codes for Maximum Inner Product Search. [paper]
        Fumin Shen, Wei Liu, Shaoting Zhang, Yang Yang, Heng Tao Shen. [ICCV], 2015
  • An NMF perspective on Binary Hashing. [paper]
        Lopamudra Mukherjee, Sathya N. Ravi, Vamsi K. Ithapu, Tyler Holmes and Vikas Singh. [ICCV], 2015
  • Scalable Graph Hashing with Feature Transformation. [paper][code]
        Qing-Yuan Jiang and Wu-Jun Li. [IJCAI], 2015

  • Supervised Hashing
  • Semi-Supervised Hashing for Scalable Image Retrieval [paper]
        Jun Wang, Sanjiv Kumar, and Shih-Fu Chang. [CVPR], 2010
  • Minimal Loss Hashing for Compact Binary Codes [paper] [code]
        Mohammad Norouzi and David J. Fleet. [ICML], 2011
  • Supervised Hashing with Kernels [paper][code]
        Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang. [CVPR], 2012
  • LDAHash: Improved Matching with Smaller Descriptors [paper] [code]
        Christoph Strecha, Alexander M. Bronstein, Michael M. Bronstein, and Pascal Fua. [TPAMI], 2012
  • A General Two-Step Approach to Learning-based Hashing [paper] [code]
        Guosheng Lin, Chunhua Shen, David Suter, and Anton van den Hengel. [ICCV], 2013
  • Supervised binary hash code learning with Jensen Shannon divergence [paper]
        Lixin Fan. [ICCV], 2013
  • The power of asymmetry in binary hashing [paper] [code]
        Behnam Neyshabur, Nati Srebro, Ruslan R. Salakhutdinov, Yury Makarychev, Payman Yadollahpour. [NIPS], 2013
  • Graph cuts for supervised binary coding [paper]
        Tiezheng Ge, Kaiming He, and Jian Sun. [ECCV], 2014
  • Fast supervised hashing with decision trees for high-dimensional data [paper] [code]
        Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton van den Hengel, and David Suter. [CVPR], 2014
  • Supervised hashing with latent factor models [paper] [code]
        Peichao Zhang, Wei Zhang, Wu-Jun Li, and Minyi Guo. [SIGIR], 2014
  • Supervised Discrete Hashing [paper] [code]
        Fumin Shen, Chunhua Shen, Wei Liu, and Heng Tao Shen. [CVPR], 2015
  • kNN Hashing with Factorized Neighborhood Representation. [paper]
        Kun Ding, Chunlei Huo, Bin Fan, Chunhong Pan. [ICCV], 2015
  • Two Birds, One Stone: Jointly Learning Binary Code for Large-scale Face Image Retrieval and Attributes Prediction [paper]
        Yan Li, Ruiping Wang, Haomiao Liu, Huajie Jiang, Shiguang Shan and Xilin Chen. [ICCV], 2015
  • Large-scale multi-task image labeling with adaptive relevance discovery and feature hashing. [paper]
        Cheng Deng, Xianglong Liu, Yadong Mu, Jie Li. [Signal Processing], 2015
  • Column sampling based discrete supervised hashing. [paper] [code]
        Wang-Cheng Kang, Wu-Jun Li and Zhi-Hua Zhou. [AAAI], 2016

  • Ranking-Based Hashing
  • Hamming Distance Metric Learning [paper] [code]
        Mohammad Norouzi, David J. Fleet, Ruslan Salakhutdinov. [NIPS], 2012
  • Learning Hash Functions Using Column Generation [paper][code]
        Xi Li, Guosheng Lin, Chunhua Shen, Anton van den Hengel, Anthony Dick. [ICML], 2013
  • Order preserving Hashing for Approximate Nearest Neighbor Search [paper]
        Jianfeng Wang, Jingdong Wang, Nenghai Yu, and Shipeng Li. [ACM MM], 2013
  • Learning Hash Codes with Listwise Supervision [paper]
        Jun Wang, Wei Liu, Andy X. Sun, Yu-Gang Jiang. [ICCV], 2013
  • Optimizing Ranking Measures for Compact Binary Code Learning [paper]
        Guosheng Lin, Chunhua Shen, Jianxin Wu. [ECCV], 2014
  • Query-Adaptive Hash Code Ranking for Fast Nearest Neighbor Search [paper][code]
        Tianxu Ji, Xianglong Liu, Cheng Deng, Lei Huang, Bo Lang. [ACM MM], 2014
  • Ranking Preserving Hashing for Fast Similarity Search [paper]
        Qifan Wang, Zhiwei Zhang, Luo Si. [IJCAI], 2015
  • Top Rank Supervised Binary Coding for Visual Search [paper]
        Dongjin Song, Wei Liu, Rongrong Ji, David A. Meyer, John R. Smith. [ICCV], 2015

  • Multi-Modal Hashing
  • Composite Hashing with Multiple Information Sources [paper]
        Dan Zhang, Fei Wang, Luo Si. [ACM SIGIR], 2011
  • Learning hash functions for cross-view similarity search [paper]
        Shaishav Kumar and Raghavendra Udupa. [IJCAI], 2011
  • Co-Regularized Hashing for Multimodal Data [paper]
        Yi Zhen and Dit-Yan Yeung. [NIPS], 2012
  • Inter-Media Hashing for Large-Scale Retrieval from Heterogeneous Data Sources [paper]
        Jingkuan Song, Yang Yang, Yi Yang, Zi Huang and Heng Tao Shen. [SIGMOD], 2013
  • Comparing apples to oranges: a scalable solution with heterogeneous hashing [paper]
        Mingdong Ou, Peng Cui, Fei Wang, Jun Wang, Wenwu Zhu, Shiqiang Yang. [KDD], 2013
  • Large-scale supervised multimodal hashing with semantic correlation maximization [paper][code]
        Dongqing Zhang and Wu-Jun Li. [AAAI], 2014
  • Collective Matrix Factorization Hashing for Multimodal data [paper]
        Guiguang Ding, Yuchen Guo and Jile Zhou. [CVPR], 2014
  • Collaborative Hashing [paper][code]
        Xianglong Liu, Junfeng He, Cheng Deng, Bo Lang. [CVPR], 2014
  • Quantized correlation hashing for fast cross-modal search [paper]
        Botong Wu, Qiang Yang, Wei-Shi Zheng, Yizhou Wang and Jingdong Wang. [IJCAI], 2015
  • Semantic Topic Multimodal Hashing for Cross-Media Retrieval [paper]
        Di Wang, Xinbo Gao, Xiumei Wang and Lihuo He. [IJCAI], 2015
  • Semantics-Preserving Hashing for Cross-View Retrieval [paper]
        Zijia Lin, Guiguang Ding, Mingqing Hu and Jianmin Wang. [CVPR], 2015
  • Multi-View Complementary Hash Tables for Nearest Neighbor Search [paper]
        Xianglong Liu, Lei Huang, Cheng Deng, Jiwen Lu and Bo Lang. [ICCV], 2015
  • Alternating Co-Quantization for Cross-modal Hashing [paper]
        Go Irie, Hiroyuki Arai and Yukinobu Taniguchi. [ICCV], 2015
  • Cross-Modal Similarity Learning via Pairs, Preferences, and Active Supervision. [paper]
        Yi Zhen, Piyush Rai, Hongyuan Zha, and Lawrence Carin. [AAAI], 2015

  • Deep Hashing
  • Supervised Hashing via Image Representation Learning [paper][code][slide]
        Rongkai Xia, Yan Pan, Hanjiang Lai, Cong Liu, and Shuicheng Yan. [AAAI], 2014
  • Simultaneous Feature Learning and Hash Coding with Deep Neural Networks [paper]
        Hanjiang Lai, Yan Pan, Ye Liu, and Shuicheng Yan. [CVPR], 2015
  • Bit-Scalable Deep Hashing With Regularized Similarity Learning for Image Retrieval and Person Re-Identification [paper][code]
        Ruimao Zhang, Liang Lin, Rui Zhang, Wangmeng Zuo, and Lei Zhang. [TIP], 2015
  • Convolutional Neural Networks for Text Hashing [paper]
        Jiaming Xu, Peng Wang, Guanhua Tian, Bo Xu, Jun Zhao, Fangyuan Wang, Hongwei Hao. [IJCAI], 2015
  • Deep Semantic Ranking Based Hashing for Multi-Label Image Retrieval [paper][code]
        Fang Zhao, Yongzhen Huang, Liang Wang, and Tieniu Tan. [CVPR], 2015
  • Deep Hashing for Compact Binary Codes Learning [paper]
        Venice Erin Liong, Jiwen Lu, Gang Wang, Pierre Moulin, and Jie Zhou. [CVPR], 2015
  • Deep Learning of Binary Hash Codes for Fast Image Retrieval [paper][code][questions]
        Kevin Lin, Huei-Fang Yang, Jen-Hao Hsiao, and Chu-Song Chen. [CVPRW], 2015
  • Feature Learning based Deep Supervised Hashing with Pairwise Labels [paper][code]
        Wu-Jun Li, Sheng Wang and Wang-Cheng Kang. [IJCAI], 2016

  • Online Hashing
  • Online Hashing [paper]
        Long-kai Huang, Qiang Yang and Wei-Shi Zheng. [IJCAI], 2013
  • Online Sketching Hashing [paper]
        Cong Leng, Jiaxiang Wu, Jian Cheng, Xiao Bai and Hanqing Lu. [CVPR], 2015
  • Adaptive Hashing for Fast Similarity Search [paper]
        Fatih Cakir, Stan Sclaroff. [ICCV], 2015
  • Online Supervised Hashing [paper]
        Fatih Cakir, Stan Sclaroff. [ICIP], 2015

  • Quantization for Hashing
  • Double-Bit Quantization for Hashing [paper] [code]
        Weihao Kong, Wu-Jun Li. [AAAI], 2012
  • Manhattan hashing for large-scale image retrieval [paper] [code]
        Weihao Kong, Wu-Jun Li, Minyi Guo. [SIGIR], 2012
  • Variable Bit Quantisation for LSH [paper]
        Sean Moran, Victor Lavrenko, Miles Osborne. [ACL], 2013
  • Hash Bit Selection: a Unified Solution for Selection Problems in Hashing. [paper][supplementary][slide (in Chinese)][code][data]
        Xianglong Liu, Junfeng He, Bo Lang and Shih-Fu Chang. [CVPR], 2013
  • Hamming Compatible Quantization for Hashing [paper]
        Zhe Wang, Ling-Yu Duan, Jie Lin, Xiaofang Wang, Tiejun Huang and Wen Gao. [IJCAI], 2015

  • Distributed Hashing
  • Hashing for Distributed Data [paper]
        Cong Leng, Jiaxiang Wu, Jian Cheng, Xi Zhang and Hanqing Lu. [ICML], 2015

    Dataset

  • CIFAR [ link ]
  • IAPRTC-12  [ link ]
  • ImageNet [ link ]
  • LabelMe [ link ]
  • MIR-FLICKR [ link ]
  • MNIST  [ link ]
  • NUS-WIDE  [ link ]
  • SUN-397 [ link ]
  • Tiny-Image [ link ]
  • Wiki [ link ]