Loading…

Indexing the bit-code and distance for fast KNN search in high-dimensional spaces

TP311; Various index structures have recently been proposed to facilitate high-dimensional KNN queries, among which the techniques of approximate vector presentation and one-dimensional (iD) transformation can break the curse of dimensionality.Based on the two techniques above, a novel high-dimensio...

Full description

Saved in:
Bibliographic Details
Published in:Journal of Zhejiang University. A. Science 2007-05, Vol.8 (6), p.857-863
Main Authors: Liang, Jun-jie, Feng, Yu-cai
Format: Article
Language:English
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:TP311; Various index structures have recently been proposed to facilitate high-dimensional KNN queries, among which the techniques of approximate vector presentation and one-dimensional (iD) transformation can break the curse of dimensionality.Based on the two techniques above, a novel high-dimensional index is proposed, called Bit-code and Distance based index (BD).BD is based on a special partitioning strategy which is optimized for high-dimensional data. By the definitions of bit code and transformation function, a high-dimensional vector can be first approximately represented and then transformed into a 1D vector,the key managed by a B+-tree. A new KNN search algorithm is also proposed that exploits the bit code and distance to prune the search space more effectively. Results of extensive experiments using both synthetic and real data demonstrated that BD outperforms the existing index structures for KNN search in high-dimensional spaces.
ISSN:1673-565X
1862-1775
DOI:10.1631/jzus.2007.A0857