Proceedings paper

Perils of Combining Parallel Distance Computations with Metric and Ptolemaic Indexing in kNN Queries
M. Kruliš, S. Kirchhoff, J. Yaghob
Similarity Search and Applications

Similarity search methods face serious performance issues since similarity functions are rather expensive to compute. Many optimization techniques were designed to reduce the number of similarity computations, when a query is being resolved. Indexing methods, like pivot table prefiltering, based on the metric properties of feature space, are one of the most popular methods. They can increase the speed of query evaluation even by orders of magnitude. Another approach is to employ highly parallel architectures like GPUs to accelerate evaluation by unleashing their raw computational power. Unfortunately, resolving the k nearest neighbors (kNN) queries optimized with metric indexing is a problem that is serial in nature. In this paper, we explore the perils of kNN parallelization and we propose a new algorithm that basically converts kNN queries into range queries, which are perfectly parallelizable. We have experimentally evaluated all approaches using a highly parallel environment comprised of multiple GPUs. The new algorithm demonstrates more than 2× speedup to the naïve parallel implementation of kNN queries.

    title = {{Perils of Combining Parallel Distance Computations with Metric and Ptolemaic Indexing in kNN Queries}},
    author = {Kruliš, Martin and Kirchhoff, Steffen and Yaghob, Jakub},
    year = {2014},
    booktitle = {{Similarity Search and Applications}},
    editor = {Traina, Agma Juci Machado and Traina, Caetano and Cordeiro, Robson Leonardo Ferreira},
    publisher = {Springer International Publishing},
    series = {{Lecture Notes in Computer Science}},
    location = {Cham},
    doi = {10.1007/978-3-319-11988-5_12},
    isbn = {978-3-319-11988-5},
    pages = {127--138},