Efficient Processing of k-Nearest Neighbor Queries in Spatial Databases with the SH-tree

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Nearest neighbor problem has special considerations among database researchers. In many cases of querying, users want to see returned results that contain database objects similar to a given query object, and especially these returned results are ranked according to their similarity to the query object. In this paper, we introduce two adapted algorithms to the SH-tree [KKW2001] from the state-of-the-art of corresponding research results for efficiently processing the nearest neighbor problem in spatial databases. We do intensive performance tests on synthetic data sets, which dimension number varies from 2 to 64, as well as on real data set. Our experimental results show that the SH-tree with these adapted algorithms totally outperforms the search performance of the SR-tree in both IO-cost and CPU-time when processing the k-nearest neighbor queries. This result also confirms our theory analyses in [KKW2001]: the SH-tree can efficiently scale to high dimensional spatial databases.
Original languageEnglish
Title of host publicationProceedings of the Third International Conference on Information and Integration and Web-based Applications and Services
Pages425-435
Number of pages11
Publication statusPublished - Sept 2001

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure
  2. SDG 16 - Peace, Justice and Strong Institutions
    SDG 16 Peace, Justice and Strong Institutions

Fields of science

  • 102001 Artificial intelligence
  • 102006 Computer supported cooperative work (CSCW)
  • 102010 Database systems
  • 102014 Information design
  • 102015 Information systems
  • 102016 IT security
  • 102028 Knowledge engineering
  • 102019 Machine learning
  • 102022 Software development
  • 102025 Distributed systems
  • 502007 E-commerce
  • 505002 Data protection
  • 506002 E-government
  • 509018 Knowledge management
  • 202007 Computer integrated manufacturing (CIM)
  • 102033 Data mining
  • 102035 Data science

Cite this