A General and Efficient Approach for Solving Nearest Neighbor Problem in the Vague Query System

Josef Küng, Khanh Tran Dang, Roland Wagner

Research output: Chapter in Book/Report/Conference proceeding › Conference proceedings › peer-review

Abstract

This article presents a general and efficient approach for finding the best match for complex vague queries in the Vague Query System (VQS) [16]. The VQS is an extension to conventional database systems and can operate on top of them to provide vague retrieval capabilities. The key component of the VQS is the Numeric-Coordinate-Representation-Tables (NCR-Tables), which store semantic background information about attributes. Concretely, attributes of arbitrary types in a query relation or view are mapped to Euclidean space, and these mappings are kept in NCR-Tables. Answering a complex vague query requires searching several NCR-Tables in parallel, and these tables usually contain multidimensional data. In [17], Kueng et al. proposed an incremental hyper-cube approach for solving complex vague queries; however, this approach has weaknesses that degrade the search performance of the VQS. Theoretical analyses and experimental results in this article demonstrate that our new approach overcomes these defects and makes the VQS a full-fledged flexible query answering system.
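
To make the role of NCR-Tables more concrete, the following is a minimal sketch and not the method of the paper. It assumes that each NCR-Table can be modeled as a dictionary from attribute values to coordinates, that per-attribute distances are normalized by the largest pairwise distance in the table, and that a complex query combines attributes by averaging those normalized distances. The table contents, the names `ncr_color`, `ncr_city`, `normalized_distance`, and `best_match`, and the brute-force linear scan are all hypothetical; neither the incremental hyper-cube approach of [17] nor the new approach of this article is reproduced here.

```python
import math

# Hypothetical NCR-Tables: each maps an attribute value to a point in
# Euclidean space. Values and coordinates are invented for illustration.
ncr_color = {"red": (1.0, 0.0), "orange": (0.9, 0.3), "blue": (0.0, 1.0)}
ncr_city = {"Linz": (48.3, 14.3), "Vienna": (48.2, 16.4), "Graz": (47.1, 15.4)}


def normalized_distance(ncr_table, a, b):
    """Euclidean distance between two values of one attribute, scaled by
    the largest pairwise distance in the table (an assumed normalization)."""
    points = list(ncr_table.values())
    diameter = max(math.dist(p, q) for p in points for q in points)
    return math.dist(ncr_table[a], ncr_table[b]) / diameter


def best_match(rows, query, ncr_tables):
    """Brute-force scan: return the row whose average normalized distance
    to the query values is smallest (an assumed combination rule)."""
    def total_distance(row):
        parts = [
            normalized_distance(ncr_tables[attr], row[attr], wanted)
            for attr, wanted in query.items()
        ]
        return sum(parts) / len(parts)

    return min(rows, key=total_distance)


rows = [
    {"id": 1, "color": "blue", "city": "Linz"},
    {"id": 2, "color": "orange", "city": "Vienna"},
    {"id": 3, "color": "red", "city": "Graz"},
]
tables = {"color": ncr_color, "city": ncr_city}

# No row matches both conditions exactly, so the closest row is returned.
result = best_match(rows, {"color": "red", "city": "Linz"}, tables)
print(result)
```

A scan like this touches every row for every query, which is the kind of cost that a dedicated nearest neighbor search strategy over multidimensional NCR-Tables would aim to avoid.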
Original language: English
Title of host publication: Advances in Web-age information management. Third international conference, WAIM 2002, Beijing, China, August 11-13, 2002; proceedings
Editors: Xiaofeng Meng
Publisher: Springer Verlag
Pages: 367-378
Number of pages: 12
Volume: 2419
ISBN (Print): 3-540-44045-3
Publication status: Published - Aug 2002

Publication series

Name: Lecture Notes in Computer Science (LNCS)
ISSN (Print): 0302-9743

Fields of science

  • 102001 Artificial intelligence
  • 102006 Computer supported cooperative work (CSCW)
  • 102010 Database systems
  • 102014 Information design
  • 102015 Information systems
  • 102016 IT security
  • 102028 Knowledge engineering
  • 102019 Machine learning
  • 102022 Software development
  • 102025 Distributed systems
  • 502007 E-commerce
  • 505002 Data protection
  • 506002 E-government
  • 509018 Knowledge management
  • 202007 Computer integrated manufacturing (CIM)
  • 102033 Data mining
  • 102035 Data science
