An Efficient Similarity Search in Large Data Collections with MapReduce, in Future Data and Security Engineering: Proceedings of the First International Conference, FDSE 2014, Ho Chi Minh City, Vietnam, Nov.

Publication: Contribution to book/report/conference proceedings; conference contribution; peer-reviewed

Abstract

The era of big data calls for innovations in similarity search. The ever-growing volume of data strains both the processing capacity and the performance of existing information systems. To address these scalability challenges, we propose an efficient similarity search scheme for large data collections based on MapReduce. We apply the proposed scheme to common similarity search cases, including pairwise similarity, search by example, range query, and k-Nearest Neighbor query. In addition, collaborative strategic refinements eliminate unnecessary computations and speed up the whole process. Finally, we evaluate our methods experimentally, alongside a previous work, on large real-world datasets to verify how well they perform.
Original language: English
Title: Future Data and Security Engineering: Proceedings of the First International Conference, FDSE 2014, Ho Chi Minh City, Vietnam, Nov.
Place of publication: Berlin, Heidelberg
Publisher: Springer
Pages: 44-57
Number of pages: 14
Volume: 8860
Publication status: Published - Nov. 2014

Publication series

Name: Lecture Notes in Computer Science (LNCS)

UN SDGs

This output contributes to the following Sustainable Development Goal(s)

  1. SDG 9 – Industry, Innovation and Infrastructure
  2. SDG 16 – Peace, Justice and Strong Institutions

Fields of science

  • 202007 Computer Integrated Manufacturing (CIM)
  • 102 Computer Sciences
  • 102001 Artificial Intelligence
  • 102006 Computer Supported Cooperative Work (CSCW)
  • 102010 Database systems
  • 102014 Information design
  • 102015 Information systems
  • 102016 IT security
  • 102022 Software development
  • 102025 Distributed systems
  • 502007 E-Commerce
  • 505002 Data protection
  • 506002 E-Government
  • 509018 Knowledge management

JKU focus areas

  • Computation in Informatics and Mathematics
