TY - GEN
T1 - An Efficient Similarity search in Large Data Collections with MapReduce, in Future Data and Security Engineering,Proceedings of the first International Conference, FDSE 2014,HO Chi Minh City Vietnam Nov
AU - Phan, Trong Nhan
AU - Küng, Josef
AU - Khanh Dang, Tran
PY - 2014/11
Y1 - 2014/11
N2 - The era of big data has been calling for many innovations on improving similarity search computing. Such unstoppable large amounts of data threaten both processing capacity and performance of existing information systems. Joining the challenges on scalability, we propose an efficient similarity search in large data collections with MapReduce. In addition, we make the best use of the proposed scheme for widespread similarity search cases including pairwise similarity, search by example, range query, and k-Nearest Neighbor query. Moreover, collaborative strategic refinements are utilized to effectively eliminate unnecessary computations and efficiently speed up the whole process. Last but not least, our methods are enhanced by experiments, along with a previous work, on real large datasets, which shows how well these methods are verified.
AB - The era of big data has been calling for many innovations on improving similarity search computing. Such unstoppable large amounts of data threaten both processing capacity and performance of existing information systems. Joining the challenges on scalability, we propose an efficient similarity search in large data collections with MapReduce. In addition, we make the best use of the proposed scheme for widespread similarity search cases including pairwise similarity, search by example, range query, and k-Nearest Neighbor query. Moreover, collaborative strategic refinements are utilized to effectively eliminate unnecessary computations and efficiently speed up the whole process. Last but not least, our methods are enhanced by experiments, along with a previous work, on real large datasets, which shows how well these methods are verified.
M3 - Conference proceedings
VL - 8860
T3 - Lecture Notes in Computer Science (LNCS)
SP - 44
EP - 57
BT - Future Data and Security Engineering,Proceedings of the first International Conference, FDSE 2014,HO Chi Minh City Vietnam Nov.
PB - Springer
CY - Berlin, Heidelberg
ER -