Beneficial Sequential Combination of Data Mining Algorithms

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Depending on the goal of an instance of the Knowledge Discovery in Databases (KDD) process, there are instances that require more than a single data mining algorithm to determine a solution. Sequences of data mining algorithms offer room for improvement that are yet unexploited. If it is known that an algorithm is the first of a sequence of algorithms and there will be future runs of other algorithms, the first algorithm can determine intermediate results that the succeeding algorithms need. The anteceding algorithm can also determine helpful statistics for succeeding algorithms. As the anteceding algorithm has to scan the data anyway, computing intermediate results happens as a by-product of computing the anteceding algorithm's result. On the one hand, a succeeding algorithm can save time because several steps of that algorithm have already been pre-computed. On the other hand, additional information about the analysed data can improve the quality of results such as the accuracy of classification, as demonstrated in experiments with synthetical and real data.
Original languageEnglish
Title of host publicationProceedings of the 8th International Conference on Enterprise Information Systems (ICEIS 2006), 23-27, May 2006 Paphos, Cyprus
Editors Yannis Manolopoulos, Joaquim Filipe, Panos, Constantopoulos, José Eordeiro
Pages135-143
Number of pages9
Publication statusPublished - May 2006

Publication series

NameICEIS 2006 - 8th International Conference on Enterprise Information Systems, Proceedings
VolumeAIDSS

Fields of science

  • 102 Computer Sciences
  • 102015 Information systems

Cite this