preview

Hash-Based Analysis : Direct Hashing And Pruning Algorithm

Satisfactory Essays

Direct Hashing and Pruning Algorithm proposed by J.S. Park et al [27] in 1995 uses hash based techniques to effectively generate candidate sets. DHP uses hash- based techniques to generate candidate 2-itemsets which are smaller in number as compared to other previous methods and uses pruning techniques to curtail the database transaction count. Both of these features enhance its performance in comparison to Apriori algorithm.

7 Mining quantitative association rules in large relational tables1996R. Srikant and R. Agrawal [28] suggested association rule mining for qualitative and categorical data and coined the term “Quantitative Association Rules” for these discovered rules. They proposed an algorithm for mining quantitative association …show more content…

Due to the use of weights in calculation of support measure, downward closure property no longer holds, therefore, previous algorithms cannot be used. Authors also proposed a new measure called k-support bound to be used in mining process.

10 Pincer search1998Typical algorithms mine frequent itemsets using bottom–up and breadth first approach which show a reasonable performance as long as the size of maximal frequent itemsets is not large. In 1998 D. I. Lin and Z. M. Kedem[31] proposed Pincer search algorithm which works on both bottom-up and top-down approach. The bottom up search uses Apriori like technique while in top-down search a new data structure Maximum Frequent Candidate Set (MFCS) is used. The algorithm also defines another set called the Maximum Frequent Set (MFS) which is collection of all the itemsets which are frequent and no proper superset of them is frequent (maximal frequent itemsets). If any itemset in a pass in bottom-up search is found to be infrequent then it will be removed from the MFCS. Similarly, frequent itemset found in top-down approach, is used to prune the candidate set in bottom-up approach. The algorithm discovers the maximal frequent item- sets early in the process by using the MFCS, which in turn reduces the number of candidates and database scans.This improves efficiency when the large maximal frequent itemsets are present in the early passes.

11 CHARM 1999M.J. Zaki and C.J. Hsiao [32] developed an algorithm for

Get Access