2. The relevant research
exploitation Problem useful set high because Yao and Hamilton proposed in 2004 [4]. The authors propose UMining algorithm based on block (upper bound) of the HUI useful to exploit. Then UMining-H, a form of UMining heuristic calculation by the change in the useful block to prune candidate. Both UMining and UMining-H are likely trimmed HUI wrong files. In 2005, Liu and colleagues proposed a new block called on the TWU (Transaction Weighted Utilization) for HUI mining [6]. TWU agreed nature of the itemset closures should decrease can rely on for pruning candidates. Therefore, the authors proposed algorithm based on Two-Phase TWU to prune candidate. Two-Phase is divided into two phases include (1) Exploit all itemset with greater than or equal minutil TWU (the minimum threshold specified by the user included) (2) From the set of itemset with TWU agreed minutil , Two-Phase scan the database (DB) to calculate the usefulness of each itemset and filter out useful itemset have agreed minutil degrees. Two-Phase due to spend a lot of time scanning the database and cause many candidates in Phase 1 should not efficient on large databases.
đang được dịch, vui lòng đợi..