Publication:
An Efficient Method for Mining Top-<i>K</i> Closed Sequential Patterns

datacite.subject.fos oecd::Natural sciences::Mathematics
dc.contributor.author Thi-Thiet Pham
dc.contributor.author Tung Do
dc.contributor.author Anh Nguyen
dc.contributor.author Bay Vo
dc.contributor.author Tzung-Pei Hong
dc.date.accessioned 2022-11-03T09:02:10Z
dc.date.available 2022-11-03T09:02:10Z
dc.date.issued 2020
dc.description.abstract The problem of exploiting Closed Sequential Patterns (CSPs) is an essential task in data mining, with many different applications. It is used to resolve the situations of huge databases or low minimum support (minsup) thresholds in mining sequential patterns. However, it is challenging and needs a lot of time to customize the minsup values for generating appropriate numbers of CSPs desired by users. To conquer this issue, the TSP algorithm for mining top-k CSPs was previously proposed, with k being a given parameter. The algorithm would return the k CSPs which have the highest support values in a database. However, its execution time and memory usage were high. In this paper, an algorithm named TKCS (Top-K Closed Sequences) is proposed to mine the top-k CSPs ef ciently. To improve the execution time and memory usage, it uses a vertical bitmap database to represent data. Besides, it adopts some useful strategies in the process of exploiting the top-k CSPs such as: always choosing the sequential patterns with the greatest support values for generating candidate patterns and storing top-k CSPs in an ascending order of the support values to increase the minsup value more quickly. The empirical results show that TKCS has better performance than TSP for discovering the top-k CSPs in terms of both runtime and memory usage.
dc.identifier.doi 10.1109/ACCESS.2020.3004528
dc.identifier.uri http://repository.vlu.edu.vn:443/handle/123456789/848
dc.language.iso en_US
dc.relation.ispartof IEEE Access
dc.relation.issn 2169-3536
dc.subject Closed sequential pattern
dc.subject data mining
dc.subject sequential pattern
dc.subject top-k sequential patterns.
dc.title An Efficient Method for Mining Top-<i>K</i> Closed Sequential Patterns
dc.type journal-article
dspace.entity.type Publication
oaire.citation.volume 8
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
AS390.pdf
Size:
5.3 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: