Location-Based Parallel Sequential Pattern Mining Algorithmopen access
- Authors
- Kim, Byoungwook; Yi, Gangman
- Issue Date
- 2019
- Publisher
- IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
- Keywords
- Big data; MapReduce; PrefixSpan; sequential pattern mining
- Citation
- IEEE ACCESS, v.7, pp 128651 - 128658
- Pages
- 8
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE ACCESS
- Volume
- 7
- Start Page
- 128651
- End Page
- 128658
- URI
- https://scholarworks.dongguk.edu/handle/sw.dongguk/18717
- DOI
- 10.1109/ACCESS.2019.2939937
- ISSN
- 2169-3536
- Abstract
- Given a data sequence, sequential pattern mining, which finds frequent sequence patterns among them, is an important data mining problem. However, in the existing sequential pattern mining, only the purchase order of the items is considered, and the position where the item is purchased is not considered. In this paper, we developed a sequential pattern mining algorithm using Apache spark. The proposed algorithm finds frequent sequential patterns in parallel by distributing data to several machines. Experimentally, we performed a comprehensive performance study on the proposed algorithm by varying various parameter values using various synthetic data. Experimental results show that the proposed algorithm shows a linear speed improvement over the number of machines.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Advanced Convergence Engineering > Department of Computer Science and Artificial Intelligence > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.