
Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach


raulmoyareyes/prefixspan


PrefixSpan

PrefixSpan --- An implementation of prefix-projected sequential pattern mining in Java

Authors: Raúl Moya Reyes <raulmoya.es>, Agustín Ruiz Linares <agustruiz.es> (University of Jaén)

      (Based on the C++ code by Yasuo Tabei <[email protected]>, University of Tokyo)

==========================================================================================

FORMAT OF params.txt FILE

The file must contain three rows, in this order:

PATH_TO_DATA_FILE
MINIMUM_SUPPORT
MAXIMUM_PATTERN
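As a rough illustration of the three-row layout, the sketch below reads such a file into a small holder object. The class and field names (`ParamsSketch`, `Params`, `dataPath`, `minSupport`, `maxPattern`) are hypothetical and not taken from this repository, and it assumes that the second and third rows are plain integers.

```java
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;

public class ParamsSketch {
    // Hypothetical holder for the three rows of params.txt.
    static final class Params {
        final String dataPath;
        final int minSupport;
        final int maxPattern;

        Params(String dataPath, int minSupport, int maxPattern) {
            this.dataPath = dataPath;
            this.minSupport = minSupport;
            this.maxPattern = maxPattern;
        }
    }

    // Parses the rows in the order the README lists them.
    // Assumption: minimum support and maximum pattern length are integers.
    static Params parse(List<String> rows) {
        if (rows.size() < 3) {
            throw new IllegalArgumentException(
                    "params.txt needs 3 rows, got " + rows.size());
        }
        return new Params(
                rows.get(0).trim(),
                Integer.parseInt(rows.get(1).trim()),
                Integer.parseInt(rows.get(2).trim()));
    }

    public static void main(String[] args) throws Exception {
        // Reads params.txt from the working directory (location is an assumption).
        Params p = parse(Files.readAllLines(Paths.get("params.txt")));
        System.out.println(p.dataPath + " " + p.minSupport + " " + p.maxPattern);
    }
}
```

A file whose rows are `data.txt`, `2` and `5` would yield a data path of `data.txt`, a minimum support of 2 and a maximum pattern length of 5.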

==========================================================================================

FORMAT OF INPUT DATA:

3 1 3 4 5
2 3 1
3 4 4 3
1 3 4 5
2 4 1
6 5 3

Each line corresponds to one transaction, which is a sequence of items separated by single spaces.
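The input format can be read with a few lines of Java. This is only a sketch: the class name `InputSketch` is hypothetical, items are assumed to be integers as in the example, blank lines are skipped, and the position of a transaction in the resulting list is taken as its zero-based transaction ID.

```java
import java.util.ArrayList;
import java.util.List;

public class InputSketch {
    // Turns one text line such as "3 1 3 4 5" into an int array of items.
    static int[] parseLine(String line) {
        String[] tokens = line.trim().split("\\s+");
        int[] items = new int[tokens.length];
        for (int i = 0; i < tokens.length; i++) {
            items[i] = Integer.parseInt(tokens[i]);
        }
        return items;
    }

    // Parses all non-blank lines; the list index serves as the transaction ID.
    static List<int[]> parse(List<String> lines) {
        List<int[]> db = new ArrayList<>();
        for (String line : lines) {
            if (!line.trim().isEmpty()) {
                db.add(parseLine(line));
            }
        }
        return db;
    }
}
```

Splitting on `\\s+` rather than on a single space makes the sketch tolerant of accidental double spaces or tabs, which goes slightly beyond what the format requires.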

==========================================================================================

FORMAT OF RESULTS:

itemsets ( ids ) freq
itemsets ( ids ) freq
itemsets ( ids ) freq
...

This result means:

FREQUENT SEQUENCE : TRANSACTION ID : FREQUENCY
1 : 0 1 3 4 : 4
1 3 : 0 3 : 2
1 3 4 : 0 3 : 2
1 3 4 5 : 0 3 : 2
1 3 5 : 0 3 : 2
1 4 : 0 3 : 2
...

Each line reports one frequent sequence, that is, a sequence whose frequency is at least min_sup (-min_sup option) and whose length is at most max_pat (-max_pat option).
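To make the roles of min_sup and max_pat concrete, here is a minimal pattern-growth sketch in the spirit of PrefixSpan. It is not the code of this repository: the class `PrefixSpanSketch` and its methods are hypothetical, sequences are treated as plain item lists, and max_pat is applied as a hard length limit. The six transactions in `main` are one split of the example input that is consistent with the reported transaction IDs; the exact line breaks are an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

public class PrefixSpanSketch {
    // Returns each frequent sequence mapped to the IDs of the transactions containing it.
    static Map<List<Integer>, List<Integer>> mine(List<int[]> db, int minSup, int maxPat) {
        Map<List<Integer>, List<Integer>> out = new LinkedHashMap<>();
        List<int[]> start = new ArrayList<>();
        for (int id = 0; id < db.size(); id++) {
            start.add(new int[] {id, 0});
        }
        grow(db, start, new ArrayList<>(), minSup, maxPat, out);
        return out;
    }

    // Each projected entry is {transaction ID, offset where the remaining suffix starts}.
    private static void grow(List<int[]> db, List<int[]> projected, List<Integer> prefix,
                             int minSup, int maxPat, Map<List<Integer>, List<Integer>> out) {
        if (prefix.size() == maxPat) {
            return; // the prefix already has the maximum allowed length
        }
        // For every item, record its first occurrence in each suffix (items kept sorted).
        Map<Integer, List<int[]>> next = new TreeMap<>();
        for (int[] entry : projected) {
            int[] seq = db.get(entry[0]);
            Set<Integer> seen = new HashSet<>();
            for (int pos = entry[1]; pos < seq.length; pos++) {
                if (seen.add(seq[pos])) {
                    next.computeIfAbsent(seq[pos], k -> new ArrayList<>())
                        .add(new int[] {entry[0], pos + 1});
                }
            }
        }
        for (Map.Entry<Integer, List<int[]>> e : next.entrySet()) {
            if (e.getValue().size() < minSup) {
                continue; // extension is not frequent enough
            }
            prefix.add(e.getKey());
            List<Integer> ids = new ArrayList<>();
            for (int[] entry : e.getValue()) {
                ids.add(entry[0]);
            }
            out.put(new ArrayList<>(prefix), ids);
            grow(db, e.getValue(), prefix, minSup, maxPat, out);
            prefix.remove(prefix.size() - 1); // backtrack to the shorter prefix
        }
    }

    public static void main(String[] args) {
        List<int[]> db = Arrays.asList(
                new int[] {3, 1, 3, 4, 5}, new int[] {2, 3, 1}, new int[] {3, 4, 4, 3},
                new int[] {1, 3, 4, 5}, new int[] {2, 4, 1}, new int[] {6, 5, 3});
        // First line printed: [1] : [0, 1, 3, 4] : 4
        mine(db, 2, 5).forEach((pattern, ids) ->
                System.out.println(pattern + " : " + ids + " : " + ids.size()));
    }
}
```

With a minimum support of 2, item 6 is dropped because it occurs in only one transaction, while the sequence 1 3 4 5 is kept because it occurs in transactions 0 and 3. Lowering the maximum pattern length to 1 would restrict the output to single items.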
