PrefixSpan

PrefixSpan --- An implementation of prefix-projected sequential pattern mining in Java

Authors: Raúl Moya Reyes <raulmoya.es>
         Agustín Ruiz Linares <agustruiz.es>
         University of Jaén

      (Based on Yasuo Tabei <[email protected]> code in C++, University of Tokyo)

==========================================================================================

FORMAT OF params.txt FILE

The file must contain 3 rows, in this order:

    PATH_TO_DATA_FILE
    MINIMUM_SUPPORT
    MAXIMUM_PATTERN
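As a rough illustration of the three-row layout, the sketch below reads such a file in Java. The class name `ParamsSketch`, the nested `Params` holder, and the choice to treat the second and third rows as integers are assumptions made for this example; they are not taken from the project's source.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;

public class ParamsSketch {
    /** Hypothetical holder for the three rows; not the project's own type. */
    static final class Params {
        final String dataFile;
        final int minSupport;   // assumption: integer count
        final int maxPattern;   // assumption: integer length limit

        Params(String dataFile, int minSupport, int maxPattern) {
            this.dataFile = dataFile;
            this.minSupport = minSupport;
            this.maxPattern = maxPattern;
        }
    }

    /** Parses the rows in the order: data file path, minimum support, maximum pattern. */
    static Params parse(List<String> rows) {
        if (rows.size() < 3) {
            throw new IllegalArgumentException("expected 3 rows, got " + rows.size());
        }
        return new Params(rows.get(0).trim(),
                          Integer.parseInt(rows.get(1).trim()),
                          Integer.parseInt(rows.get(2).trim()));
    }

    public static void main(String[] args) throws IOException {
        String file = args.length > 0 ? args[0] : "params.txt";
        Params p = parse(Files.readAllLines(Paths.get(file)));
        System.out.println(p.dataFile + " " + p.minSupport + " " + p.maxPattern);
    }
}
```

Run without arguments, it looks for `params.txt` in the working directory and echoes the three parsed values.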

==========================================================================================

FORMAT OF INPUT DATA:

    3 1 3 4 5
    2 3 1
    3 4 4 3
    1 3 4 5
    2 4 1
    6 5 3

Each line corresponds to one transaction, which is a sequence of items separated by single spaces.
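A minimal sketch of reading this format in Java follows. It assumes that items are integers and that blank lines are skipped; neither point is stated in this README. The class and method names are made up for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class InputSketch {
    /** Turns each non-blank line into one transaction (a list of integer items). */
    static List<List<Integer>> parse(List<String> lines) {
        List<List<Integer>> db = new ArrayList<>();
        for (String line : lines) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue; // assumption: blank lines are not transactions
            }
            List<Integer> transaction = new ArrayList<>();
            for (String token : trimmed.split(" ")) {
                transaction.add(Integer.parseInt(token)); // assumption: integer items
            }
            db.add(transaction);
        }
        return db;
    }

    public static void main(String[] args) {
        // Two made-up transactions, just to show the shape of the result.
        List<List<Integer>> db = parse(Arrays.asList("1 2 3", "2 3"));
        for (int id = 0; id < db.size(); id++) {
            System.out.println(id + ": " + db.get(id));
        }
    }
}
```

The index of each transaction in the returned list can then serve as its transaction ID, counted from 0.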

==========================================================================================

FORMAT OF RESULTS:

    itemsets
    ( ids ) freq
    itemsets
    ( ids ) freq
    itemsets
    ( ids ) freq
    ...

This result means:

    FREQUENT SEQUENCE : TRANSACTION ID : FREQUENCY
    1
    0 1 3 4 4
    1 3
    0 3 2
    1 3 4
    0 3 2
    1 3 4 5
    0 3 2
    1 3 5
    0 3 2
    1 4
    0 3 2
    ...

Each frequent sequence listed has a frequency no less than min_sup (-min_sup option) and a length less than or equal to max_pat (-max_pat option).
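To make the meaning of the transaction IDs and the frequency concrete, the sketch below counts support by brute force. It is not the PrefixSpan algorithm itself, only a check of what one result entry means. It assumes the sample input splits into the six transactions `3 1 3 4 5`, `2 3 1`, `3 4 4 3`, `1 3 4 5`, `2 4 1`, `6 5 3`, that IDs start at 0, and that a pattern matches when its items appear in order but not necessarily next to each other. All names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

public class SupportSketch {
    /** True if pattern occurs in seq in order (gaps between items are allowed). */
    static boolean contains(int[] seq, int[] pattern) {
        int matched = 0;
        for (int item : seq) {
            if (matched < pattern.length && item == pattern[matched]) {
                matched++;
            }
        }
        return matched == pattern.length;
    }

    /** IDs (0-based) of the transactions that contain the pattern. */
    static List<Integer> supportingIds(int[][] db, int[] pattern) {
        List<Integer> ids = new ArrayList<>();
        for (int id = 0; id < db.length; id++) {
            if (contains(db[id], pattern)) {
                ids.add(id);
            }
        }
        return ids;
    }

    // Assumed split of the sample input into six transactions.
    static final int[][] SAMPLE = {
        {3, 1, 3, 4, 5}, {2, 3, 1}, {3, 4, 4, 3}, {1, 3, 4, 5}, {2, 4, 1}, {6, 5, 3}
    };

    public static void main(String[] args) {
        int[][] patterns = { {1}, {1, 3}, {1, 4} };
        for (int[] pattern : patterns) {
            List<Integer> ids = supportingIds(SAMPLE, pattern);
            System.out.println(java.util.Arrays.toString(pattern)
                    + " ids=" + ids + " freq=" + ids.size());
        }
    }
}
```

Under these assumptions, the pattern `1` is found in transactions 0, 1, 3 and 4 (frequency 4), and `1 3` in transactions 0 and 3 (frequency 2), which agrees with the first entries of the sample results.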