Compressed read index - bioinf data structure for fast querying substring in reads.
- SGA
- SDSL
- Boost (widely available as system package)
- PStreams (widely available as system package)
- libboost - "libboost-filesystem" and "libboost-system" libs
make examples
compiles examples and all dependencies
- test e-coli bacteria
- implement bloom filter?
- 6M reads (generated from yarrowia.fasta), read length 100, 1% of errors
- 352K critical reads
- superstring compress ratio 2.59
- superstring size 231M (1.85M reads were missing)
- RSS ~ 300MB