BBMap

BBMap uses a multi kmer seed and extend strategy for read mapping.

Indexing

Note

Indexing might take some time but only has to be run once per fasta file. Make sure to reuse already computed indices if possible.

DICAST will check if $indexdir/$indexname exists. If there is no index it will be automatically built. If you want to rebuild the index anyway set $recompute_index=true in scripts/mapping_config.sh. If you want to use your own precomputed index file copy it to index/bbmap-index/ and make sure the index is complete and named appropriately and according to the parameters set in the config files. We recommend including the name of the fasta file in the index name to avoid overwriting. Per default this is already the case and no parameter changes are needed.

Parameters

These are the default parameters set in the src/bbmap/ENTRYPOINT.sh script. If you want to change it you can do this in the ENTRYPOINT script directly. Please refer to the BBMap manual.

-in

Fastq filename of paired end read 1.

-in *yourFastqFile1_*1.fastq
-in2

Fastq filename of paired end read 2.

-in2 *yourFastqFile1_*2.fastq
-ref

Reference genome in fasta format.

-ref $fasta
-path

Base name of the index folder and files.

-path $indexdir/$indexname
-intronlen

Length of introns.

-intronlen 20
-xstag

Add sam flags to improve compatibility with alternative splicing tools.

-xstag us
-outm

The path to the mapped output file in sam format. The output will be separated into case and control folder based on the basefolder of the according fastq file.

-outm $outdir/$controlfolder/*yourFastqFile1_*bbmap.sam
-outu

The path to the unmapped output file in sam format. The output will be separated into case and control folder based on the basefolder of the according fastq file.

-outu $outdir/$controlfolder/*yourFastqFile1_*.bbmap_unmapped.sam

Known Issues

  • issue

  • another

  • another