BBMap
BBMap uses a multi kmer seed and extend strategy for read mapping.
Indexing
Note
Indexing might take some time but only has to be run once per fasta file. Make sure to reuse already computed indices if possible.
DICAST will check if $indexdir/$indexname exists. If there is no index it will be automatically built. If you want to rebuild the index anyway set $recompute_index=true
in scripts/mapping_config.sh.
If you want to use your own precomputed index file copy it to index/bbmap-index/ and make sure the index is complete and named appropriately and according to the parameters set in the config files.
We recommend including the name of the fasta file in the index name to avoid overwriting. Per default this is already the case and no parameter changes are needed.
Parameters
These are the default parameters set in the src/bbmap/ENTRYPOINT.sh script. If you want to change it you can do this in the ENTRYPOINT script directly. Please refer to the BBMap manual.
- -in
Fastq filename of paired end read 1.
-in *yourFastqFile1_*1.fastq- -in2
Fastq filename of paired end read 2.
-in2 *yourFastqFile1_*2.fastq- -ref
Reference genome in fasta format.
-ref $fasta- -path
Base name of the index folder and files.
-path $indexdir/$indexname- -intronlen
Length of introns.
-intronlen 20- -xstag
Add sam flags to improve compatibility with alternative splicing tools.
-xstag us- -outm
The path to the mapped output file in sam format. The output will be separated into case and control folder based on the basefolder of the according fastq file.
-outm $outdir/$controlfolder/*yourFastqFile1_*bbmap.sam- -outu
The path to the unmapped output file in sam format. The output will be separated into case and control folder based on the basefolder of the according fastq file.
-outu $outdir/$controlfolder/*yourFastqFile1_*.bbmap_unmapped.sam
Known Issues
issue
another
another