Skip to content

Matrix Assembly

BSBolt AggregrateMatrix takes a list of CGmap files, compressed or uncompressed, and assembles a consensus methylation matrix. Methylated sites that pass a read depth threshold and are present in a set proportion of samples are included in the matrix.

BSBolt AggregateMatrix Commands

bsbolt AggregateMatrix -F {file1.CGmap,file2.CGmap,...} -O {output_matrix.txt}

-h, --help              show this help message and exit

Options:
  -F File,File,.        comma separated list of CGmap files
                        or path to text file with list of line separated CGmap files
  -S Str,Str,.          comma separated list of samples labels. If sample labels are not provided sample labels
                        are extracted from CGmap files. Can also pass path to txt for line separated sample labels.
  -min-coverage Int     minimum site read depth coverage [10]
  -min-sample Float     proportion of samples that must have a valid site (above minimum coverage threshold) [0.8]
  -O File               Aggregate matrix output path
  -CG                   Only output CG sites
  -verbose              Verbose aggregation
  -t Int                Number of threads to use when assembling matrix [1]
  -count                Output a count matrix with count of methylated cytosines and total observed cytosines

Aggregate Matrix Default Settings

python3 -m bsbolt AggregateMatrix -F cgmap_1,cgmap_2,cgmap_3 -O ~/test_matrix.txt

Aggregate Matrix Default Settings - File List

python3 -m bsbolt AggregateMatrix -F cgmap_file_list.txt -O ~/test_matrix.txt

Aggregate Matrix Default Settings - File List, Sample Labels, Verbose

python3 -m bsbolt AggregateMatrix -F cgmap_file_list.txt -S sample1,sample2,sample3 -O ~/test_matrix.txt -verbose