SequencingSet.Rd
R6 Class for loading and analysing sequence sets
R6 Class for loading and analysing sequence sets
floundeR::FloundeR
-> SequencingSet
enumerate
prepares a simple 1D Angenieux
enumeration of the provided dataset
for quick visualisation of the dataset.
N50
Calculate and return the N50 value for passed quality sequence reads in
the current SequencingSet
object
mean
Calculate and return the mean sequence length for passed quality reads in
the SequencingSet
object
new()
Initialise a new instance of the R6 Class SequencingSet
SequencingSet$new(keycol, seqsum = NA)
keycol
a pointer to the column of interest in the seqsum
to
direct parsing and exploration of the file.
seqsum
a tibble of sequencing summary information
as_tibble()
Export the imported dataset(s) as a tibble
This object consumes a sequencing summary file (and optionally the corresponding barcoding_summary file) and creates an object in memory that can be explored, sliced and filtered. This method dumps out the in-memory object for further exploration and development.
SequencingSet$as_tibble()
A tibble representation of the starting dataset
read_length_bins()
bin the sequences in seqsum
content into bins of sequence length
The nanopore sequencing run is expected to return a collection of sequences that vary in their length distributions; this variance is a function of the sequencing library prepared, the starting DNA etc. This method is used to bin reads into uniform bins to assess the distribution of sequence lengths.
SequencingSet$read_length_bins( normalised = TRUE, cumulative = FALSE, bins = 20, outliers = 0.025 )
normalised
should the sequence collection be reported to normalise for the number of sequence bases sequenced or the number of sequence reads - TRUE by default to normalise for sequenced bases.
cumulative
defines whether cumulative sequence bases (reads) are reported per bin (FALSE by default).
bins
the number of sequence bins that should be prepared (20 by default)
outliers
defines the number of outliers (0.025 = 2.5%) that are excluded from the longest reads to prepare a richer distribution visulation - the plots can be bothered by the long tail of mini-whales.
Angenieux 2D graph object
quality_bins()
bin the sequences in seqsum
content into bins of quality
The nanopore sequencing run is expected to return a collection of sequences that vary in their quality distributions; this variance is a function of the sequencing library prepared, the starting DNA etc. This method is used to bin reads into uniform quality bins to assess the overall quality of the run and to identify potential issues
SequencingSet$quality_bins(bins = 20, outliers = 0)
bins
the number of sequence bins that should be prepared (20 by default)
outliers
defines the number of outliers (0 = 0%) that are excluded from the reads - should probably be deprecated for simplicity??
Angenieux 2D graph object
clone()
The objects of this class are cloneable with this method.
SequencingSet$clone(deep = FALSE)
deep
Whether to make a deep clone.