Results

Table of contents

What to expect
Explanation
Visualization

What to expect

After you run the pipeline, the output folder you chose at the beginning should look like this:

.
├── Output
│   ├── SpeciesDelim
│   ├── SpeciesID
│   ├── astraltree
│   ├── haplotypes
│   ├── results.html
│   ├── summary
│   └── tree
├── data
│   ├── demultiplexed
│   ├── filtered
│   └── raw
├── log
│   ├── SpecDelim
│   ├── SpecID
│   ├── ampliconsorter
│   ├── demulti
│   ├── html
│   ├── summary
│   └── tree
├── results
│   ├── SpeciesDelim
│   ├── SpeciesID
│   ├── astraltree
│   ├── consensus_seqs
│   ├── haplotypes
│   ├── html
│   ├── summary
│   └── tree
└── shell
    ├── demult1
    └── demult2

Explanation

`data` subfolder

This subfolder contains:

raw: raw fastq files, which are a copy of the ones contained in the samples csv file.
filtered: quality-filtered fastq files.
demultiplex: demultiplexed files (with a subfolder for each sample, containing a file for each demultiplexed locus).

`shell` subfolder

This subfolder contains:

demult1: shell script files for quality filtering and demultiplexing (executed in parallel when allowed).
demult2: shell script files for consensus generation and summarization (executed in parallel when allowed).

`results` subfolder

This subfolder contains:

consensus_seqs: in this sub-subfolder, we can find a directory for each sample, with a subdirectory for each locus: the Amplicon Sorter-generated consensus sequences are contained in this last level.
haplotypes: in this sub-subfolder, we can find a directory for each locus. In this, we will have a fasta file with the concatenated consensus sequences across all samples for the specific locus, along with their alignments and the MSA visualization image.
tree: in this sub-subfolder, we can find an ML tree for each locus, contained in a dedicated folder. You can also find the tree comparison heatmap and csv file.
astral: in this sub-subfolder, we can find the Astral-generated tree for the concatenated loci, both in pdf and png format.
SpeciesDelim: in this sub-subfolder, we can find the species delimitation outputs by ASAP for each locus and for Astral-concatenated loci. Here, there are partition-based representations, distance-based histograms and cumulative distributions, and text-based reports.
SpeciesID: in this sub-subfolder, we can find the species identification outputs divided according to the database from which they were obtained and subdivided among the loci. For what concerns BOLD outputs, we’ll have to pay attention to the final.csv output, which summarizes the overall species identification results.
html: in this sub-subfolder, we can find the html file to ease access to MSA visualization images.

`log` subfolder

This subfolder contains all the logs for the processes executed throughout the pipeline. It is advised that, if there is any problem with the pipeline, you trace it back here before reporting it, if possible.

`Output` subfolder

This subfolder contains all the information for results visualization (with results.html)

Visualization

The pipeline produces a results.html file to facilitate the visualization of the large amount of data output by the pipeline. Note that there are some compatibility issues with non-Linux browsers, so it is advised to open the file within a Linux terminal:

cd Output
firefox results.html
google-chrome --disable-gpu results.html

Results

What to expect

Explanation

data subfolder

shell subfolder

results subfolder

log subfolder

Output subfolder

Visualization

`data` subfolder

`shell` subfolder

`results` subfolder

`log` subfolder

`Output` subfolder