| layout | docpost |
|---|---|
| title | What happens to uploaded data on CLIMB |
| date_published | 2020-03-29 23:20:00 +0000 |
| date_modified | 2021-04-16 13:30:00 +0000 |
| author | samstudio8 |
| maintainer | samstudio8 |
Data uploaded by users to CLIMB with sufficient metadata is periodically pulled through the elan nextflow pipeline.
elan is responsible for basic quality checking including the following:
- Filtering unmapped reads to ensure this step is done, previously Elan also sorted BAMs but will now reject unsorted BAMs
- Ensuring the BAM is valid with
samtools quickcheck - Counting the proportion of non-ambiguous, ambigious and invalid bases in the consensus FASTA
- Counting the proportion of positions in the aligned BAM that are above certain coverage thresholds
- Pruning potentially spurious or human-looking reads from uploaded BAMs (for ENA uploads)
- Where applicable, checks the depth of tiles amplified by the ARTIC protocol
Once elan has finished, the following artifacts are automatically published with a version number based on the date:
fasta: All consensus FASTA with the naming strategy<coguk_id>.<run_name>.climb.fastaalignment: Each filtered, sorted and checked BAM with the naming strategy<coguk_id>.<run_name>.climb.bamqc: A basic quality report for each COGUK ID.
Data is automatically discarded by the following criteria (QC spec v2 2021-04-16):
- Illumina
- Average BAM depth less than 10x
- BAM depth less than 10x over at least 50% of the reference positions
- Consensus FASTA containing more than 50% Ns
- Nanopore
- Average BAM depth less than 20x
- BAM depth less than 20x over at least 50% of the reference positions
- Consensus FASTA containing more than 50% Ns
- Ion Torrent
- Average BAM depth less than 30x
- BAM depth less than 20x over at least 50% of the reference positions
- Consensus FASTA containing more than 50% Ns
- Illumina
- Average BAM depth less than 10x
- BAM depth less than 10x over at least 10% of the reference positions
- Consensus FASTA containing more than 10% Ns
- Nanopore
- Average BAM depth less than 20x
- BAM depth less than 20x over at least 10% of the reference positions
- Consensus FASTA containing more than 10% Ns
- Ion Torrent
- Average BAM depth less than 30x
- BAM depth less than 20x over at least 10% of the reference positions
- Consensus FASTA containing more than 10% Ns