Why do we do this (also in our other profile configs):
|
# Note: unaligned sequences are provided as "aligned" sequences to avoid an initial full-DB alignment |
|
# as we re-align everything after subsampling. |
|
inputs: |
|
- name: open |
|
metadata: "s3://nextstrain-data/files/ncov/open/metadata.tsv.gz" |
|
aligned: "s3://nextstrain-data/files/ncov/open/sequences.fasta.xz" |
|
skip_sanitize_metadata: true |
instead of something more self-explanatory like this?
-# Note: unaligned sequences are provided as "aligned" sequences to avoid an initial full-DB alignment
-# as we re-align everything after subsampling.
inputs:
- name: open
metadata: "s3://nextstrain-data/files/ncov/open/metadata.tsv.gz"
- aligned: "s3://nextstrain-data/files/ncov/open/sequences.fasta.xz"
+ aligned: "s3://nextstrain-data/files/ncov/open/aligned.fasta.xz"
skip_sanitize_metadata: true
Why do we do this (also in our other profile configs):
ncov/nextstrain_profiles/nextstrain-open/builds.yaml
Lines 28 to 34 in cfa73be
instead of something more self-explanatory like this?