Why use unaligned sequences as "aligned" input instead of the actually aligned sequences? #1054

Why do we do this (also in our other profile configs):

https://github.com/nextstrain/ncov/blob/cfa73beacae90007648dbe1362af9e896cb3713e/nextstrain_profiles/nextstrain-open/builds.yaml#L28-L34

instead of something more self-explanatory like this?

```diff
-# Note: unaligned sequences are provided as "aligned" sequences to avoid an initial full-DB alignment
-# as we re-align everything after subsampling.
 inputs:
   - name: open
     metadata: "s3://nextstrain-data/files/ncov/open/metadata.tsv.gz"
-    aligned: "s3://nextstrain-data/files/ncov/open/sequences.fasta.xz"
+    aligned: "s3://nextstrain-data/files/ncov/open/aligned.fasta.xz"
     skip_sanitize_metadata: true
```

	# Note: unaligned sequences are provided as "aligned" sequences to avoid an initial full-DB alignment
	# as we re-align everything after subsampling.
	inputs:
	  - name: open
	    metadata: "s3://nextstrain-data/files/ncov/open/metadata.tsv.gz"
	    aligned: "s3://nextstrain-data/files/ncov/open/sequences.fasta.xz"
	    skip_sanitize_metadata: true
