Implement subsampling with strain lists instead of sequences and metadata

**Context**
Subsampling currently extracts metadata and sequences for each subsample rule, so each corresponding `augur filter` call has to read through the full sequence and metadata files.

**Description**
We can speed up subsampling by rewriting these rules to output only strain names (or strain names and metadata) and then adding a rule that emits the subsampled sequences and metadata based on those strain names with augur filter's.
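
The proposal above can be illustrated with a minimal sketch in plain Python. It does not call augur and is not the workflow's implementation. It only shows the shape of the idea: each subsample rule contributes a list of strain names, and a single final pass extracts the matching metadata rows and sequences. The function names, the `strain` column name, and the in-memory inputs are all hypothetical and chosen for illustration.

```python
import csv
import io


def read_strain_lists(strain_list_texts):
    """Return the union of strain names from several one-name-per-line lists."""
    strains = set()
    for text in strain_list_texts:
        for line in text.splitlines():
            name = line.strip()
            if name:
                strains.add(name)
    return strains


def filter_metadata(metadata_tsv, strains, id_column="strain"):
    """Keep metadata rows whose id column is in the selected strain set.

    The column name "strain" is an assumption for this sketch.
    """
    reader = csv.DictReader(io.StringIO(metadata_tsv), delimiter="\t")
    return [row for row in reader if row[id_column] in strains]


def filter_fasta(fasta_text, strains):
    """Keep FASTA records whose header name is in the selected strain set."""
    kept = {}
    current = None  # name of the record being kept, or None if skipping
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            name = line[1:].strip()
            current = name if name in strains else None
            if current is not None:
                kept[current] = ""
        elif current is not None:
            kept[current] += line.strip()
    return kept


# Hypothetical outputs of two subsample rules: strain names only.
subsample_lists = ["A\nB\n", "B\nC\n"]

# Hypothetical full inputs, read once in the final step.
metadata_tsv = "strain\tregion\nA\tEurope\nB\tAsia\nC\tAfrica\nD\tOceania\n"
fasta_text = ">A\nACGT\n>B\nAC\nGT\n>C\nTTTT\n>D\nGGGG\n"

selected = read_strain_lists(subsample_lists)
subsampled_metadata = filter_metadata(metadata_tsv, selected)
subsampled_sequences = filter_fasta(fasta_text, selected)
```

In this sketch the full metadata and sequences are scanned once in the final step, whatever the number of strain lists. How each subsample rule produces its strain list is left out.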

Implement subsampling with strain lists instead of sequences and metadata #807