Skip to content

Number of queries stats in job.yaml #669

@tadast

Description

@tadast

Currently, Sequenceserver stores the total query length in job.yaml as query_length.

Add another datapoint that records the number of query sequences. This can be obtained by looking into the query (which ultimately is saved as query.fa, but hopefully this datapoint can be captured in-transit?).

The value would be a number of lines starting with > in the query in an ideal scenario. However, edge cases exist, when the > is omitted, e.g.:

  • if there are no lines starting with a >, the whole query is a single sequence
  • if the first line does not start with a >, the number of queries would be number of lines starting with > + 1 e.g. this query contains 2 sequences

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions