Workflow params by bentsherman · Pull Request #5929 · nextflow-io/nextflow

bentsherman · 2025-03-31T16:31:23Z

Related to #4669

This PR is the first step towards defining workflow params entirely in the pipeline script. It allows you to define a params block like so:

params {
  input: Path
  save_intermeds: Boolean = false
}

workflow {
  // ...
}

instead of assigning individual params. The advantage is that Nextflow can validate params natively once the params block is defined, because the block guarantees that all params are declared in one place.

There is still some work required to make the validation work, but the high-level flow is:

1. User specifies params on the command line / params file
2. Config files can override script params or define "config params" which are only used by the config
3. When the params block is defined in the script, config params are ignored and only overrides from the command line / config are applied. If a script param was not specified and has no default value, an error is reported. If a CLI param was not already defined in the config or script, an error is reported.
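
The flow above can be sketched in a few lines of Python. This is only an illustration of the rules as described, not the Nextflow implementation: `ParamDecl` and `resolve_params` are hypothetical names, and the precedence of CLI values over config values is an assumption.

```python
from dataclasses import dataclass
from typing import Any

_REQUIRED = object()  # sentinel: the param has no default value


@dataclass
class ParamDecl:
    """One entry of the script's params block (hypothetical model)."""
    name: str
    type: type
    default: Any = _REQUIRED


def resolve_params(decls, cli, config):
    """Resolve script params from CLI and config values."""
    declared = {d.name for d in decls}
    # A CLI param must be declared in the script or defined in the config.
    unknown = sorted(set(cli) - declared - set(config))
    if unknown:
        raise ValueError(f"undeclared params: {unknown}")
    resolved = {}
    for d in decls:
        if d.name in cli:            # assumed: CLI takes precedence
            resolved[d.name] = cli[d.name]
        elif d.name in config:       # config may override the script default
            resolved[d.name] = config[d.name]
        elif d.default is not _REQUIRED:
            resolved[d.name] = d.default
        else:
            raise ValueError(f"missing required param: {d.name}")
    # Config-only params (e.g. "outdir" below) are not passed to the script.
    return resolved


decls = [ParamDecl("input", str), ParamDecl("save_intermeds", bool, False)]
print(resolve_params(decls, cli={"input": "samples.csv"}, config={"outdir": "results"}))
```

Keeping all declarations in one list is what makes both error cases cheap to detect: a missing required param and an undeclared CLI param are each a single set or dictionary lookup.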

TODO:

Separate config params from CLI params to identify invalid params (i.e. params that weren't declared in the script or config)

netlify · 2025-03-31T16:31:44Z

✅ Deploy Preview for nextflow-docs-staging ready!

Name	Link
🔨 Latest commit	`d6496c9`
🔍 Latest deploy log	https://app.netlify.com/projects/nextflow-docs-staging/deploys/68ac872c77040f0008a4d334
😎 Deploy Preview	https://deploy-preview-5929--nextflow-docs-staging.netlify.app

bentsherman · 2025-03-31T16:46:20Z

Regarding the schema generation, there are two approaches we could take:

1. Declare only the param name / type / default value in the script, put everything else in an auxiliary JSON/YAML file
2. Put everything in the script that is used for the schema -- description, icon, form validation rules, section headers, etc

I was initially leaning towards (2), because it would simplify the schema generation and we could validate the available schema properties. But this would also make the params definition really long and verbose in the script, whereas the pipeline code only cares about the name / type / default value.

So now I'm starting to lean towards (1). In that case we could have a really concise definition:

params {
  input: Path
  save_intermeds: boolean = false
}

workflow {
  println "input = ${params.input}"
  println "save_intermeds = ${params.save_intermeds}"
}

Or even directly in the entry workflow:

workflow {
  params:
  input: Path
  save_intermeds: boolean = false

  main:
  println "input = ${input}"
  println "save_intermeds = ${save_intermeds}"
}

This concise syntax will work only if we're certain we don't need anything else in the script. I thought maybe the help text would be useful for CLI help, but that could be provided through a Javadoc comment.

In this case, the schema generation would look something like this:

1. Run nextflow schema to initialize a bare-bones JSON schema from the params definition
2. Populate the schema with extra information (help text, icons)
3. Run nextflow schema periodically to update the JSON schema from the params definition, overwriting fields like name / type / schema (perhaps with an appropriate warning)
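
A rough Python sketch of that round trip, under stated assumptions: `update_schema` and `TYPE_MAP` are hypothetical, the type mapping is a guess, and the real command may behave differently. The point is only that script-sourced fields are overwritten on every run while hand-written annotations survive.

```python
import json

# Assumed mapping from declared param types to JSON Schema types.
TYPE_MAP = {"Path": "string", "String": "string", "boolean": "boolean", "Integer": "integer"}


def update_schema(decls, existing=None):
    """Build or refresh a schema from param declarations.

    decls: list of dicts with "name", "type" and an optional "default".
    existing: a previously generated (and possibly annotated) schema.
    """
    old_props = (existing or {}).get("properties", {})
    props, required = {}, []
    for d in decls:
        # Start from any hand-written annotations (help text, icons, ...).
        prop = dict(old_props.get(d["name"], {}))
        # Fields sourced from the script always win.
        prop["type"] = TYPE_MAP[d["type"]]
        if "default" in d:
            prop["default"] = d["default"]
        else:
            prop.pop("default", None)
            required.append(d["name"])
        props[d["name"]] = prop
    return {"type": "object", "properties": props, "required": required}


decls = [
    {"name": "input", "type": "Path"},
    {"name": "save_intermeds", "type": "boolean", "default": False},
]
schema = update_schema(decls)                                         # step 1
schema["properties"]["input"]["description"] = "Path to the samplesheet"  # step 2
decls[1]["default"] = True                                            # the script changes
schema = update_schema(decls, schema)                                 # step 3
print(json.dumps(schema, indent=2))
```

In this sketch a param that is removed from the script is also dropped from the schema, along with its annotations; whether that should produce a warning is left open, as in the comment above.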

ewels · 2025-04-01T15:57:10Z

> This concise syntax will work only if we're certain we don't need anything else in the script.

I'm not entirely sure that this is the case; the schema is used for validation of more than just type. I know that some of these things can be handled with Records (e.g. enum choices), but what about things like pattern, min/max, uniqueItems, etc.?

> I thought maybe the help text would be useful for CLI help, but that could be provided through a Javadoc comment

I'd be curious to see how this might look - ideally for both description and helptext in one.

bentsherman · 2025-04-01T16:19:05Z

> I know that some of these things can be handled with Records (eg. enum choices), but what about things like pattern, min/max and uniqueItems etc?

Any kind of validation should be possible through custom types and constructor functions (functions that create the custom type and implement the validation logic). But not all of those cases can be automatically translated to the schema.

For example, I can automatically generate a pattern from a type definition, but not things like min and max. Unless we did something crazy like Min<0, Max<Integer, 10>> 😅
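
To make the "constructor function" idea concrete, here is a minimal Python analogy (not Nextflow code; `SampleId`, `Threads` and both functions are made up for illustration). The range check lives inside the function body, which is why a tool that only reads type declarations cannot turn it into a schema `minimum` / `maximum`.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class SampleId:
    value: str


def sample_id(raw: str) -> SampleId:
    """Constructor function: builds the custom type and enforces a pattern."""
    if not re.fullmatch(r"[A-Za-z0-9_]+", raw):
        raise ValueError(f"invalid sample id: {raw!r}")
    return SampleId(raw)


@dataclass(frozen=True)
class Threads:
    value: int


def threads(raw: int) -> Threads:
    """Constructor function: enforces a min/max that the type alone does not express."""
    if not 1 <= raw <= 10:
        raise ValueError(f"threads must be between 1 and 10, got {raw}")
    return Threads(raw)


print(sample_id("sample_01"), threads(4))
```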

I could go either way at this point. I like the concise syntax of declaring params in the entry workflow, but CLI libraries like argparse are also pretty standard, so maybe the concise syntax is just too restrictive

> I'd be curious to see how this might look - ideally for both description and helptext in one.

Copying your example from our slack convo:

workflow {
  params:
  /**
   * Path to comma-separated file containing information about the samples in the experiment.
   *
   * You will need to create a design file with information about the samples in your experiment
   * before running the pipeline. Use this parameter to specify its location.
   * It has to be a comma-separated file with 4 columns, and a header row.
   * See [usage docs](https://nf-co.re/rnaseq/usage#samplesheet-input).
   */
  input: Path

  /**
   * If generated by the pipeline save the STAR index in the results directory.
   *
   * If an alignment index is generated by the pipeline use this parameter
   * to save it to your results folder.
   * These can then be used for future pipeline runs, reducing processing times.
   */
  save_reference: boolean = false

  main:
  // ..
}

bentsherman · 2025-04-01T16:20:49Z

Using the Javadoc comment is "better" in the sense that you only need to parse the script to produce the CLI help; you don't have to execute it, which I think would be excessive.
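
As a rough illustration of "parse, don't execute", the sketch below pulls the doc comment, name, type and default out of a params section with a regular expression. It is a toy: a real implementation would presumably use the Nextflow parser's AST, and `PARAM_RE` / `parse_param_docs` are hypothetical names. The first paragraph of the comment is treated as the description and the rest as help text.

```python
import re

PARAM_RE = re.compile(
    r"/\*\*(?P<doc>.*?)\*/\s*"                  # Javadoc-style comment
    r"(?P<name>\w+)\s*:\s*(?P<type>[\w<>?]+)"   # name: Type
    r"(?:\s*=\s*(?P<default>[^\n]+))?",         # optional default value
    re.DOTALL,
)


def parse_param_docs(script: str):
    """Extract documented params from script text without running it."""
    params = []
    for m in PARAM_RE.finditer(script):
        # Strip the leading " * " decoration from each comment line.
        lines = [re.sub(r"^\s*\*?\s?", "", ln).rstrip() for ln in m["doc"].splitlines()]
        text = "\n".join(lines).strip()
        summary, _, details = text.partition("\n\n")
        params.append({
            "name": m["name"],
            "type": m["type"],
            "default": m["default"].strip() if m["default"] else None,
            "summary": " ".join(summary.split()),
            "details": " ".join(details.split()),
        })
    return params


SCRIPT = '''
workflow {
  params:
  /**
   * Path to the samplesheet.
   *
   * It has to be a comma-separated file with a header row.
   */
  input: Path

  /**
   * Save the STAR index in the results directory.
   */
  save_reference: boolean = false

  main:
  println "hello"
}
'''

for p in parse_param_docs(SCRIPT):
    default = f" (default: {p['default']})" if p["default"] else ""
    print(f"--{p['name']} <{p['type']}>{default}  {p['summary']}")
```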

bentsherman · 2025-04-03T15:24:24Z

See also Pydantic: https://docs.pydantic.dev/latest/concepts/fields/#validate-default-values

Simple param:

myparam: String = "default-value"

Full param:

myparam: String = Field(default: "default-value", pattern: "/some.*regex/")

bentsherman · 2025-04-03T15:31:41Z

Declaring params in the entry workflow means that you don't need the params. prefix anymore:

workflow {
  params:
  input: Path
  save_intermeds: boolean = false

  main:
  println "input = ${input}"
  println "save_intermeds = ${save_intermeds}"
}

On the one hand I like that it makes params more like workflow takes. On the other hand, you still need the params. prefix to use params in the config, so I fear that the end result would just be more confusing?

That would suggest that the params block is needed just for consistency with the config. Maybe we could allow the short and long forms like Pydantic:

params {
  input: Path {
    description '...'
    pattern '*.csv'
  }
  save_intermeds: boolean = false
}

workflow {
  println "input = ${params.input}"
  println "save_intermeds = ${params.save_intermeds}"
}

Though I always hesitate to add shortcuts if it makes the code less consistent

ewels · 2025-04-03T15:43:14Z

+1 for the separate params block. I feel that it is easier to read and understand for consistency, and it also avoids confusion with take, which does quite a similar thing.

Not sure about the squiggly bracket syntax. I like the thinking, but it means that we now have three different types of syntax for them. Nothing for config, : for types and = for variables / others. I can see that being really annoying.

That said, the Field() syntax can be confusing in its own ways, see the Pydantic docs:

> Using the f: <type> = Field(...) form can be confusing and might trick users into thinking f has a default value, while in reality it is still required.

bentsherman · 2025-04-03T15:54:20Z

Yeah, confusing the Field() with a default value is a serious drawback

The block syntax appeals to me because it is consistent with workflow outputs:

// fetchngs...
outputs {
  samples: Channel<Sample> {
    path '...'
    // ...
  }
}

// rnaseq...
params {
  input: List<Sample> {
    // ...
  }
}

Since we want to be able to match outputs to inputs for pipeline chaining, it makes sense to me that the syntax for inputs and outputs mirror each other.

The config is another issue. Let me think through that and write a separate comment...

bentsherman · 2025-04-03T16:01:42Z

We could add the same params block syntax to config files if consistency is an issue. But I fear this might feel too "weird" in a configuration context.

The nice thing about config params is that they basically have to be simple values (numbers, strings, booleans, etc). So being able to declare the type isn't so important because it can be inferred from the default value.

Meanwhile, if we take the hybrid approach of generating a skeleton schema that the user can annotate manually as needed, we don't need to add new syntax to the config file to support things like validation and help text, because those can just be defined in the JSON schema

kenibrewer · 2025-04-03T16:38:05Z

I really like this syntax proposal a lot:

params {
  input: Path {
    description '...'
    pattern '*.csv'
  }
  save_intermeds: boolean = false
}

This is likely in part because it mirrors Python/Pydantic, but I think that's a good thing for us to emulate. Imitating the syntax of the most popular typed Python extension will make it easier for folks to learn and feel like they can read and understand.

ewels · 2025-04-03T18:09:26Z

If we're leaning more towards that syntax, we could arguably have all JSON schema parameters covered. I think we need to make sure we're crystal clear on what we want to support.

For example, maybe no description as that's "decorative" and too verbose, so the example above becomes simply:

params {
  input: Path { pattern '*.csv' }
  save_intermeds: boolean = false
}

ewels · 2025-04-08T12:46:48Z

@bentsherman - any ideas how the schema builder might be able to reach into Nextflow code to update these definitions based on changes made in the GUI / JSON?

bentsherman · 2025-04-08T12:56:11Z

I don't think that will be possible. I think the flow will have to be:

1. Write params definition in main script
2. Generate skeleton JSON schema
3. Extend the schema by hand or via CLI wizard or upload to schema builder
4. Don't edit things that are sourced from the main script

Or if you use the schema builder from scratch, you have to update the params definition by hand (or not use it at all)

mashehu · 2025-04-08T13:17:40Z

I am just thinking of the following user story: I set the type of a parameter to boolean in the parameter definition. While writing the param definition in the schema builder, I see that the tool actually accepts 0, 1, and 2 as values. In the current version of the builder, I switch the parameter type in the GUI to integer and set a min and a max value. In the currently proposed setup, I would need to go back to the Nextflow code, export a new schema, and open the new schema in the builder. Not optimal imo.

bentsherman · 2025-04-08T15:48:14Z

The kind of discovery work you describe (i.e. figuring out the appropriate type of a param) is exactly the kind of thing that needs to happen in the pipeline code, so that the Nextflow compiler can verify param names and types as they are used in the entry workflow. You can't get that kind of validation in the schema builder.

Instead, the schema builder should be used to annotate a fixed set of params with things like help text, icons, form validation rules, etc. It should be primarily concerned with how params are accessed by external systems.

ewels · 2025-04-09T08:48:30Z

I agree that the logic / definition should be in the pipeline code: at least, that should be the source of truth. I was mostly wondering if we could have some way to update the Nextflow code from the JSON schema builder, to have the best of both worlds. The schema builder has some nice beginner-friendly functionality in it, for example a GUI with a built-in regex tester for writing patterns, and a bunch of built-in help text.

Maybe this is something that we could do with @RefTrace ? eg. From the Python CLI that launches the schema builder GUI, then go back and access the Nextflow code to edit it in place. Not sure if that's possible.

Or if we launch the schema GUI editor from Nextflow itself (with a local server etc) could there be a callback which is able to edit the Nextflow code? 🤔 We would know the param name and attributes..

bentsherman · 2025-04-09T13:46:17Z

The furthest I would go is to generate a code block in the schema builder that the user can copy into their Nextflow pipeline if they want. Automatically updating code from an external source is an anti-pattern in my view.

ewels · 2025-04-24T14:47:05Z

Suggestion from call: Would be nice to support single-line comments (//) in addition to Javadoc multi-line comments. This drops the number of lines per-param from 4 to 2, which makes quite a bit of difference if there are a lot of parameters.

christopher-hakkaart

I've added some suggestions to the docs. Looking good.

docs/migrations/25-04.md

docs/vscode.md

docs/workflow.md

christopher-hakkaart

Docs looking good 👍

pditommaso

I like the general idea, but there are some points that need to be improved.

I'm a bit concerned about this:

> Config files can override script params or define "config params" which are only used by the config

Maybe I'm misunderstanding, but I don't think config params should be managed differently from script params.

docs/migrations/25-04.md

docs/workflow.md

pditommaso

I believe this needs some work (as per previous comment).

bentsherman · 2025-07-01T19:12:58Z

Went ahead and added type annotations for params in this PR, since it seemed weird to add the params block without the ability to specify types.

One thing we need to figure out is what to do with the type casting of CLI params. We don't know at this point whether the user is using the new params definition, so I think we would need to either disable it by default or somehow "undo" it during the params validation.

EDIT: Since the params block is only supported by the v2 parser, I just disable the CLI params type casting when the v2 parser is enabled.

christopher-hakkaart · 2025-07-04T01:40:24Z

Docs are awesome.

bentsherman · 2025-07-09T15:17:09Z

@pditommaso ~~this is ready to merge~~ never mind, another test is failing...

muffato · 2025-07-09T20:46:32Z

> A parameter that doesn't specify a default value is a required param.

Can I specify null as the default value to indicate an optional param? Something like:

params {
  index: Path = null
}

That would be like Optional[Path] in Python

bentsherman · 2025-07-10T13:30:35Z

@muffato yes, you can default to null. We will also add nullable types so that you have to specify it as index: Path? = null if the param can be null. That will be in a separate PR.
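
For readers who know the Python analogy mentioned above, a minimal sketch of the same distinction using a standard-library dataclass (the `Params` class is purely illustrative and unrelated to Nextflow's implementation): a field without a default is required, and a nullable field with a null default is optional.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Optional


@dataclass
class Params:
    input: Path                    # no default: required
    index: Optional[Path] = None   # nullable with a null default: optional


p = Params(input=Path("samples.csv"))
print(p.index is None)
```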

bentsherman · 2025-07-10T17:50:50Z

@pditommaso now tests are passing, ready for you

pditommaso force-pushed the master branch from f6a3696 to 49b58d2 April 9, 2025 16:18

bentsherman marked this pull request as ready for review April 18, 2025 14:07

bentsherman requested review from a team as code owners April 18, 2025 14:07

bentsherman requested a review from pditommaso April 18, 2025 14:07

christopher-hakkaart reviewed Apr 27, 2025

docs/migrations/25-04.md

docs/vscode.md

docs/workflow.md

christopher-hakkaart approved these changes Apr 28, 2025

bentsherman linked an issue Apr 28, 2025 that may be closed by this pull request

Print warning for unused CLI parameters #4567

Closed

bentsherman added this to the 25.04 milestone May 1, 2025

pditommaso approved these changes May 2, 2025

docs/migrations/25-04.md

docs/migrations/25-04.md

docs/workflow.md

docs/workflow.md

pditommaso previously requested changes May 2, 2025

pditommaso force-pushed the master branch 3 times, most recently from b4b321e to 069653d June 4, 2025 18:54

bentsherman force-pushed the workflow-params-1 branch from db9d727 to 082d590 July 1, 2025 18:54

bentsherman changed the title ~~Workflow params (part 1)~~ Workflow params Jul 2, 2025

bentsherman force-pushed the workflow-params-1 branch from 082d590 to d4c5bf3 July 4, 2025 01:14

christopher-hakkaart approved these changes Jul 4, 2025

bentsherman force-pushed the workflow-params-1 branch 2 times, most recently from e48aa31 to b3e79ba July 9, 2025 13:59

bentsherman force-pushed the workflow-params-1 branch from e722241 to 95f9fa5 July 10, 2025 16:26

bentsherman force-pushed the workflow-params-1 branch from 95f9fa5 to 8bb2b36 July 15, 2025 01:35

bentsherman mentioned this pull request Jul 15, 2025

Type annotations #6278

Merged

bentsherman force-pushed the workflow-params-1 branch from 8bb2b36 to 414f513 July 18, 2025 22:47

bentsherman force-pushed the workflow-params-1 branch 3 times, most recently from 8d676e4 to 79e1c57 August 8, 2025 15:59

bentsherman mentioned this pull request Aug 21, 2025

camelCase params duplicated with hypens #2061

Closed

Workflow params [e2e prod]

d6496c9

Signed-off-by: Ben Sherman <bentshermann@gmail.com>

bentsherman force-pushed the workflow-params-1 branch from 79e1c57 to d6496c9 August 25, 2025 15:54

bentsherman merged commit 876d805 into master Aug 25, 2025
23 checks passed

bentsherman deleted the workflow-params-1 branch August 25, 2025 16:45

This was referenced Jan 30, 2026

Fix parameter priority using Nextflow session variables nf-core/differentialabundance#623

Merged

param priority in multi-config nf-core/differentialabundance#472

Closed
