Skip to content

Fix #6182: Add Google Batch LogsPolicy PATH option for GCS bucket log…#6431

Merged
bentsherman merged 21 commits intonextflow-io:masterfrom
dglazer:fix-issue-6182
Nov 13, 2025
Merged

Fix #6182: Add Google Batch LogsPolicy PATH option for GCS bucket log…#6431
bentsherman merged 21 commits intonextflow-io:masterfrom
dglazer:fix-issue-6182

Conversation

@dglazer
Copy link
Contributor

@dglazer dglazer commented Sep 26, 2025

… storage

  • Add logsBucket configuration option to BatchConfig
  • Support both CLOUD_LOGGING (default) and PATH log destinations
  • Add validation for GCS bucket paths (must start with gs://)
  • Include comprehensive tests for new functionality
  • Maintain backward compatibility

🤖 Generated with Claude Code
NOTE: this change still needs human review.

Signed-off-by: David Glazer dglazer@verily.com

@netlify
Copy link

netlify bot commented Sep 26, 2025

Deploy Preview for nextflow-docs-staging canceled.

Name Link
🔨 Latest commit 0f791be
🔍 Latest deploy log https://app.netlify.com/projects/nextflow-docs-staging/deploys/69162b88819cc800084b7af2

@dglazer dglazer marked this pull request as draft September 26, 2025 14:00
@emmarogge emmarogge force-pushed the fix-issue-6182 branch 3 times, most recently from be1236d to 71f6f4a Compare September 29, 2025 19:35
… bucket log storage

- Add logsBucket configuration option to BatchConfig
- Support both CLOUD_LOGGING (default) and PATH log destinations
- Add validation for GCS bucket paths (must start with gs://)
- Include comprehensive tests for new functionality
- Maintain backward compatibility

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
Signed-off-by: David Glazer <dglazer@verily.health>
David Glazer and others added 2 commits September 29, 2025 19:39
- Add helper methods to extract bucket names and convert GCS paths to mount paths
- Ensure logs bucket is mounted as Volume in GoogleBatchScriptLauncher
- Update LogsPolicy to use container mount paths instead of GCS paths
- Add comprehensive tests for bucket mounting and path conversion
- Addresses reviewer feedback about missing bucket mounting requirements

This ensures Google Batch can write logs to the specified GCS bucket by
properly mounting it before referencing it in the LogsPolicy PATH.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
Signed-off-by: Emma Rogge <emmarogge@verily.health>
1. Rename variable due to existing variable with same name.
2. Specify parameter type as string.

Signed-off-by: Emma Rogge <emmarogge@verily.health>
@vdauwera
Copy link

vdauwera commented Oct 2, 2025

@bentsherman I had a quick chat with @dglazer, this PR is ready for review (sounds like marking the PR as not draft can only be done by someone with write access? idk)

@bentsherman bentsherman marked this pull request as ready for review October 2, 2025 17:50
@bentsherman
Copy link
Member

@dglazer thanks for the contribution. What's the benefit of being able to send logs to a bucket instead of the standard cloud logging?

@dglazer
Copy link
Contributor Author

dglazer commented Oct 2, 2025

Thanks for taking a look @bentsherman . The short answer is that it lets us provide a unified management experience to researchers, with the same ACLs controlling access to workflow inputs, outputs, and logs, and without them having to be familiar with GCP-native logging tools. @emmarogge can provide more details if needed.

@bentsherman bentsherman self-requested a review November 12, 2025 17:35
@bentsherman bentsherman linked an issue Nov 12, 2025 that may be closed by this pull request
@bentsherman
Copy link
Member

@dglazer @emmarogge I tried to push some updates to better incorporate your changes but for some reason it is not showing up on this PR. Can you see my commit from yesterday?

@dglazer
Copy link
Contributor Author

dglazer commented Nov 13, 2025

I don't -- the most recent entry on the Commits tabs is from me merging master yesterday (3cdcdb0)

Signed-off-by: Ben Sherman <bentshermann@gmail.com>
@bentsherman bentsherman requested a review from a team as a code owner November 13, 2025 17:33
@bentsherman
Copy link
Member

Okay I think it's fixed now. You might want to do another round of testing to make sure everything still works on your end with my changes

@dglazer
Copy link
Contributor Author

dglazer commented Nov 13, 2025

Thank you @bentsherman -- yes, we'll confirm things still work after your changes; I'll defer to @emmarogge on how quickly but it won't be long.

@emmarogge
Copy link

I will do another round of testing now and report back -- thank you so much for the review @bentsherman !

@emmarogge
Copy link

Update: I have once again built Nextflow from this PR locally and tested out running some workflows with logging to a target GCS bucket configured. Logging appeared as expected in the bucket and the workflows succeeded. @bentsherman It's ready for your stamp when you have a chance!

@bentsherman bentsherman merged commit 5b61afe into nextflow-io:master Nov 13, 2025
24 checks passed
@bentsherman
Copy link
Member

Thanks again for the contribution. This feature will show up in the next edge release (25.11.0-edge) which should be in the next few weeks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Google Batch log to bucket

4 participants