Conversation
Hi, I would be happy to contribute and provide source code on that topic (in particular provenance / PROV-O).
@albangaignard your help is greatly appreciated. I have added a skeleton. Basically, one just needs to feed the skeleton with the information retrieved via the python-ro API.
…ted to the SnakeMake architecture
Thanks very much for your feedback. This would be completely in line with the PROV profile of CWLprov (https://github.com/common-workflow-language/cwlprov/blob/main/prov.md). Regarding dataprov, the approach is interesting, but it apparently does not leverage a standard for representing provenance metadata such as the one proposed by the W3C (https://www.w3.org/TR/prov-primer/).
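To make the PROV-O reference concrete, here is a minimal stdlib-only sketch of how one workflow step could be expressed with W3C PROV-O terms (`prov:Activity`, `prov:Entity`, `prov:used`, `prov:wasGeneratedBy`) serialized as Turtle. The URIs and the `step_to_prov` helper are hypothetical illustrations, not part of Snakemake or CWLprov.

```python
# Sketch: render one rule execution as W3C PROV-O triples in Turtle.
# All names and URIs below are made up for illustration.

PROV = "http://www.w3.org/ns/prov#"

def step_to_prov(activity_uri, used_uris, generated_uris):
    """Render one workflow step as PROV-O Turtle statements."""
    lines = [f"@prefix prov: <{PROV}> ."]
    lines.append(f"<{activity_uri}> a prov:Activity .")
    for u in used_uris:
        lines.append(f"<{activity_uri}> prov:used <{u}> .")
        lines.append(f"<{u}> a prov:Entity .")
    for g in generated_uris:
        lines.append(f"<{g}> a prov:Entity ;")
        lines.append(f"    prov:wasGeneratedBy <{activity_uri}> .")
    return "\n".join(lines)

if __name__ == "__main__":
    print(step_to_prov(
        "urn:example:run/rule_align",
        ["urn:example:data/reads.fq"],
        ["urn:example:data/aligned.bam"],
    ))
```

A real implementation would use an RDF library rather than string assembly, but the vocabulary usage is the point here.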
Please format your code with black:
Kudos, SonarCloud Quality Gate passed!
@johanneskoester, I recently reviewed the code quality based on the automatic checks (SonarCloud) and code formatting best practices (the Black tool). Would you have time to review this pull request? In summary:
SonarCloud Quality Gate failed.
```python
def workdir_entry(i, f):
    location = "??inputs.input_files[{}].location??".format(i)
    if f.is_directory:
        if os.path.isdir(f):
```
This I don't understand. We use the `is_directory` property of `IOFile` here because the files may not yet be present; in that case, `isdir` would not work.
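The distinction the reviewer draws can be sketched as follows: a declared `is_directory` flag is known before the job runs, whereas `os.path.isdir()` probes the file system and returns `False` for any output that has not been produced yet. `PlannedFile` below is a hypothetical stand-in for Snakemake's `IOFile`, not the real class.

```python
# Sketch: declared metadata vs. probing the file system.
# `PlannedFile` is a hypothetical stand-in for Snakemake's IOFile.
import os

class PlannedFile:
    def __init__(self, path, is_directory=False):
        self.path = path
        # Declared in the workflow definition, not probed from disk.
        self.is_directory = is_directory

f = PlannedFile("results/qc_dir", is_directory=True)

# Before the job has run, the path does not exist on disk:
print(os.path.isdir(f.path))   # False if the job has not produced it yet
# The declared flag is reliable regardless of execution state:
print(f.is_directory)          # True
```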
```python
from snakemake.logging import logger
from snakemake.stats import Stats
from snakemake.utils import format, Unformattable, makedirs
from snakemake.provenance_tracking.provenance import provenance_manager
```
Since the provenance manager only has to be part of AbstractExecutor, I think we could avoid the singleton and just keep it in there instead.
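The suggested refactor can be sketched like this: instead of importing a module-level singleton, each executor instance owns its provenance manager. The class bodies below are illustrative stand-ins, not the actual Snakemake classes.

```python
# Sketch: provenance manager as an executor attribute instead of a
# module-level singleton. Class names are illustrative only.

class ProvenanceManager:
    def __init__(self):
        self.records = []

    def track(self, job_name):
        self.records.append(job_name)

class AbstractExecutor:
    def __init__(self, track_provenance=False):
        # Created per executor instance; nothing global to import.
        self.provenance_manager = (
            ProvenanceManager() if track_provenance else None
        )

    def run_job(self, job_name):
        if self.provenance_manager is not None:
            self.provenance_manager.track(job_name)

ex = AbstractExecutor(track_provenance=True)
ex.run_job("rule_a")
print(ex.provenance_manager.records)  # ['rule_a']
```

Keeping the manager on the executor also makes it trivial to disable provenance tracking per run and avoids hidden state shared across tests.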
```python
# print(job.params['biotools_id'])
tool_name = ""
if "biotools_id" in job.params.keys():
    tool_name = job.params["biotools_id"]
```
Mhm, is it possible to extract the biotools_ids from the conda packages instead?
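One way to read this suggestion: derive the bio.tools identifier from the job's declared conda packages, falling back to the explicit `biotools_id` param only when no mapping is known. The `CONDA_TO_BIOTOOLS` table and the plain-dict job below are hypothetical; in practice such a mapping could come from package metadata rather than being hard-coded.

```python
# Sketch: prefer bio.tools IDs inferred from conda packages, fall back to
# an explicit `biotools_id` param. Mapping and job layout are hypothetical.

CONDA_TO_BIOTOOLS = {
    "bwa": "biotools:bwa",
    "samtools": "biotools:samtools",
}

def tool_name_for(job):
    # Prefer identifiers inferred from the declared conda packages...
    for pkg in job.get("conda_packages", []):
        if pkg in CONDA_TO_BIOTOOLS:
            return CONDA_TO_BIOTOOLS[pkg]
    # ...and fall back to an explicit param, as the PR currently does.
    return job.get("params", {}).get("biotools_id", "")

print(tool_name_for({"conda_packages": ["bwa"]}))            # biotools:bwa
print(tool_name_for({"params": {"biotools_id": "mytool"}}))  # mytool
```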
```python
    input_id_list=job.input,
    tool_name=tool_name,
    job_uri=job.uri,
)
```
As you know, Snakemake has its own metadata tracking. I wonder if it would make sense to export the research object from there instead of collecting it while running in here. The advantage is that you would also get information for work that happened in a previous run. The flag would then rather be a post-hoc command, just like --report, instead of requiring the user to remember to add it while running. Also, multiple partial runs of the same workflow would not result in several separate provenance information files.
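The post-hoc export idea can be sketched as follows: read provenance from the per-output metadata that Snakemake already records, merging records from any number of past runs into one view, instead of collecting it during execution. The record layout and the `export_provenance` helper are hypothetical; the point is that a separate command (analogous to --report) naturally covers earlier partial runs too.

```python
# Sketch: build one provenance view post hoc from previously recorded
# per-output metadata. Record layout and helper name are hypothetical.

def export_provenance(metadata_records):
    """Merge per-output metadata from any number of past runs."""
    merged = {}
    for rec in metadata_records:  # e.g. loaded from .snakemake/ metadata
        merged[rec["output"]] = {
            "rule": rec["rule"],
            "inputs": rec["inputs"],
        }
    return merged

# Records from two separate partial runs end up in a single export:
runs = [
    {"output": "a.txt", "rule": "make_a", "inputs": []},
    {"output": "b.txt", "rule": "make_b", "inputs": ["a.txt"]},
]
prov = export_provenance(runs)
print(sorted(prov))  # ['a.txt', 'b.txt']
```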
I am so sorry for the late response. This completely slipped my attention (I get so many GitHub notifications that I sometimes miss one). Nice work, please see my comments above.
- Incorporates @epruesse's fix for MRE snakemake#1
- Adds a fix for MRE snakemake#2
- Properly marks group jobs as finished
- Some minor updates to tests
* add failing tests 823
* fix mistakes
* black
* Fix the first two MREs from #823.
  - Incorporates @epruesse's fix for MRE #1
  - Adds a fix for MRE #2
  - Properly marks group jobs as finished
  - Some minor updates to tests
* Fix tests on Windows
* Skip MRE 2 from 823 on Windows due to `pipe()` output

Co-authored-by: Maarten-vd-Sande <maartenvandersande@hotmail.com>
Co-authored-by: Johannes Köster <johannes.koester@uni-due.de>
SonarCloud Quality Gate failed.
The Pixi install GitHub action is failing with "failed to parse pypi name mapping" errors, likely due to rate limiting when 30+ jobs are kicked off nearly simultaneously.

I tested this fix on #3820, since the tests kept failing due to the `pixi` install action failing. After committing this change, [the actions ran successfully](https://github.com/snakemake/snakemake/actions/runs/20684893321). In [this failing run's](https://github.com/snakemake/snakemake/actions/runs/20682879051/job/59383597583#step:3:3261) debug logs we see:

```
pixi install -e py311
[...]
 WARN resolve_conda{group=py313 platform=win-64}: reqwest_retry::middleware: Retry attempt #1. Sleeping 1.225245051s before the next attempt
Error:   × failed to parse pypi name mapping
  ├─▶ error decoding response body
  ╰─▶ expected value at line 1 column 1
```

This warning is repeated many times until pixi finally stops retrying; this is what suggested to me that some sort of rate limit was the issue.

One downside is that this makes the CI take a bit longer to run. We could consider using the `cache` feature of the pixi action, turning up the max-parallel setting, or reducing the number of test groups.

### QC

* [ ] The PR contains a test case for the changes or the changes are already covered by an existing test case.
* [ ] The documentation (`docs/`) is updated to reflect the changes or this is not necessary (e.g. if the change does neither modify the language nor the behavior or functionalities of Snakemake).
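The retry-with-backoff behavior that pixi's middleware shows in the log above (increasing sleep between attempts, giving up after a limit) can be sketched in a few lines. The `fetch` callable and the exception type are hypothetical stand-ins for a rate-limited HTTP request.

```python
# Sketch: exponential backoff with jitter for a rate-limited request.
# `fetch` is a hypothetical callable that raises on a rate-limit response.
import random
import time

def with_backoff(fetch, attempts=5, base=1.0):
    for attempt in range(attempts):
        try:
            return fetch()
        except RuntimeError:
            if attempt == attempts - 1:
                raise  # out of retries, propagate the failure
            # Sleep base * 2**attempt seconds plus random jitter, mirroring
            # delays like "Sleeping 1.225245051s" in the log above.
            time.sleep(base * 2 ** attempt + random.random() * base)
```

Jitter spreads out retries from many concurrent jobs so they do not all hit the rate limit again at the same moment; reducing job fan-out (as this PR does) attacks the same problem from the other side.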
Going to close this since it's quite old and main has diverged so far.