DOC add paragraph on "AI usage disclosure" to Automated Contributions Policy and PR Template #32566
Conversation
Thanks @AnneBeyer , looks good! Just not sure how best to put the 'select one of the following' part.
Co-authored-by: Lucy Liu <jliu176@gmail.com>
doc/developers/contributing.rst
> We agree that AI can be a useful development assistant, but the use of any kind of
> AI assistance has to be disclosed in the PR description. Not doing so is not only
> rude to the human maintainers, it also makes it difficult to determine how much
> scrutiny needs to be applied to the contribution.
I'd make this shorter and more of a statement without opinion (e.g. is or isn't AI useful).
For example "If you use AI tools please state so in your Pull Request description."
Yes, I can make it shorter, but should this also become more of a "please do so", or keep the stricter tone? Or should we even be more explicit on the consequences, like in the paragraph above on fully-automatic submissions ("Maintainers reserve the right, at their sole discretion, to close such submissions")?
Not sure. I like being polite.
I also like to try and keep it short, because no one reads anything, and the longer the text, the fewer people read it :-/
An AI contributions policy that is quite good: https://github.com/zulip/zulip/blob/main/CONTRIBUTING.md#ai-use-policy-and-guidelines - maybe there is something we can borrow/copy from them
I added an updated version that
@betatim: I like the Zulip AI policy section and integrated some parts, but the whole thing kind of contradicts our keeping-it-short discussion here.
lucyleeow
left a comment
Some nits only but looks good to me, thank you for the changes!
@StefanieSenger originally added both these sections, I think. I would like to wait for her opinion as she has more background/context here.
I like the Zulip guide, e.g., this is nice:
It is long though, and I am not sure we want to spend the time agreeing on the nitty-gritty details of our AI use policy (or maintaining it as AI use and tool performance develop). It would be nice if we could all just point to one source for this. Edit: FYI, re-started the failing CI 🤞
> <!--
> If AI tools were involved in creating this PR, please disclose their usage here and make
> sure that you adhere to our Automated Contributions Policy:
> https://scikit-learn.org/dev/developers/contributing.html#automated-contributions-policy
> -->
Referring to an earlier version where you had a list of AI assistance that was used. stdlib uses a similar list (https://github.com/stdlib-js/stdlib/blob/develop/.github/PULL_REQUEST_TEMPLATE.md#ai-assistance) with checkboxes. Maybe a checkbox to state whether they used AI makes it more explicit/difficult to ignore. e.g., this is what a stdlib PR looks like:
I like the checkbox approach.
For reference, here is what I had initially (minus the correct formatting):
Please select one of the following:
- No AI assistance was used in the creation of this PR.
- I used AI assistance in the creation of this PR (specifically <ADD TOOLS/DETAILS HERE>), but I confirm that I checked and understood all changes and can explain them on request.
- This PR was created by an AI Agent.
The stdlib template goes on with a disclaimer section similar to what I added in the second option:
Disclosure
If you answered "yes" to using AI assistance, please provide a short disclosure indicating how you used AI assistance. This helps reviewers determine how much scrutiny to apply when reviewing your contribution. Example disclosures: "This PR was written primarily by Claude Code." or "I consulted ChatGPT to understand the codebase, but the proposed changes were fully authored manually by myself.".
{{TODO: add disclosure if applicable}}
I'm not sure what the best way is here to avoid adding too much burden on the contributor and maintainer side, while still having some easy way of detecting non-compliant PRs.
Maybe we can start by copying just the upper part of the stdlib checklists and see how good Agents are at adapting to that?
Jumping in from the side with a comment 😅: I think adding this to the PR template is the wrong approach. I can really relate to the argument that we should not add additional burden / bureaucracy for people who want to contribute, nor for reviewers.
My hope in adding "disclose AI use" to our policy would be to make it easier to tell off people who use AI in an irresponsible, harmful way, but I really don't care whether someone has used a bit of AI or none at all, and I don't want maintainers to be in the position of investigating what people claim to be doing compared to what they are actually doing.
In my opinion, informing people in contributing.rst that they have to disclose AI use in the PR is enough. Hardly anybody will do that, and that's fine. We don't want to discuss with AI spammers whether they have pushed AI code they have not reviewed at all or whether they have only used AI as a helper.
If they fail to disclose their AI usage, we can then easily tell people who irresponsibly open gen-AI PRs that they did two things wrong: 1. hardly or not at all supervising their gen-AI PRs, and 2. not telling us about it.
That puts us in a position where we don't need to point people to the Automated Contributions Policy, because we expect them to be informed. This in turn makes it easier to deal with those cases. (I am speaking of the nastiest 3% of contributions; the rest of the contributors would be untouched by this, no matter whether they use AI as a helper or not.)
Yeah good thoughts.
> Hardly anybody will do that and that's fine.
I think this is the situation that's not ideal; I have not seen anyone offer this info in a PR. What do you think of a simple yes/no checkbox for use of AI? It sort of forces a response, as it makes it 'standard'. It would be nice to know before reviewing, and it would hopefully make people less reluctant to share: AI is very commonly used for research/understanding (and even from a data-collection perspective it may be interesting).
I see the value in having a clear signal from PR authors for reviewers. Sorry that I didn't reflect on that and only expressed concerns in my last message. I didn't mean it to sound dismissive, @lucyleeow and @AnneBeyer.
> What do you think of a simple checkbox yes/no for use of AI?
I like it. Though posed like this, I am sure most people would have to check "yes". (I certainly always would, since I constantly chat back and forth with LLMs to have them explain the coding world to me.) What we mean is whether they used AI as a coding assistant, I think.
What do you think of adding a simple checkbox "[ ] AI assistance used for coding?" without a "yes"/"no" option, which you only check if you have used AI for coding?
We could try it to collect some data on how people use it and if it is useful for reviewers and adjust later if it doesn't prove helpful.
I think that, so far, we have not asked people to disclose their AI usage (adding this to the guidelines is also part of this PR), so I'm not too surprised people don't do it yet.
I think we can go with a trial-and-error approach here. Adding the section heading to the template is a first step towards making it more obvious that this disclosure is expected. Adding a checklist could actually also make it less effort for both sides: we might not get information as detailed as with a free-text field (like which kind of tools people used), but people might be more likely to actually set a check mark than to fill in text. So we could go with something like this (and observe what AI Agents make of it for a while):
I used AI assistance for (please check all that apply):
- [ ] Code generation (e.g., when writing an implementation or fixing a bug)
- [ ] Test/benchmark generation
- [ ] Documentation (including examples)
- [ ] Research and understanding
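Put together, the proposed disclosure section might look something like the following sketch of a PR template fragment. Note that the exact wording, heading level, and placement in `.github/PULL_REQUEST_TEMPLATE.md` are illustrative assumptions based on this discussion, not necessarily the merged version:

```markdown
<!--
If AI tools were involved in creating this PR, please disclose their usage here and make
sure that you adhere to our Automated Contributions Policy:
https://scikit-learn.org/dev/developers/contributing.html#automated-contributions-policy
-->

#### AI usage disclosure

I used AI assistance for (please check all that apply):

- [ ] Code generation (e.g., when writing an implementation or fixing a bug)
- [ ] Test/benchmark generation
- [ ] Documentation (including examples)
- [ ] Research and understanding
```

The `- [ ]` syntax renders as interactive task-list checkboxes on GitHub, so contributors can tick items directly in the PR description without editing markdown.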
What do you think? @lucyleeow @StefanieSenger
I'm fine with this, especially since it's important to others on the team.
I do still have concerns about adding an extra task for contributors while collecting information that may not be reliable, but we can try it and adjust the PR template later if it turns out not to be helpful, or once we feel we've learned enough from it.
I'm +1 for this. I think a checkbox is easy enough to do, so I don't think it will be a burden in that sense.
But, I can understand that it would be a 'burden' because some people may feel like we would 'value' their contribution less if they say they used AI. As you said, we can always iterate.
Co-authored-by: Tim Head <betatim@gmail.com>
betatim
left a comment
I like it. Let's use it and see what happens.
I think the open discussion from Lucy has converged, but I'll let the participants declare that themselves. So won't merge yet.
Would you like to have another look, @lucyleeow, @StefanieSenger?
lucyleeow
left a comment
Sorry 2 nits and then I will merge!
Enabling auto merge, hopefully no CI timeouts 🤞
… Policy and PR Template (scikit-learn#32566) Co-authored-by: Lucy Liu <jliu176@gmail.com> Co-authored-by: Tim Head <betatim@gmail.com>
…#4051)
* Add an AI-assisted contributions policy taken mostly from Awkward Array's (https://github.com/scikit-hep/awkward/), which was based on scikit-learn's Automated Contributions Policy.
* Add AI-assistance disclosure checkboxes to the pull request template.
* c.f. scikit-hep/awkward#3831, scikit-learn/scikit-learn#32566. Note that the Awkward Array language is more pro-AI-usage while the scikit-learn language is more neutral.

Context: this was discussed in the AI section of the [2026 Snakemake Hackathon](https://indico.cern.ch/event/1574891/) at TUM ([GitHub project board](https://github.com/orgs/snakemake/projects/8)).

QC:
* [N/A] The PR contains a test case for the changes, or the changes are already covered by an existing test case.
* [x] The documentation (`docs/`) is updated to reflect the changes, or this is not necessary (e.g. if the change modifies neither the language nor the behavior or functionalities of Snakemake).

Co-authored-by: Johannes Köster <johannes.koester@uni-due.de>
Reference Issues/PRs
First draft towards extending the Automated Contributions Policy for PRs to require a disclosure of AI usage, as discussed towards the end of #31679
What does this implement/fix? Explain your changes.
Adds a paragraph on the required disclosure of AI usage to the Automated Contributions Policy and extends the PR template with a corresponding selection.
AI usage disclosure
(Though it could be useful to play around with different formulations/AI suggestions in this case...)
Any other comments?
Any comments/suggestions on the wording are welcome!