-
Notifications
You must be signed in to change notification settings - Fork 19
Allow prallelization of grid search #650
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@tamarl08 you'll need to rebase. |
f67775e to
0f2f0ed
Compare
mulhod
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I kind of wonder about exposing this in the config since it's a detail that doesn't necessarily have to do with input/experiment. I don't know where else it would be put, but it seems slightly out of place. I'm not sure there's a better way to do it, but my initial thought was that it could be exposed by an environment variable. But I do see the benefit. I wonder if there's a possibility -- if it's not too complicated -- to also have this be controlled by an environment variable, though.
|
@mulhod I see your point. I also don't like the fact that it defaults to 1, but can get a value of None to enable parallelization. I didn't find anything else that we control using an environment variable, do we do that? Another option I thought of, is adding a general option of sending extra skll args dict in the call to |
|
Honestly, I am not that bothered by the existence of this parameter since it's meant for advanced users who will probably already be familiar with the same parameter in SKLL. I am not sure I see the benefits of using an environment variable over a configuration setting since then users have to think about two places rather than one for settings. I think the idea of being able to pass a dictionary is great for advanced users but we should make that a longer term goal rather than for this PR. |
|
I think what I was perhaps envisioning was something like: Get environment variable "SKLL_GRID_JOBS" if it exists; if not, use "grid_jobs" from config; if neither, use |
By the way, this is how it works in https://github.com/EducationalTestingService/skll/blob/main/skll/utils/constants.py#L138 |
Codecov ReportPatch coverage is
📢 Thoughts on this report? Let us know!. |
Honestly, I've never really been a fan of that approach and I think most users of SKLL either don't know about it or use it. It also doesn't seem to be mentioned in the docs at all so it's more of a hidden setting. |
|
@tamarl08 did you already check that automatic and interactive config generation work with this new field? |
|
I added the field to the output and the tests pass, do we usually do manual testing as well? |
|
Yes, I usually like to test the config generation by running |
Done! |
desilinguist
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Thank you!
No description provided.