[GLM-OCR] GLM-OCR Support#43391
Conversation
zucchini-nlp
left a comment
There was a problem hiding this comment.
Great use of modular! Only a few comments and I think the PR will be ready to merge! Also, since the model isn't yet released, let's skip slow integration tests with a "reason" so that CI doesn't complain
| ("glm_ocr", "GlmOcrConfig"), | ||
| ("glm_ocr_text", "GlmOcrTextConfig"), | ||
| ("glm_ocr_vision", "GlmOcrVisionConfig"), |
There was a problem hiding this comment.
note to myself: this is getting too reprtitive with more MM models added. Will try to come up with a cleaner way!
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
|
View the CircleCI Test Summary for this PR: https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=43391&sha=48594f |
|
This PR also removes all restrictions on image sizes smaller than 28 in GLM-V (support in all GLM-V models) |
|
Oke, will fix tests and then can merge |
ArthurZucker
left a comment
There was a problem hiding this comment.
Thanks great modular, lgtm
|
[For maintainers] Suggested jobs to run (before merge) run-slow: auto, glm46v, glm4v, glm4v_moe, glm_ocr |
Need reivew for @zucchini-nlp