[Docs] Model_doc structure/clarity improvements#26876
[Docs] Model_doc structure/clarity improvements#26876MKhalusova merged 8 commits intohuggingface:mainfrom
Conversation
|
The documentation is not available anymore as the PR was closed or merged. |
stevhliu
left a comment
There was a problem hiding this comment.
Wow really nice! 🎉 I love the idea of standardizing the pages so it's more consistent when going from model page to model page.
|
If @ArthurZucker is busy, maybe @amyeroberts could take a look? |
amyeroberts
left a comment
There was a problem hiding this comment.
Mammoth piece of work! Thanks for all of this effort in making our documentation better ❤️
Main comment is about the tips for vision models mentioning the image processors. I realise this isn't something you added (I've approved in PRs) - it's just now in the refactoring and moving to tips I see it's not needed.
docs/source/en/model_doc/videomae.md
Outdated
| <Tip> | ||
|
|
||
| Use [`VideoMAEImageProcessor`] to prepare videos for the model. It will resize + normalize all frames of a video for you. | ||
| </Tip> |
There was a problem hiding this comment.
I'm not sure whether we need this tip. All the processing classes (tokenizers, image processors, feature extractors etc.) prepare the raw data for the model
docs/source/en/model_doc/bit.md
Outdated
| [[autodoc]] BitForImageClassification | ||
| - forward | ||
|
|
||
| No newline at end of file |
|
|
||
| ## Usage tips | ||
|
|
||
| - Use [`DeformableDetrImageProcessor`] to prepare images (and optional targets) for the model. |
There was a problem hiding this comment.
I'm not sure we need this tip - this is true for all vision models and their image processors
There was a problem hiding this comment.
Good point, I'll address this.
docs/source/en/model_doc/deta.md
Outdated
|
|
||
| <Tip> | ||
|
|
||
| Use [`DetaImageProcessor`] to prepare images and optional targets for the model. |
There was a problem hiding this comment.
Same here re needing the tip
docs/source/en/model_doc/focalnet.md
Outdated
| <Tip> | ||
|
|
||
| Use the [`AutoImageProcessor`] class to prepare images for the model. | ||
| </Tip> |
docs/source/en/model_doc/swinv2.md
Outdated
|
|
||
| If you're interested in submitting a resource to be included here, please feel free to open a Pull Request and we'll review it! The resource should ideally demonstrate something new instead of duplicating an existing resource. | ||
|
|
||
| <Tip> |
| Tips: | ||
|
|
||
| There are many pretrained variants. Select your pretrained model based on the dataset it is trained on. Moreover, the number of input frames per clip changes based on the model size so you should consider this parameter while selecting your pretrained model. | ||
| There are many pretrained variants. Select your pretrained model based on the dataset it is trained on. Moreover, |
There was a problem hiding this comment.
This seems like it falls under a useage tip
There was a problem hiding this comment.
Thanks for noticing! I missed this one.
| - A demo notebook for the Table Transformer can be found [here](https://github.com/NielsRogge/Transformers-Tutorials/tree/master/Table%20Transformer). | ||
| - It turns out padding of images is quite important for detection. An interesting Github thread with replies from the authors can be found [here](https://github.com/microsoft/table-transformer/issues/68). | ||
|
|
||
| <Tip> |
docs/source/en/model_doc/dinat.md
Outdated
|
|
||
| If you're interested in submitting a resource to be included here, please feel free to open a Pull Request and we'll review it! The resource should ideally demonstrate something new instead of duplicating an existing resource. | ||
|
|
||
| <Tip> |
docs/source/en/model_doc/dinov2.md
Outdated
| This model was contributed by [nielsr](https://huggingface.co/nielsr). | ||
| The original code can be found [here](https://github.com/facebookresearch/dinov2). | ||
|
|
||
| <Tip> |
* first batch of structure improvements for model_docs * second batch of structure improvements for model_docs * more structure improvements for model_docs * more structure improvements for model_docs * structure improvements for cv model_docs * more structural refactoring * addressed feedback about image processors
This PR streamlines pages in model_doc, so that a user who has seen at least one of these pages, should find it easier to navigate throughout any other model_doc page. These are the changes:
<Tip>has been added with a link to API reference of the original model.<framework>tag was added so that a user could collapse sections irrelevant to them.The PR aims to: