Is your feature request related to a problem? Please describe.
The information on OCR is broken up into three sections in the user guide:
Section 10. Content Recognition, which describes what OCR is, explains how it follows the navigator object, and describes the virtual window the results are presented in, and how to navigate them and close the window
Section 10.1. Windows 10 OCR immediately follows, which discusses the languages and points to the User Guide section to set the OCR language, and then provides instructions for starting OCR (NVDA+r).
Section 12.1.16. Windows 10 OCR Settings explains the settings in the OCR section of the NVDA settings dialog.
The two "parts" of section 10 are disjointed, with the specific instructions being separated (how to navigate and close the results comes before how to actually start OCR for instance). Additionally, users are often confused about how much information NVDA's OCR feature provides. A common scenario is to open a PDF file, run OCR on it, and wonder why only a small amount of text is recognised.
Describe the solution you'd like
I suggest for the information we have in section 10, it could be done all in one block. If we want to have that Section 10.1 to discuss our solution (perhaps with the idea of multiple possible solutions or discussion of specific add-ons at some point in the future?) then I suggest having only the following text prior to subsection 10.1:
"When authors don't provide sufficient information for a screen reader user to determine the content of something, various tools can be used to attempt to recognize the content from an image."
I would then take this paragraph and move it to the end of the section (after NVDA+r is mentioned):
"Once recognition is complete, the result will be presented in a document similar to browse mode, allowing you to read the information with cursor keys, etc. Pressing enter or space will activate (normally click) the text at the cursor if possible. Pressing escape dismisses the recognition result."
I would also move the discussion about languages above the paragraph about recognition being from the current navigator object. That would bring all discussion about "how to" together, and immediately before the specific instructions of what to press.
Finally, I would add the following paragraph after the current navigator information and before the "press NVDA+r" instructions:
OCR is only performed on text which is visible on screen. Text which is too small to be read is also not able to be processed. Where possible, setting the view to show at "100%" or to show "one page", or "one page width" should provide good results.
My proposed full reworded section 10 below (note I just copied the text, so links in the original text are lost):
`10. Content Recognition
When authors don't provide sufficient information for a screen reader user to determine the content of something, various tools can be used to attempt to recognize the content from an image.
10.1. Windows 10 OCR
NVDA supports the optical character recognition (OCR) functionality built into Windows 10 to recognize text from images. Additional content recognizers can be provided in NVDA add-ons.
Windows 10 includes OCR for many languages. NVDA can use this to recognize text from images or inaccessible applications.
You can set the language to use for text recognition in the Windows 10 OCR category of the NVDA Settings dialog. Additional languages can be installed by opening the Start menu, choosing Settings, selecting Time & Language -> Region & Language and then choosing Add a language.
When you use a content recognition command, NVDA recognizes content from the current navigator object. By default, the navigator object follows the system focus or browse mode cursor, so you can usually just move the focus or browse mode cursor where desired. For example, if you move the browse mode cursor to a graphic, recognition will recognize content from the graphic by default. However, you may wish to use object navigation directly to, for example, recognize the content of an entire application window.
OCR is only performed on text which is visible on screen. Text which is too small to be read is also not able to be processed. Where possible, setting the view to show at "100%" or to show "one page", or "one page width" should provide good results.
To recognize the text in the current navigator object using Windows 10 OCR, press NVDA+r.
Once recognition is complete, the result will be presented in a document similar to browse mode, allowing you to read the information with cursor keys, etc. Pressing enter or space will activate (normally click) the text at the cursor if possible. Pressing escape dismisses the recognition result.`
Describe alternatives you've considered
Additional context
Is your feature request related to a problem? Please describe.
The information on OCR is broken up into three sections in the user guide:
Section 10. Content Recognition, which describes what OCR is, explains how it follows the navigator object, and describes the virtual window the results are presented in, and how to navigate them and close the window
Section 10.1. Windows 10 OCR immediately follows, which discusses the languages and points to the User Guide section to set the OCR language, and then provides instructions for starting OCR (NVDA+r).
Section 12.1.16. Windows 10 OCR Settings explains the settings in the OCR section of the NVDA settings dialog.
The two "parts" of section 10 are disjointed, with the specific instructions being separated (how to navigate and close the results comes before how to actually start OCR for instance). Additionally, users are often confused about how much information NVDA's OCR feature provides. A common scenario is to open a PDF file, run OCR on it, and wonder why only a small amount of text is recognised.
Describe the solution you'd like
I suggest for the information we have in section 10, it could be done all in one block. If we want to have that Section 10.1 to discuss our solution (perhaps with the idea of multiple possible solutions or discussion of specific add-ons at some point in the future?) then I suggest having only the following text prior to subsection 10.1:
"When authors don't provide sufficient information for a screen reader user to determine the content of something, various tools can be used to attempt to recognize the content from an image."
I would then take this paragraph and move it to the end of the section (after NVDA+r is mentioned):
"Once recognition is complete, the result will be presented in a document similar to browse mode, allowing you to read the information with cursor keys, etc. Pressing enter or space will activate (normally click) the text at the cursor if possible. Pressing escape dismisses the recognition result."
I would also move the discussion about languages above the paragraph about recognition being from the current navigator object. That would bring all discussion about "how to" together, and immediately before the specific instructions of what to press.
Finally, I would add the following paragraph after the current navigator information and before the "press NVDA+r" instructions:
OCR is only performed on text which is visible on screen. Text which is too small to be read is also not able to be processed. Where possible, setting the view to show at "100%" or to show "one page", or "one page width" should provide good results.
My proposed full reworded section 10 below (note I just copied the text, so links in the original text are lost):
`10. Content Recognition
When authors don't provide sufficient information for a screen reader user to determine the content of something, various tools can be used to attempt to recognize the content from an image.
10.1. Windows 10 OCR
NVDA supports the optical character recognition (OCR) functionality built into Windows 10 to recognize text from images. Additional content recognizers can be provided in NVDA add-ons.
Windows 10 includes OCR for many languages. NVDA can use this to recognize text from images or inaccessible applications.
You can set the language to use for text recognition in the Windows 10 OCR category of the NVDA Settings dialog. Additional languages can be installed by opening the Start menu, choosing Settings, selecting Time & Language -> Region & Language and then choosing Add a language.
When you use a content recognition command, NVDA recognizes content from the current navigator object. By default, the navigator object follows the system focus or browse mode cursor, so you can usually just move the focus or browse mode cursor where desired. For example, if you move the browse mode cursor to a graphic, recognition will recognize content from the graphic by default. However, you may wish to use object navigation directly to, for example, recognize the content of an entire application window.
OCR is only performed on text which is visible on screen. Text which is too small to be read is also not able to be processed. Where possible, setting the view to show at "100%" or to show "one page", or "one page width" should provide good results.
To recognize the text in the current navigator object using Windows 10 OCR, press NVDA+r.
Once recognition is complete, the result will be presented in a document similar to browse mode, allowing you to read the information with cursor keys, etc. Pressing enter or space will activate (normally click) the text at the cursor if possible. Pressing escape dismisses the recognition result.`
Describe alternatives you've considered
Additional context