System tests foreground fixes by feerrenrut · Pull Request #14054 · nvaccess/nvda

feerrenrut · 2022-08-23T09:04:46Z

Link to issue number:

Related to #13983
Follow on from #14004

Summary of the issue:

System tests continue to fail intermittently.

In particular, Docker Desktop occasionally pops up a net promoter score survey window before or early in the build.
This Docker Desktop window opens in the foreground (meaning the Appveyor build window is no longer in the foreground) and causes system tests that rely on a third-party window taking the foreground to fail.
Appveyor build systems seem to be getting slower. Several key moments in the tests are approaching the limit of how long NVDA will wait for speech to complete.
After an update, Chrome takes a long time to be ready for NVDA to interact with it.
Some tests use the same sample, resulting in matching titles. This can leave setup code unable to uniquely identify the tab/title of the test.
From the RF logs, it can be hard to tell what the foreground window is.
Visually following review/focus can be difficult when monitoring the tests.
Some settings were modified before setting up Chrome, which could affect the setup procedure (i.e. review not following focus).

Description of user facing changes

For developers, the system tests shouldn't fail intermittently.

Description of development approach

When the required process (Chrome / Notepad) doesn't have the foreground, the task switcher is used to give it the foreground.
Extend the time limit when waiting for speech to complete.
Start Chrome early in the build process, giving it time to complete any post-update actions.
Append a time stamp to the test sample. The time stamp is included in the hash of the sample used in the title.
Add more logging to report the foreground window.
Enable the NVDA highlighter.
Ensure that settings are modified after Chrome is set up.
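
The time stamp idea for unique titles can be sketched roughly as below. This is only an illustration, not the code from this PR: the helper name `makeUniqueTitle`, the choice of `zlib.crc32` as the hash, and the exact title format are assumptions, loosely modelled on titles such as `test (1548220167) - Notepad` that appear in the logs.

```python
import time
import zlib


def makeUniqueTitle(baseTitle: str, sample: str, timestamp: float) -> str:
    """Hypothetical helper: build a title that differs between test runs."""
    # Appending the time stamp means identical samples no longer hash identically.
    stamped = f"{sample}\n{timestamp!r}"
    digest = zlib.crc32(stamped.encode("utf-8"))
    return f"{baseTitle} ({digest})"


if __name__ == "__main__":
    # Two tests sharing a sample still get distinct titles when started at different times.
    print(makeUniqueTitle("test", "Say (quietly) 'Hello, Jim'.", time.time()))
```

With the time stamp included, two tests that share a sample should no longer produce matching window titles, so setup code can search for exactly one window.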

Testing strategy:

Tests run locally
Tests run in a try-build
PR build
Due to the intermittent nature of the failures, many builds are required to be sure that the situation has improved. To prevent interfering with other work (by occupying the CI), this can be delivered and monitored on alpha.

Known issues with pull request:

None

Change log entries:

None

Code Review Checklist:

Pull Request description:
- description is up to date
- change log entries
Testing:
- Unit tests
- System (end to end) tests
- Manual testing
API is compatible with existing add-ons.
Documentation:
- User Documentation
- Developer / Technical Documentation
- Context sensitive help for GUI changes
UX of all users considered:
- Speech
- Braille
- Low Vision
- Different web browsers
- Localization in other languages / culture than English
Security precautions taken.

tests/system/libraries/WindowsLib.py

tests/system/nvdaSettingsFiles/standard-dontShowWelcomeDialog.ini

tests/system/libraries/NotepadLib.py

AppVeyorBot · 2022-08-29T09:52:50Z

PASS: Translation comments check.
PASS: Unit tests.
PASS: Lint check.
FAIL: System tests. See test results for more information.
Build (for testing PR): https://ci.appveyor.com/api/buildjobs/v4eqsvlr58x2m85r/artifacts/output/nvda_snapshot_pr14054-26390,6d05096a.exe

See test results for failed build of commit 6d05096a6c

feerrenrut · 2022-08-29T11:45:30Z

The test failed due to a popup "Restore pages? Chrome didn't shutdown correctly."

One thing worth noting: none of the other tabs (test cases) are open. Perhaps Chrome ran out of memory.
To address this:

I'll look for a way to disable the popup.
I'll re-enable the code to close the tab, but only if the tab/window has focus.

AppVeyorBot · 2022-08-29T13:43:00Z

PASS: Translation comments check.
PASS: Unit tests.
PASS: Lint check.
FAIL: System tests. See test results for more information.
Build (for testing PR): https://ci.appveyor.com/api/buildjobs/53q2bmqy83tqk9it/artifacts/output/nvda_snapshot_pr14054-26394,f30e7bd1.exe

See test results for failed build of commit f30e7bd108

feerrenrut · 2022-08-30T11:52:34Z

The last CI run had test failures:

I7562

From RF log. Last check while waiting for Chrome (total 3 seconds of waiting):

21:21:53.695 No windows found matching the pattern: re.compile('^NVDA Browser Test Case \\(1887919363\\)')

From the NVDA log, NVDA reports Chrome opening the tab:

IO - speech.speech.speak (13:21:53.938) - MainThread (3456):
Speaking [LangChangeCommand ('en'), 'NVDA Browser Test Case (1887919363) - Google Chrome', CancellableSpeech (still valid)]

ARIA checkbox

From RF log. Trying to read the title (NVDA+t), speech didn't finish within 5 seconds.

From NVDA log, error with gesture NVDA+t. This is the second time it has been run.

IO - inputCore.InputManager.executeGesture (13:23:00.057) - RF Test Spy Thread (4228):
Input: kb(desktop):NVDA+t
ERROR - scriptHandler.executeScript (13:23:00.073) - MainThread (1088):
error executing script: <bound method GlobalCommands.script_title of <globalCommands.GlobalCommands object at 0x067BE990>> with gesture 'NVDA+t'
Traceback (most recent call last):
  File "scriptHandler.pyc", line 289, in executeScript
  File "globalCommands.pyc", line 2302, in script_title
AttributeError: 'NoneType' object has no attribute 'name'

This seems strange.
Looking at the script_title method in GlobalCommands, it seems that api.getForegroundObject() returned None.

Starts from desktop shortcut

RF log:

21:35:10.211	INFO	emulateKeyPress ('control+alt+n',)	
21:35:17.569	FAIL	Connection to remote server broken: [WinError 10061] No connection could be made because the target machine actively refused it	
21:35:17.569	DEBUG	Traceback (most recent call last):
  File "C:\projects\nvda\tests\system\robot\startupShutdownNVDA.py", line 124, in test_desktop_shortcut
    spy.emulateKeyPress("control+alt+n")
  File "C:\projects\nvda\tests\system\libraries\NvdaLib.py", line 256, in runKeyword
    return lib.run_keyword(keyword, args, kwargs)
  File "C:\projects\nvda\.venv\lib\site-packages\robot\libraries\Remote.py", line 106, in run_keyword
    result = RemoteResult(self._client.run_keyword(name, args, kwargs))
  File "C:\projects\nvda\.venv\lib\site-packages\robot\libraries\Remote.py", line 264, in run_keyword
    raise RuntimeError(message)

Note NVDA is already running, and is providing the connection to RF.
NVDA is used to send a "control+alt+n" shortcut to start NVDA.
Starting a new instance of NVDA will cause the currently running instance to exit.
This will cause the server to disconnect.
In the failed test image there is no indication that NVDA is running. When NVDA is running, the highlighter should be visible.

NVDA log:

INFO - core.main (13:35:12.499) - MainThread (4652):
Exiting
DEBUG - core.triggerNVDAExit (13:35:12.499) - MainThread (4652):
_doShutdown has been queued

This indicates it was about 5 seconds from NVDA shutdown until the RF remote error.
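
The "about 5 seconds" figure can be checked from the two log time stamps. The sketch below assumes that the RF log clock runs 8 hours ahead of the NVDA log clock; that offset is inferred from the excerpts above (21:35 in the RF log against 13:35 in the NVDA log) and is not stated anywhere in the logs. The function name is made up for illustration.

```python
from datetime import datetime, timedelta

# Assumption: RF log times are 8 hours ahead of NVDA log times.
LOG_OFFSET = timedelta(hours=8)
TIME_FORMAT = "%H:%M:%S.%f"


def secondsBetween(nvdaTime: str, rfTime: str) -> float:
    """Seconds elapsed from an NVDA log time stamp to an RF log time stamp."""
    nvda = datetime.strptime(nvdaTime, TIME_FORMAT) + LOG_OFFSET
    rf = datetime.strptime(rfTime, TIME_FORMAT)
    return (rf - nvda).total_seconds()


# NVDA "Exiting" entry versus the RF "Connection to remote server broken" entry.
elapsed = secondsBetween("13:35:12.499", "21:35:17.569")
print(f"{elapsed:.3f} seconds")
```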

Test SelByWord

Failed because there were multiple notepad windows open with the same title:
From RF log:

	Too many windows to focus [Window(hwndVal=524936, title='test (1548220167) - Notepad'), Window(hwndVal=4653638, title='test (1548220167) - Notepad')]

It's unknown why one of the Notepad windows didn't close at the end of the test. The screenshot doesn't give any indication, because the "old" Notepad instance is obscured by the new one.
The prior test (from RF log) indicates the notepad
This can be fixed by ensuring every test gets a unique title.
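
To illustrate why duplicate titles break the lookup, here is a minimal sketch of a "find exactly one window" check. The `Window` tuple mirrors the fields printed in the RF log, but `findUniqueWindow` is a hypothetical helper, not the actual WindowsLib implementation, and the error messages only imitate the ones in the log.

```python
import re
from typing import List, NamedTuple, Pattern


class Window(NamedTuple):
    hwndVal: int
    title: str


def findUniqueWindow(windows: List[Window], pattern: Pattern[str]) -> Window:
    """Hypothetical helper: return the single window whose title matches."""
    matches = [w for w in windows if pattern.search(w.title)]
    if not matches:
        raise LookupError(f"No windows found matching the pattern: {pattern!r}")
    if len(matches) > 1:
        # Two tests sharing a title end up here, as in the SelByWord failure.
        raise LookupError(f"Too many windows to focus {matches}")
    return matches[0]


if __name__ == "__main__":
    openWindows = [
        Window(hwndVal=524936, title="test (1548220167) - Notepad"),
        Window(hwndVal=4653638, title="test (1548220167) - Notepad"),
    ]
    try:
        findUniqueWindow(openWindows, re.compile(r"^test \(1548220167\)"))
    except LookupError as e:
        print(e)
```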

Several tests use this same sample/title. From test "Test MoveByWord":

KEYWORD WindowsLib . Log Foreground Window Title
21:38:36.555	INFO	Foreground window title: test (1548220167) - Notepad
...
KEYWORD NotepadLib . Exit Notepad
Start / End / Elapsed:	20220829 21:38:36.562 / 20220829 21:38:48.697 / 00:00:12.135
21:38:36.562	INFO	Is Start process still running (True expected): True	
21:38:36.562	INFO	Test case in foreground, trying to close	
21:38:36.562	INFO	emulateKeyPress ('alt+f4',)
21:38:38.594	INFO	Waiting for process to complete.	
21:38:48.697	INFO	Process did not complete in 10 seconds.	
21:38:48.697	INFO	Leaving process intact.	
21:38:48.697	INFO	Is Start process still running (False expected): True

This is the only instance of "leaving process intact" in the log.
It may be helpful in these instances to get a screenshot when Notepad can't exit. Was there a "save dialog"?
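
The "wait, then leave the process intact" behaviour seen in the log can be sketched with the standard library as follows. This is an assumption about the general shape only: `waitForExit` is a hypothetical name, and the real NotepadLib code may track the process differently.

```python
import subprocess


def waitForExit(process: subprocess.Popen, timeoutSecs: float) -> bool:
    """Return True if the process exited in time, False if it is still running.

    The process is deliberately not killed on timeout, matching the
    "Leaving process intact." message in the RF log.
    """
    try:
        process.wait(timeout=timeoutSecs)
    except subprocess.TimeoutExpired:
        return False
    return True
```

A caller could take a screenshot when this returns False, which would help answer whether a save dialog was blocking the exit.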

AppVeyorBot · 2022-08-31T10:47:56Z

PASS: Translation comments check.
PASS: Unit tests.
PASS: Lint check.
FAIL: System tests. See test results for more information.
Build (for testing PR): https://ci.appveyor.com/api/buildjobs/9ii59v27w699cxew/artifacts/output/nvda_snapshot_pr14054-26425,a399f46a.exe

See test results for failed build of commit a399f46ae7

AppVeyorBot · 2022-09-05T15:05:53Z

PASS: Translation comments check.
PASS: Unit tests.
PASS: Lint check.
FAIL: System tests. See test results for more information.
Build (for testing PR): https://ci.appveyor.com/api/buildjobs/5xj9u4upkan8b0nh/artifacts/output/nvda_snapshot_pr14054-26461,1892cd58.exe

See test results for failed build of commit 1892cd58a9

AppVeyorBot · 2022-09-06T08:39:45Z

PASS: Translation comments check.
PASS: Unit tests.
PASS: Lint check.
FAIL: System tests. See test results for more information.
Build (for testing PR): https://ci.appveyor.com/api/buildjobs/i5yutnlq0ujw7ov5/artifacts/output/nvda_snapshot_pr14054-26475,0e1983cc.exe

See test results for failed build of commit 0e1983ccbf

… environment var (PR #14055)

Related to PR #14054

Summary: When tests are failing in an unusual way, it may be helpful to be able to review verbose debug logging for interaction with MSAA or UIA.

For developers: VERBOSE_SYSTEM_TEST_LOGGING can be set (to "true") via Appveyor settings to enable high verbosity NVDA logging.

Development approach: When the environment variable VERBOSE_SYSTEM_TEST_LOGGING is set on Appveyor, the system tests are started with an extra parameter (verboseDebugLogging='True') which enables the advanced logging categories in NVDA.
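
As a rough sketch of how an environment variable could be turned into that extra test parameter: the helper name `buildRobotVariables` is hypothetical, and passing the value through Robot Framework's `--variable name:value` option is an assumption, since the commit message does not say how the parameter reaches the tests.

```python
import os
from typing import List, Mapping


def buildRobotVariables(environ: Mapping[str, str]) -> List[str]:
    """Hypothetical helper: extra Robot Framework arguments for verbose logging."""
    args: List[str] = []
    # Only the value "true" (case-insensitive) enables verbose logging in this sketch.
    if environ.get("VERBOSE_SYSTEM_TEST_LOGGING", "").strip().lower() == "true":
        args += ["--variable", "verboseDebugLogging:True"]
    return args


if __name__ == "__main__":
    print(buildRobotVariables(os.environ))
```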

Since Appveyor seems to be less performant than previously, give NVDA more time to handle events and get all speech out. This will increase the time system tests take. System tests could be optimised to reduce usages of wait_for_speech_to_finish. In many cases, wait_for_specific_speech could be preferable.
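
The difference between waiting for all speech to finish and waiting for one specific utterance can be sketched with a generic polling helper. Both function names here are hypothetical stand-ins and do not reproduce the spy library's real signatures; `getSpeech` stands for whatever returns the speech captured so far.

```python
import time
from typing import Callable, List


def blockUntil(condition: Callable[[], bool], timeoutSecs: float, intervalSecs: float = 0.05) -> bool:
    """Poll condition until it is true or the timeout elapses."""
    deadline = time.monotonic() + timeoutSecs
    while True:
        if condition():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(intervalSecs)


def waitForSpecificSpeech(getSpeech: Callable[[], List[str]], expected: str, timeoutSecs: float = 5.0) -> bool:
    """Return as soon as the expected text has been spoken, rather than waiting for silence."""
    return blockUntil(lambda: any(expected in s for s in getSpeech()), timeoutSecs)
```

Waiting for a specific utterance returns as soon as the text appears, so a longer timeout only costs time when the test is about to fail anyway.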

Seeing focus / nav / browsemode during tests can help some developers with debugging issues with the tests.

Some config can affect the way the test samples are prepared, E.G. moving focus / review to the start location.

The prior approach was flawed: the 'start.exe' process was being tracked instead. The 'start.exe' process exited immediately after starting Notepad. Now it waits until the process is complete. Additionally, the Notepad RF lib has access to the Windows HWND, allowing it to check whether the window is in the foreground.

…cess

The test 'Starts from desktop shortcut' relies on NVDA to be running to send the hotkey. This interferes with the logic of the test. The test intermittently fails, see: #14054
Disable test until this can be addressed. Issue #14293 has been opened to resolve this and re-enable the test.

feerrenrut · 2022-11-09T03:45:53Z

This work has been split up, and entirely delivered via other PRs. Closing.

Splitting up PR #14054

Summary of the issue:
When a system test fails a screenshot is captured. However, this does not make it clear where the focus / nav / virtual cursor is positioned.

Description of user facing changes
None

Description of development approach
Seeing focus / nav / virtual cursor during tests can help some developers with debugging issues with the tests.

feerrenrut mentioned this pull request Aug 23, 2022

control high verbosity nvda logging during system tests with appveyor environment var #14055

Merged

6 tasks

feerrenrut marked this pull request as ready for review August 24, 2022 03:04

feerrenrut requested a review from a team as a code owner August 24, 2022 03:04

feerrenrut requested review from seanbudd and removed request for a team August 24, 2022 03:04

seanbudd reviewed Aug 24, 2022

seanbudd changed the title ~~systemTest foreground fixes~~ System tests foreground fixes Aug 24, 2022

feerrenrut marked this pull request as draft August 25, 2022 02:32

feerrenrut changed the base branch from beta to master August 25, 2022 06:59

feerrenrut marked this pull request as ready for review August 25, 2022 07:34

feerrenrut requested a review from seanbudd August 25, 2022 08:24

lukaszgo1 reviewed Aug 25, 2022

tests/system/libraries/NotepadLib.py (outdated, resolved)

seanbudd approved these changes Aug 26, 2022

tests/system/libraries/WindowsLib.py (outdated, resolved)

seanbudd reviewed Aug 29, 2022

tests/system/libraries/WindowsLib.py (outdated, resolved)

seanbudd approved these changes Aug 30, 2022

seanbudd marked this pull request as draft September 5, 2022 05:31

feerrenrut added 6 commits September 14, 2022 12:52

Ensure to speech has started when waiting for the address bar speech

998f0e0

Use highligher during tests

3acceeb

Seeing focus / nav / browsemode during tests can help some developers with debugging issues with the tests.

Fix docs presentation

48699ba

Change config after preparing chrome tests

168f4c0

Some config can affect the way the test samples are prepared, E.G. moving focus / review to the start location.

feerrenrut added 12 commits September 14, 2022 12:54

Clean up helper method and add typing

a4a8df6

Fix lint check

24f1777

remove spy argument

d8a1a8f

Close chrome tab after test completes

703ace5

prevent chrome session crashed bubble

670ed94

fix lint

6bc4c31

Give chrome more time to start up

5a20724

No need to try to report chrome title a second time after initial suc…

3a1cc75

…cess

ensure notepad testcases have a unique title

c49df41

Do speech logging in another thread

6d5dec4

More detailed logging of spy plugin

9745e6a

Tune _blockUntilConditionMet inverval param

34361ba

feerrenrut force-pushed the systemTests-foregroundFixes branch from 04531a5 to 34361ba Compare September 14, 2022 04:54

michaelDCurran mentioned this pull request Oct 11, 2022

Announce sort state on a column header when changed with an inner button #14234

Merged

6 tasks

This was referenced Oct 25, 2022

Track SUT app lifetime in system tests #14289

Merged

System test 'Starts from desktop shortcut' relies on NVDA. #14293

Closed

Disable 'Starts from desktop shortcut' system test #14294

Merged

feerrenrut mentioned this pull request Oct 27, 2022

Handle SUT Application title report failure #14305

Merged

6 tasks

feerrenrut closed this Nov 9, 2022

feerrenrut added the component/system-testing label Dec 8, 2022
