Performance enhancement: use CUDA in ImageItem #1466
j9ac9k merged 42 commits into pyqtgraph:master
Conversation
This should improve performance under Windows.

Qt4 :shakes-fist-at-cloud:

Oh, testing has happened on 3 systems so far:
- Linux (Pop!OS), CUDA 10.2, GeForce GTX 1660 Ti Mobile
- Windows 10 Pro, CUDA 11.1, Quadro P1000
- Windows 10 Pro, CUDA 11.1, GeForce RTX 2060 SUPER
@outofculture thanks for the PR! I am 100% onboard w/ making cupy an optional dependency. There is another PR that @campagnola has been working on to improve image item performance ( #669 ), on which it appears you authored some commits. Can you expand on how the two PRs compare?

Yeah, @j9ac9k I originally forked off of Luke's work there, but then I sorta ignored what was there already and went off in a different direction. I was able to rebase and isolate this changeset to just the cupy and pre-alloc stuff. I would totally be willing to group them together into a single PR to ease the testing burden; I would just need to confer with @campagnola to move #669 out of wip status.
I'll try and ping him, but I don't want to hold off for too long in case we don't hear back. Thanks again for this PR!

Woo! Checks pass! I have no idea why macOS was failing there for a few runs, and I did nothing to fix anything, but the build failures went away on their own. 🤷🏽‍♀️
Hi @outofculture, we've had some intermittent macOS failures on openGL-related examples/tests that I'm pretty sure are CI-system related, so yeah, don't sweat those. I'll try and dive into this tonight. I'm likely going to have some pretty simple/dumb questions; is this your preferred medium to communicate on, or should we just stick to this pull request/issue tracker?

@j9ac9k github is fine. Depending on exactly when you look at it, I could be available on a higher-bandwidth chat platform ( live text, audio, or video ) of your choosing. I'll DM my contact info.
pyqtgraph/functions.py (outdated)

    data = data.astype(data.dtype.newbyteorder('='))
    if not dtype.isnative:
        weaveDtype = dtype.newbyteorder('=')
    # p = np.poly1d([scale, -offset*scale])
can we remove commented out lines?
I didn't know if this project had a strong preference to keep old bits of code like this around, but I would be happy to drop unused comments.
I don't mind leaving some old bits like this in, but probably should add a note as to why you're wanting to keep them around
pyqtgraph/functions.py (outdated)

    ----------
    lut : ndarray
        Either cupy or numpy arrays are accepted, though this function has
        problems in cupy on windows with some versions of the cuda toolkit.
is this with cuda versions < 11.0? ...probably should try and be a bit more specific here (doesn't have to be exhaustive, but documenting what's been observed would be good).
All comments should be addressed.
There is something the CI doesn't like on this PR... given it's failing on multiple platforms, I need to investigate further. @outofculture can you replicate the issue on ubuntu/pyside2/python3.9?
Yeah, the tests fail for me in 3.9, but I'm getting failures in master as well. On my branch, the segfault and timeout reproduced the first time I ran the tests, but since then I can't get it to happen again. I do also get the same failures as master.
Those errors I'm not too concerned about. I'll run locally on various platforms (due to my hidpi laptop display, the test suite fails horribly on Windows) and see if I'm able to replicate...
Arg; I can't reproduce the segfault. I've tried repeatedly re-running the tests, switching branches, rebuilding the env, and rebooting. It happened the first time for me, but never since. The stack in the logs doesn't look familiar to me at all; it's not in any of the code I worked on.
For the record, I'm not suggesting the issue is in your code, but with segfaults, from what I understand, changes in one place can reveal issues elsewhere.
Conflicts:
- README.md: same line, unrelated changes
- examples/VideoSpeedTest.py: new pyside6 import vs. cupy and args at same line
Thanks so much for this feature @outofculture !
cupy is a mostly-compatible drop-in replacement for numpy that performs its work on a CUDA-enabled GPU. This PR lets `ImageItem.setImage` accept an image on either substrate, and otherwise behaves identically.

Also, Windows memory allocation requests are slow, so pre-allocating buffers for processing improves performance ( for both cpu and gpu ). Since writing this, I've identified other places that could benefit from this treatment, but every bit helps.
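The pre-allocation idea can be sketched in plain numpy; the names, shapes, and scaling step below are illustrative, not the PR's actual code:

```python
import numpy as np

# Allocate the scratch buffer once, up front, sized for the image.
buf = np.empty((480, 640), dtype=np.float64)

def rescale(frame, scale, out):
    # numpy ufuncs accept an `out=` argument, writing into the
    # pre-allocated buffer instead of requesting a fresh allocation
    # on every frame (the slow path on Windows).
    np.multiply(frame, scale, out=out)
    return out

frame = np.full((480, 640), 2, dtype=np.uint16)
result = rescale(frame, 0.25, buf)
# `result` is the same object as `buf`: no per-frame allocation occurred.
```

cupy ufuncs accept the same `out=` keyword, which is part of why the two substrates can share this code path.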
The `examples/VideoSpeedTest.py` script was improved to accept command line arguments and to test the cuda-enabled processing.

Only one cupy function was found to be anything other than identical ( `cupy.take` does not support `mode="clip"` and needed to be clipped explicitly ), but behavior will of course need to be verified everywhere. Thankfully, this PR shouldn't significantly alter the way numpy-based images are processed, so we can safely leave cuda as an optional/experimental feature while we feel out how it behaves on more diverse systems than I was able to use for my tests.

In testing, we found one issue. On Windows systems with CUDA Toolkit < 11.1, an int16-dtype image with a lookup table will be incorrectly processed ( the result gets entirely mapped to 255; it whites out ). I didn't know where to document or enforce this requirement.
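The `mode="clip"` workaround can be illustrated with numpy, which supports both spellings; under cupy only the explicit-clip form works. This is a sketch of the equivalence, not the PR's exact code:

```python
import numpy as np

lut = np.array([10, 20, 30, 40], dtype=np.uint8)
indices = np.array([-2, 1, 3, 9])  # some indices fall outside the table

# numpy can clamp out-of-range indices on its own:
via_mode = np.take(lut, indices, mode="clip")

# cupy.take lacks mode="clip", so clamp the indices explicitly first;
# the same spelling behaves identically under numpy:
via_clip = np.take(lut, np.clip(indices, 0, len(lut) - 1))
```

Both forms map -2 to index 0 and 9 to the last index, so a lookup-table application produces the same result on either substrate.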
This also raises the question of what else would benefit from becoming cupy/numpy-agnostic, but we should leave that until we've proven this small slice of functionality.
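On the "optional dependency" point discussed above: keeping cupy optional usually comes down to a guarded import plus a small dispatch helper, roughly like this generic sketch (not this PR's implementation):

```python
import numpy as np

try:
    import cupy  # only importable on CUDA-capable installs
except ImportError:
    cupy = None

def array_module(arr):
    # Return the module (numpy or cupy) that owns this array, so the
    # same processing code can run on either substrate.
    if cupy is not None and isinstance(arr, cupy.ndarray):
        return cupy
    return np

xp = array_module(np.arange(4))  # numpy fallback when cupy is absent
```

Because cupy mirrors most of the numpy API, downstream code can call `xp.clip(...)`, `xp.take(...)`, etc. without caring which library it got back.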