BUG: Fix `highspy` `__getitem__` speed by HaoZeke · Pull Request #1438 · ERGO-Code/HiGHS

HaoZeke · 2023-09-30T21:42:15Z

Closes #1277. For this test case (adapted from here):

import highspy
import timeit

h = highspy.Highs()

h.readModel("80bau3b.mps")
h.setOptionValue("time_limit", 10)

# Solve
start_time = timeit.default_timer()
h.run()
elapsed = timeit.default_timer() - start_time

print(f"Time taken to solve: {elapsed:.4f}")

solution = h.getSolution()
num_vars = h.getNumCol()

# Extract values using __getitem__ N times
start_time = timeit.default_timer()
values = [solution.col_value[icol] for icol in range(num_vars)]
elapsed = timeit.default_timer() - start_time
print(f"__getitem__ method for {num_vars} vars: {elapsed:.4f}")


# Extract values by copying first to list
start_time = timeit.default_timer()
col_values = list(solution.col_value)
values = [col_values[icol] for icol in range(num_vars)]
elapsed = timeit.default_timer() - start_time
print(f"Copy-first method for {num_vars} vars: {elapsed:.4f}")

Where the 80bau3b.mps is part of this repo, we have:

# Original
Time taken to solve: 0.1016
__getitem__ method for 9799 vars: 0.7423
Copy-first method for 9799 vars: 0.0003

As a first approximation, we can use the buff protocol and numpy:

# Numpy readonly (33b6634823b41252f28fdd9daa5b553c6b9b43cd)
Time taken to solve: 0.0974
__getitem__ method for 9799 vars: 0.0956
Copy-first method for 9799 vars: 0.0010

However, this isn't equivalent to readwrite and isn't very nice conceptually, so we can try to use an opaque type:

# Opaque type (standard) for std::vector<double> (2aa08d9c8471a86aafa544d7976de1e520c7357f)
Time taken to solve: 0.1007
__getitem__ method for 9799 vars: 0.0064
Copy-first method for 9799 vars: 0.0015

This can be optimized a bit more:

# Custom opaque type and class
Time taken to solve: 0.0919
__getitem__ method for 9799 vars: 0.0046
Copy-first method for 9799 vars: 0.0015

Note that this method has the caveats listed in the documentation. I think the standard opaque type is probably about as far as can be optimized safely, though there seem to be some micro-optimizations discussed on Gitter (which I admit are beyond me).

__getitem__ method for 9799 vars: 0.0956 Copy-first method for 9799 vars: 0.0010

__getitem__ method for 9799 vars: 0.0064 Copy-first method for 9799 vars: 0.0015 ERGO-Code#1277

From ERGO-Code#1438

HaoZeke · 2023-10-01T12:20:19Z

One of the biggest issues with this is that now users need to be careful to cast:

# Before:
lp.col_cost_ = c

# After:
vector_double_c = highspy._highs.VectorDouble(c.tolist())
lp.col_cost_ = vector_double_c

This can be sidestepped with the NumPy approach (or just documented). Alternatively, since for the modeling API we have the Python wrapper over the C++ internals anyway, we can just extract it into a list before operating on it (as is done in the test case).

HaoZeke · 2023-10-01T15:37:19Z

I don't like any of these here actually. They're a mess for downstream library integration (e.g. SciPy). The solution should be to rework the highs python module instead to extract the list and store it.

From ERGO-Code#1438

HaoZeke added 4 commits September 30, 2023 21:40

MAINT: Try readonly numpy return for HighsSolution

6748905

__getitem__ method for 9799 vars: 0.0956 Copy-first method for 9799 vars: 0.0010

BUG: Use an opaque variant for HighsSolution

e6bef68

__getitem__ method for 9799 vars: 0.0064 Copy-first method for 9799 vars: 0.0015 ERGO-Code#1277

ENH: Rewrite vector with custom class for speed

a545fd1

MAINT: Slightly safer (fully featured) variation

78abe0d

HaoZeke added a commit to HaoZeke/HiGHS that referenced this pull request Sep 30, 2023

ENH: Add a variant of the fix in on vector<double>

5cb1508

From ERGO-Code#1438

HaoZeke added a commit to HaoZeke/HiGHS that referenced this pull request Sep 30, 2023

ENH: Add a variant of the fix in on vector<double>

8245e1c

From ERGO-Code#1438

HaoZeke closed this Oct 1, 2023

HaoZeke added a commit to HaoZeke/HiGHS that referenced this pull request Feb 3, 2024

ENH: Add a variant of the fix in on vector<double>

86094cf

From ERGO-Code#1438

HaoZeke added a commit to HaoZeke/HiGHS that referenced this pull request Feb 3, 2024

ENH: Add a variant of the fix in on vector<double>

c0bf4e6

From ERGO-Code#1438

HaoZeke added a commit to HaoZeke/HiGHS that referenced this pull request Feb 3, 2024

ENH: Add a variant of the fix in on vector<double>

fea558d

From ERGO-Code#1438

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Fix `highspy` `getitem` speed#1438

BUG: Fix `highspy` `getitem` speed#1438
HaoZeke wants to merge 4 commits intoERGO-Code:masterfrom
HaoZeke:fixGetItemSpeed

HaoZeke commented Sep 30, 2023

Uh oh!

HaoZeke commented Oct 1, 2023

Uh oh!

HaoZeke commented Oct 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

HaoZeke commented Sep 30, 2023

Uh oh!

HaoZeke commented Oct 1, 2023

Uh oh!

HaoZeke commented Oct 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant