Faster TensorProductOperator by vpuri3 · Pull Request #59 · SciML/SciMLOperators.jl

vpuri3 · 2022-06-17T15:04:07Z

fixes #58

using SciMLOperators, LinearAlgebra
using BenchmarkTools

A = TensorProductOperator(rand(12,12), rand(12,12), rand(12,12))

u = rand(12^3, 100)
v = rand(12^3, 100)

A = cache_operator(A, u)

mul!(v, A, u) # dummy
@btime mul!($v, $A, $u); # 4.510 ms (17 allocations: 31.36 KiB)

julia> versioninfo()
Julia Version 1.8.0-rc1
Commit 6368fdc6565 (2022-05-27 18:33 UTC)
Platform Info:
  OS: macOS (x86_64-apple-darwin21.4.0)
  CPU: 4 × Intel(R) Core(TM) i5-5257U CPU @ 2.70GHz
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-13.0.1 (ORCJIT, broadwell)
  Threads: 4 on 4 virtual cores
Environment:
  JULIA_NUM_PRECOMPILE_TASKS = 4
  JULIA_DEPOT_PATH = /Users/vp/.julia
  JULIA_NUM_THREADS = 4

This is on par with LinearMaps in terms of speed, and miles ahead in terms of allocations. ref #58 (comment)

codecov · 2022-06-17T15:08:11Z

Codecov Report

Merging #59 (2eaede7) into master (746c0d5) will not change coverage.
The diff coverage is 0.00%.

@@          Coverage Diff           @@
##           master     #59   +/-   ##
======================================
  Coverage    0.00%   0.00%           
======================================
  Files           6       6           
  Lines         792     822   +30     
======================================
- Misses        792     822   +30

Impacted Files	Coverage Δ
src/basic.jl	`0.00% <0.00%> (ø)`
src/sciml.jl	`0.00% <0.00%> (ø)`
src/utils.jl	`0.00% <ø> (ø)`


vpuri3 · 2022-06-17T16:08:04Z

@ChrisRackauckas good to go

vpuri3 · 2022-06-18T01:00:33Z

For reference, a single-component matvec of the above example takes ~600 μs.

using LinearAlgebra, BenchmarkTools
u = rand(12, 12*12*100)
v = rand(12, 12*12*100)
A = rand(12,12)
mul!(v, A, u) # dummy
@btime mul!($v, $A, $u) # 616.410 μs (0 allocations: 0 bytes)

The entire tensor product takes about 7x that time (4.5 ms), yet the three matvecs account for only ~2 ms and each permutedims takes only ~200 μs, so there is surely some performance left to find. @chriselrod @ChrisRackauckas
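To make the cost breakdown above concrete, here is a minimal sketch of one way to apply a three-factor Kronecker product using only reshapes, dense matmuls, and `permutedims`. It is an illustration under assumptions, not the code in this PR: `kron3_apply` is a hypothetical name, it allocates freely rather than using a cache, and it handles a single vector rather than the 100-column batch benchmarked above.

```julia
using LinearAlgebra

# Hypothetical helper (not part of SciMLOperators): computes
# kron(kron(A, B), C) * u without ever forming the Kronecker matrix.
function kron3_apply(A::AbstractMatrix, B::AbstractMatrix, C::AbstractMatrix,
                     u::AbstractVector)
    ma, na = size(A)
    mb, nb = size(B)
    mc, nc = size(C)
    @assert length(u) == na * nb * nc

    # In column-major layout the index belonging to C varies fastest.
    U = reshape(u, nc, nb, na)

    # Matvec 1: apply C along the first dimension.
    V1 = reshape(C * reshape(U, nc, nb * na), mc, nb, na)

    # Rotate so the B dimension comes first, then apply B.
    V1p = permutedims(V1, (2, 3, 1))                      # (nb, na, mc)
    V2 = reshape(B * reshape(V1p, nb, na * mc), mb, na, mc)

    # Rotate so the A dimension comes first, then apply A.
    V2p = permutedims(V2, (2, 3, 1))                      # (na, mc, mb)
    V3 = reshape(A * reshape(V2p, na, mc * mb), ma, mc, mb)

    # Rotate back to the (C, B, A) ordering expected by kron.
    V = permutedims(V3, (2, 3, 1))                        # (mc, mb, ma)
    return vec(V)
end

# Small check against the dense Kronecker product, with rectangular factors
# so that any mix-up of dimensions would show up as an error.
A, B, C = rand(2, 3), rand(4, 2), rand(3, 5)
u = rand(3 * 2 * 5)
@assert kron3_apply(A, B, C, u) ≈ kron(kron(A, B), C) * u
```

This version uses three matmuls and three `permutedims` calls, each on an array the size of the full input, which is why the per-step timings quoted above are a useful lower bound for the whole product.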

inital commit

5660912

vpuri3 added 4 commits June 17, 2022 11:14

[ci-skip]

9fb96b8

[ci-skip] 3 arg mul, ldiv bangs

e625ac6

[ci-skip] 5 arg mulbang

3dd0280

2 arg ldiv bang

daa7cae

vpuri3 changed the title ~~[WIP] Faster TensorProductOperator~~ Faster TensorProductOperator Jun 17, 2022

vpuri3 added 6 commits June 17, 2022 12:11

comment

d3fc991

rm unnecessary methods

9dcde88

fixed default zero/one for abstraactscimlop

4908913

rm import

f1f2fa5

comments

a83d75a

short circuit when identity. will speed things up with kronsum

2eaede7

vpuri3 mentioned this pull request Jun 18, 2022

make tensor products faster #58

Open

ChrisRackauckas merged commit c948888 into SciML:master Jun 18, 2022

vpuri3 deleted the permutedims branch June 18, 2022 03:14

ChrisRackauckas merged 11 commits into SciML:master from vpuri3:permutedims
