More array benchmarks by aspiwack · Pull Request #451 · tweag/linear-base

aspiwack · 2023-05-03T18:11:45Z

Benchmark raw allocation
Benchmark writes
Benchmark against the Seq type

Still a draft because I need to format all this, and maybe I need to revisit a commit message: it would seem that the overhead that my refactoring introduced essentially compensates the difference between Vector and our linear arrays. Which proves that the extra Ur and (,) are probably where the issue is coming from (probably mostly Ur since, as we can see in #328 , the (,) are unboxed. So having the ability to unbox Ur will really be critical for the extra bit of performance).

That being said, there is something to be said for comparing the same scenarios. There is also something to be said for comparing idiomatic scenarios. In neither case is the comparison completely accurate anyway.

Here are my figures after this PR (note how good we are with writes, the figures are not too shabby honestly):

  arrays
    alloc
      Data.Array.Mutable.Linear:                                                 OK (0.41s)
        329  ns ±  29 ns
      Data.Vector:                                                               OK (0.45s)
        451  ns ±  23 ns
      Data.Sequence:                                                             OK (0.29s)
        3.88 μs ± 377 ns
    toList
      Data.Array.Mutable.Linear:                                                 OK (0.36s)
        5.70 μs ± 384 ns
      Data.Vector:                                                               OK (0.35s)
        5.82 μs ± 347 ns
      Data.Sequence:                                                             OK (0.58s)
        3.02 μs ±  95 ns
    map
      Data.Array.Mutable.Linear:                                                 OK (0.32s)
        2.24 μs ± 220 ns
      Data.Vector:                                                               OK (0.33s)
        2.62 μs ± 166 ns
      Data.Sequence:                                                             OK (1.14s)
        7.20 μs ± 436 ns
    reads
      Data.Array.Mutable.Linear:                                                 OK (0.44s)
        9.11 μs ± 355 ns
      Data.Vector:                                                               OK (0.32s)
        11.3 μs ± 755 ns
      Data.Sequence:                                                             OK (0.44s)
        73.8 μs ± 2.7 μs
    successive writes (very unfair to vector)
      Data.Array.Mutable.Linear:                                                 OK (0.37s)
        13.6 μs ± 759 ns
      Data.Vector:                                                               OK (0.57s)
        1.93 ms ± 184 μs
      Data.Sequence:                                                             OK (0.21s)
        116  μs ±  11 μs

We probably have a bit of overhead due to unsafe casts. So it's more fair to have a reasonably sized array. The micro-benchmarking aspect is taken care of by our benchmarking framework, so the precision should still be as good. And it allows for slower tests.

I simply introduced a type class to abstract over the differences. I almost didn't do it because I was afraid that the extra `Ur` and `(,)` would pollute the result and bias the result in our favours. Maybe it does. The reads are a bit less different in this style. I guess it was to be expected because reads are so fast. But the benchmark still says that our implementation is slow. So I say: good enough.

aspiwack · 2023-05-04T07:58:14Z

Should be good now.

tbagrel1

I'm not sure I can convince myself that this code benchmarks each collection type without introducing extra noise that would impact performance.

That being said, at a syntactic/organisational level, the code is quite readable.

Using a single version of the code has created less tight benchmarks. This lets more specialisation happen. Though I can't say it's perfect either.

aspiwack · 2023-05-04T14:55:39Z

I'm not sure I can convince myself that this code benchmarks each collection type without introducing extra noise that would impact performance.

It does introduce noise. Now the question of whether it models actual real-life performance better or worse is much harder. I don't really know.

I've added a commit that helps with inlining and specialisation to remove some of that noise.

aspiwack added 7 commits May 3, 2023 15:42

Help for the future peerers-at-Core

bbe73e5

Add array benchmark for alloc and sets

f5f53f6

Array benchmarks: clean-up refactoring

7fa9f08

Array benchmark: refactor to avoid a little noise in test decls

0346704

Array benchmarks: compare against Sequence

50a72f5

aspiwack force-pushed the benchmarks/more-arrays branch from 25f62bb to 6c2e181 Compare May 4, 2023 07:57

aspiwack marked this pull request as ready for review May 4, 2023 07:57

aspiwack requested a review from tbagrel1 May 4, 2023 07:58

tbagrel1 approved these changes May 4, 2023

View reviewed changes

aspiwack added 2 commits May 4, 2023 16:52

Array benchmarks: clean-up

90dd6f0

Array benchmarks: allow more optimisations in scenarios

09a25b2

Using a single version of the code has created less tight benchmarks. This lets more specialisation happen. Though I can't say it's perfect either.

aspiwack force-pushed the benchmarks/more-arrays branch from 6c2e181 to 09a25b2 Compare May 4, 2023 14:53

aspiwack enabled auto-merge May 4, 2023 14:56

aspiwack merged commit 66a9bd9 into master May 4, 2023

aspiwack deleted the benchmarks/more-arrays branch May 4, 2023 15:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More array benchmarks#451

More array benchmarks#451
aspiwack merged 9 commits intomasterfrom
benchmarks/more-arrays

aspiwack commented May 3, 2023

Uh oh!

aspiwack commented May 4, 2023

Uh oh!

tbagrel1 left a comment

Uh oh!

aspiwack commented May 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

aspiwack commented May 3, 2023

Uh oh!

aspiwack commented May 4, 2023

Uh oh!

tbagrel1 left a comment

Choose a reason for hiding this comment

Uh oh!

aspiwack commented May 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants