Resample TOA-TOF-converted histogram data using `rebin` #245

SimonHeybrock · 2025-05-26T08:34:10Z

This replaces the to_events mechanism, which introduced random noise in the bin-edges, causing problems in stream processing. For $N\rightarrow\infty$ to_events should produce the same result as the rebin solution (module differences in how the bin size is chosen).

This also fixes the (I think) bug in to_events, which included counts from unphysical negative-size bins between frames.

nvaytet · 2025-05-26T09:20:52Z

src/ess/reduce/time_of_flight/resample.py

+    of the strictly increasing sections, with a step size equal to the difference
+    between the first two values of the section with the minimum start value (which is


with a step size equal to the difference between the first two values of the section with the minimum start value

In the previous verison, we were taking like a mean bin size. I felt it's was little safer than just taking the first one?

My idea was to minimize the amount of resampling performed. This implementation should effectively "do nothing" for one of the frames. Can we keep it like this and see if it works well in practice?

nvaytet · 2025-05-26T09:21:58Z

src/ess/reduce/time_of_flight/resample.py

+    between the first two values of the section with the minimum start value (which is
+    not necessarily the first section).
+    """
+    min_val, max_val = get_min_max(var, dim=dim, slices=slices)


Are we searching on the slices instead of just a nanmin/nanmax of var because the bounds of var might not belong to a section?

Correct, we want to get the range that the union of sections covers.

nvaytet · 2025-05-26T09:27:31Z

src/ess/reduce/time_of_flight/resample.py

+    return sc.arange(
+        dim=dim,
+        start=min_val.value,
+        stop=max_val.value + step.value,  # Ensure the last bin edge is included


I remember struggling with arange and step before, with cases where if max_val was an exact multiple of step, then the max of the arange would be max_val.
E.g.

When using this to histogram afterwards, the max_val would then be dropped (because we include the min value and exclude the max when using bin edges).

Can the same thing happen here? (or maybe rebin does not have the same issue as hist?)

rebin does have the same issue as hist. But this case is different: We did not compute a max of events. We computed a max of bin edges, so the resulting value should already be an epsilon above the last event (if there were events, which there are not).

Or did you ask about sth. else?

No, it wasn't something different. Because it's max of bin edges, it's not an issue here. I think that's correct.

nvaytet · 2025-05-26T09:29:11Z

tests/time_of_flight/resample_tests.py

+        assert sections == [slice(0, 2), slice(2, 5), slice(5, 8)]
+
+    def test_given_flat_sections_finds_strictly_increasing_parts_only(self):
+        var = sc.array(dims=['x'], values=[1, 2, 2, 3, 4, 4, 5])


Question: what happens with something like [1, 2, 2, 2, 3, 4, 4, 5]? (note the three 2s)

Added a test now.

nvaytet · 2025-05-26T09:32:20Z

tests/time_of_flight/resample_tests.py

+        assert max_val.value == 50  # The max value is at the end of the second slice
+
+    def test_with_float_data(self):
+        var = sc.array(dims=['x'], values=[1.1, 2.2, 3.3, 4.4, 5.5])


Does it also work with datetimes?

Do we expect datetime input coords? Does tof support that?

We don't expect datetimes. I was just wondering if it could be used as a general tool.
But let's not complicate things.

nvaytet

Looks good. Thanks for getting this to work in a proper way 👍

SimonHeybrock added 11 commits May 26, 2025 06:27

Begin adding new resample approach

dcaf8a5

Simplify

4c5c0db

Simplify

5a66e6f

Continue implementation

0a77eae

Make regular grid

a6c429e

Test and implement

0281ed9

Test more single-section cases

5da6118

Use rebin_Strictly_increasing instead of to_events

5276af9

Add test

98429c7

Remove to_events

bfedd4b

Comment on dropped negative bins

d9b5147

SimonHeybrock requested a review from nvaytet May 26, 2025 08:34

SimonHeybrock added this to Development Board May 26, 2025

github-project-automation bot moved this to In progress in Development Board May 26, 2025

SimonHeybrock moved this from In progress to Selected in Development Board May 26, 2025

SimonHeybrock mentioned this pull request May 26, 2025

Remove separate resampling step from TofWorkflow #246

Merged

nvaytet reviewed May 26, 2025

View reviewed changes

Add test for longer flat section

840b3b2

nvaytet approved these changes May 26, 2025

View reviewed changes

SimonHeybrock merged commit 70f9261 into main May 26, 2025
4 checks passed

SimonHeybrock deleted the toa-tof-rebin branch May 26, 2025 10:22

github-project-automation bot moved this from Selected to Done in Development Board May 26, 2025

SimonHeybrock mentioned this pull request Jun 10, 2025

Exception when passing event-mode monitors to time-of-flight workflow? #222

Closed

		of the strictly increasing sections, with a step size equal to the difference
		between the first two values of the section with the minimum start value (which is

Resample TOA-TOF-converted histogram data using rebin #245

Resample TOA-TOF-converted histogram data using rebin #245

Uh oh!

Conversation

SimonHeybrock commented May 26, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nvaytet left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Resample TOA-TOF-converted histogram data using `rebin` #245

Resample TOA-TOF-converted histogram data using `rebin` #245