Convert ParticleSet to structure of arrays #678

angus-g · 2019-10-20T22:51:14Z

This means initialisation of particles is very cheap from Python, and shouldn't be any worse from the C kernel. This gives a massive speedup. See #665 for other discussion.

As part of this change, the semantics of accessing properties on a ParticleSet has changed: pset[idx].property becomes pset.property[idx]. It would be possible to rearrange __getitem__ to return some kind of accessor, like the ParticleAccessor to make this backward-compatible, but I'm not sure of the efficiency implications of such a change. Speaking of which, I created ParticleAccessor before implementing __getattr__ on ParticleSet. It might be worth ripping this back out since it's somewhat unneeded.

A few of the tests are (randomly) failing in JIT mode with segfaults. It looks like maybe something isn't being initialised correctly but it's being a little tricky to debug. I thought I'd just put the bulk of the work up here first.

There are a couple of other remaining tasks:

get ParticleFile working again. I hacked this in another branch at a6684db, but that was before the writing to temporary dictionaries, so it'd probably be worth just responding to the few errors that pop up, rather than trying to merge this other change across. I'll do this after dealing with the JIT segfaults.
test all tutorials/examples (I imagine things like particle plotting probably won't work)

This means initialisation of particles is very cheap from Python, and shouldn't be any worse from the C kernel. This gives a massive speedup, and now the majority of time is spent in the kernel, where you'd expect.

# Conflicts: # parcels/particlefile.py

Note, not all unit tests pass yet. Am still working on that

# Conflicts: # parcels/examples/example_globcurrent.py # tests/test_kernel_language.py

including flake8 fixes, and making ParticleSet object indexable and iterable again

…with small p for particle iterator

erikvansebille · 2020-01-03T17:11:21Z

Thanks so much for all this development, @angus-g! I'd really like to try give it a spin in a real-life setup to investigate how much faster Parcels becomes!

As you've seen, I did quite some development over the last few weeks. I've today finally fixed the ParticleFile, so that works too

As I can see, there are two open issues:

print(ParticleSet) is now very ugly
there are Segmentation faults. I've now found why they happen: because particle.xi is not set to an array the size of the number of Grids (we need a particle.xi for each grid separately). For some reason, https://github.com/angus-g/parcels/blob/soa-storage/parcels/particle.py#L239-L254 is not called anymore?

The second open issue obviously is much more problematic than the first one. But we're almost there!

For backward compatibility checks

# Conflicts: # parcels/particlefile.py

To try fix breaking CI on Windows Github Actions

# Conflicts: # parcels/particleset.py

Updating to 2019 version of windows server for GitHub actions

angus-g · 2020-01-29T01:55:08Z

Thanks for rolling with this! In response to your second issue, you're correct that individual particles aren't initialised any more: the class is used as a structure to hold information about the required variables for which to create arrays. A lot of the work has been shifted to the particle set instead. I suppose this introduces a level of indirection that may make it more difficult to reason about what's going on.

This seems like the sort of thing that needs to be solved with more metaprogramming, so that we can treat particles as individual (for initialisation and printing), whilst having them stored in the more efficient format.

# Conflicts: # tests/test_fieldset.py

CKehl

I have addressed some points in the files that I found non-intuitive to understand, and where an explanation would be welcome. Outside of the explicit comments, the changes made look good and appropriate.

parcels/codegenerator.py

parcels/kernel.py

parcels/particleset.py

.github/workflows/ci-workflow.yml

parcels/codegenerator.py

Reverting to windows-2016

angus-g · 2020-02-12T10:30:25Z

Good call on the revert; it looks like anything written to stderr is considered an error in 2019? Weird!

Undoing logger.info comment now that Windows CI seems fixed (related to e.g. microsoft/azure-pipelines-tasks#12173)

angus-g · 2020-02-20T23:42:36Z

Just pushed through a little change in the recovery kernel logic: we can handle all the particles with the Repeat state at once, which again removes a Python loop over all particles (e.g. in the case of the first execution step with deferred load, where all particles will be Repeat). The recovery kernels are still executed in a Python loop, which could be slow if there are many particles with errors.

I'm able to initialise and sample about 7 million particles on 0.1 deg global surface velocities, though the actual advection is letting me down for speed now.

CKehl · 2020-02-21T19:19:47Z

@angus-g : I like the work you are recently doing on optimizing Parcels, and I would appreciate some talk via Slack, Mail or Skype to coordinate our optimization efforts a bit. I am preparing some branches at the moment for clean benchmarking of the source code and, afterwards, iterative hypothesis testing on some (code) optimization procedures and algorithmic changes. That benchmarking is also meant as an optimization guideline. Please contact me if you're in for a conversation, also on your goals and development- or research intents. Cheers, @CKehl

angus-g · 2020-02-23T19:57:48Z

@CKehl no problem, I’m happy to discuss via Slack, preferably

CKehl · 2020-02-24T09:01:09Z

How can I find you on Slack ? Alternatively, you can find me by searching for my full name.

CKehl

Apart from the state variable access comments, the PR looks fine and ready-to-merge to me now.

parcels/kernel.py

parcels/particle.py

parcels/particleset.py

angus-g · 2020-03-24T21:57:33Z

Thanks @erikvansebille and @CKehl!

This fixes a bug introduced in #678

After #678, `print(ParticleSet)` did not include the custom Variables anymore. This PR fixes that

angus-g and others added 11 commits October 21, 2019 09:19

Convert ParticleSet to structure of arrays

04e0e26

This means initialisation of particles is very cheap from Python, and shouldn't be any worse from the C kernel. This gives a massive speedup, and now the majority of time is spent in the kernel, where you'd expect.

Merge remote-tracking branch 'OceanParcels/master' into soa-storage

d0d5995

# Conflicts: # parcels/particlefile.py

Some improvements to particlefile.py to support soa-storage

08a3fbf

Note, not all unit tests pass yet. Am still working on that

Fixing ParticleFile for SOA

aa9b45a

Merge remote-tracking branch 'OceanParcels/master' into soa-storage

9036fee

# Conflicts: # parcels/examples/example_globcurrent.py # tests/test_kernel_language.py

Merge branch 'master' into soa-storage

929eaab

Fixing bug in particleset.__getattr__

30ecd4a

Fixing bugs in SOA implementation

e8de417

including flake8 fixes, and making ParticleSet object indexable and iterable again

Changing density test cases to iterable ParticleSet

a3b3205

Changing pressure in seawater density to capital P to avoid conflict …

4a0addb

…with small p for particle iterator

Fixing test_particles

d2f8ac5

erikvansebille added 9 commits January 9, 2020 15:31

Using c_void_p in JITParticle declaration to fix segmentation faults

cf1087a

Merge remote-tracking branch 'OceanParcels/master' into soa-storage

8731211

Fixing bug with MPI runs

a466206

Reverting some ParticleSet Iterator behaviour in examples

1653f8c

For backward compatibility checks

Merge remote-tracking branch 'OceanParcels/master' into soa-storage

0e4030f

# Conflicts: # parcels/particlefile.py

Adding np.uint64 check in codegenerator

9e10f5b

To try fix breaking CI on Windows Github Actions

Merge remote-tracking branch 'OceanParcels/master' into soa-storage

8b39848

# Conflicts: # parcels/particleset.py

Second attempt to fix uint64 error on windows

3fc2a3f

Update ci-workflow.yml

f73fa80

Updating to 2019 version of windows server for GitHub actions

erikvansebille mentioned this pull request Jan 27, 2020

Index errors in MPI mode with >1 process. #717

Closed

Merge remote-tracking branch 'OceanParcels/master' into soa-storage

ec616cc

# Conflicts: # tests/test_fieldset.py

angus-g mentioned this pull request Feb 2, 2020

Convert parcels to closer to upstream angus-g/lagrangian-filtering#43

Closed

3 tasks

CKehl reviewed Feb 4, 2020

View reviewed changes

parcels/codegenerator.py Show resolved Hide resolved

CKehl force-pushed the soa-storage branch from 1b148ea to ec616cc Compare February 5, 2020 11:09

erikvansebille mentioned this pull request Feb 6, 2020

Fix dtype_to_ctype for 64-bits inducer/cgen#26

Merged

minimum cgen version to fix windows issue with 64-bit

be0fb32

erikvansebille added 3 commits February 12, 2020 10:31

Merge branch 'master' into soa-storage

bff3417

fixing flake8 error

2a4bcae

Update ci-workflow.yml

8a2a0f2

Reverting to windows-2016

erikvansebille and others added 2 commits February 12, 2020 11:48

Update kernel.py

d3f3e48

Undoing logger.info comment now that Windows CI seems fixed (related to e.g. microsoft/azure-pipelines-tasks#12173)

Merge branch 'master' into soa-storage

b103d72

Handle all "repeat" particles at once

4803a7d

angus-g force-pushed the soa-storage branch from 98cfa65 to 4803a7d Compare February 21, 2020 04:27

erikvansebille mentioned this pull request Mar 19, 2020

Fix small_dt increments error #762

Merged

erikvansebille added 2 commits March 20, 2020 11:13

Merge remote-tracking branch 'upstream/master' into soa-storage

7dac659

Fixing merging bugs

7bf3ff8

erikvansebille mentioned this pull request Mar 20, 2020

Support setting a particle state to skip advection #727

Closed

CKehl reviewed Mar 23, 2020

View reviewed changes

parcels/kernel.py Show resolved Hide resolved

parcels/kernel.py Show resolved Hide resolved

parcels/particle.py Show resolved Hide resolved

parcels/particleset.py Show resolved Hide resolved

Implementing Particle.set_state() method

a7112be

CKehl approved these changes Mar 24, 2020

View reviewed changes

erikvansebille merged commit be5dde9 into Parcels-code:master Mar 24, 2020

angus-g deleted the soa-storage branch March 24, 2020 21:57

erikvansebille added a commit that referenced this pull request Mar 25, 2020

Adding __repr__ method for ParticleAccessor class

006f7e3

This fixes a bug introduced in #678

erikvansebille added a commit that referenced this pull request Mar 25, 2020

Adding __repr__ method to ParticleAccessor class

f64b0f1

This fixes a bug introduced in #678

erikvansebille mentioned this pull request Mar 25, 2020

Adding __repr__ method to ParticleAccessor class #775

Merged

erikvansebille mentioned this pull request Apr 1, 2020

Fixing a bug with recent Structure-of-arrays PR and multiple grids #782

Merged

angus-g linked an issue Apr 8, 2020 that may be closed by this pull request

Particle creation efficiency #665

Closed

erikvansebille added a commit that referenced this pull request Apr 23, 2020

Fixing printing of custom Variables

23b24f9

After #678, `print(ParticleSet)` did not include the custom Variables anymore. This PR fixes that

erikvansebille mentioned this pull request Apr 23, 2020

Fixing printing of custom Variables #819

Merged

erikvansebille mentioned this pull request Sep 2, 2023

Retiring AoS support #1423

Merged

Convert ParticleSet to structure of arrays #678

Convert ParticleSet to structure of arrays #678

Uh oh!

Conversation

angus-g commented Oct 20, 2019

Uh oh!

erikvansebille commented Jan 3, 2020

Uh oh!

angus-g commented Jan 29, 2020

Uh oh!

CKehl left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

angus-g commented Feb 12, 2020

Uh oh!

angus-g commented Feb 20, 2020

Uh oh!

CKehl commented Feb 21, 2020

Uh oh!

angus-g commented Feb 23, 2020

Uh oh!

CKehl commented Feb 24, 2020

Uh oh!

CKehl left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

angus-g commented Mar 24, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants