Handle case of n_outputs > n_inputs by nden · Pull Request #12 · embray/astropy

nden · 2014-12-22T17:17:42Z

@embray I feel the problem with n_outputs > n_inputs is relevant to this PR. Consider the following failing example:

g1 = models.Gaussian1D(9.4, 3, 1.2)
g2 = models.Gaussian1D(4.4, 2, 1.2)
g3 = models.Gaussian1D(9, 3.3, 1)
m = Mapping((0, 1, 0))
m1 = m | (g1 & g2 & g3)

(It may also be relevant to single models but I don't have an example of such a model now.)
This PR suggests a way of fixing the problem, making the assumption that all outputs from a model have the same shape. I think this is a reasonable assumption for now but it certainly may be revisited later.

parameter descriptors does not work yet.

…le output models.

… a True-like value, so ensure that we actually get a float when a float result is expected.

…e output models

…l as classes

…creating new model classes with custom names.

…s. In retrospect it should have been blatantly obvious that was a bad idea, and unnecessary anyways. Turns out Python creates a new method object every time you access a method on an object anyways (this is still reasonably efficient, as it uses a cache of already malloc'd PyMethodObjects and just updates the im_func and im_self pointers on those objects).

…erly support parameterless models here.

…om for improvement here but this is a start.

…y expression. This requires one input which is a mapping of operators to a numerical precedence so that the correct order of operations can be determined. This ended up looking a lot like ExpressionTree.evaluate, which might be able to be used to implement this with a little refactoring.

…afs. This method still needs more work though :/

…improved, later.

…il the enhancements to the tree display are complete.

… a new instance is returned, rather than the class itself. If the class is for some reason needed it can still be gotten at, but in most cases it seems to be clearer if we just return an instance.

…om so that it's out of the way while I do a little more reorg.

… process I actually rewrote traverse_postorder() to be, I think, a little clearer as to what's going on. But it still had the same bug (worse even because now it would go into an infinite loop). But simply ensuring that one node does not have two subtrees that are the same *object* seems good enough, I think.

…lasses.

…upport joining models. Now the eval chain is not built up of functions, but tuples of functions and their input/output counts. This information is required to make the join operator work (and could be useful for other validation purposes)

…s into a separate method; this might prove useful later.

…be looked up by their names. Now we also need to add this capability for CompoundModel instances.

…maintain a separate name->submodel mapping just maintain an ordered tuple of the names of the submodels (with ordering corresponding to their order in the ._submodels list). In particular I think this will prove more useful for compound model instances

…s as well (though indexing does not work yet on instances; that's going to take some more reworking

…ormatting

… list) rather than a lambda--turns out lambdas will create problems for pickling)

…his converts the values in Model._param_metrics to a dict rather than a tuple for easier understanding of the code (maybe at some point a namedtuple?) and merged the _param_broadcast_shape attribute into _param_metrics (no reason they should be separate)

…ce--ensure that regardless of whether the compound model's template used model classes or instances a *new* model instance is returned whose parameters are linked to the compound model's parameters. This was done for now by changing Model._from_existing to a sharedmethod. Add a reminder to remove code that is only there to support legacy code, once said legacy code has been removed.

…eventing adding a name to Rotation2D)

…, since it's not very useful in its current state. I have a new implementation of it that is much better and is 90% done, but has a few corner cases to be worked out. I will add it back in when it's ready.

fix model slicing corner cases

…ndex, which is doing the same thing, essentially.

…he print() function.

…n 2. There is still a separate bug that seems to affect both Python versions, but this fixes the case that was currently in the test suite.

…methods; it seems at some point I accidentally reversed the logical sense of this. The doctest is still broken on Python 3, but it's a mystery that it's even running at all (doctests shouldn't be running on Python 3, generally)

compound models created from model instances, brought up in this comment: astropy#3231 (comment) This is still a bit of a hack, but not such a bad one, and far less brittle than the temporary fix that was introduced in 2112c12 (I didn't necessarily intend it to be temporary but I at least deeply hoped that a better way could be found later--for now this is it). This makes it possible to pass in additional class members that should be added to classes generated from _CompoundModelMeta._from_operator, and includes a helper function _model_operator that makes it easy to make replacements for the standard operator functions (operator.add, etc.) that can include additional members in the returned classes. This way, when taking slices, we can include the critical _slice_offset attribute in the class creation as soon as it is needed. Clearly this is only an internal detail right now, but I can envision this being potentially useful to open this up, in some form, to a documented feature for customizing the behavior of model expressions (likely with the help of some context manager in which the expressions are evaluated...)

…aken from compound models.

…Parameter will do since that's where its __init__ parameters are documented.

idea--from a class's point of view, and from the point of view of any code inspecting a class's __dict__ (particularly, API doc generation by sphinx) this makes a sharedmethod indistinguishable from a normal classmethod which is as it should be.

turbulence should still be expected, but not as *much*, at least not for the most common use cases, and I want to give more of a guarantee of backward compat support as much as possible. I don't want to scare people away from getting what they can out of this. [ci skip]

to the original report of this issue, explaining how I solved the problem: As I suspected, this had to do with a concession I had to make way back in this PR: astropy#2634 This concession is mentioned in the section of the PR text "Caveats" under item 3. The rules I came up with for broadcasting arrays work for the vast majority of models, but there are some (particularly ones like AffineTransformation2D that involve multiple coordinates) where they don't quite work. As a temporary workaround I added a `standard_broadcasting = False` flag to these models--this allows them to opt out of the standard assumptions (and, if they wish, implement their own prescriptions by overriding prepare_inputs and prepare_outputs). The trick here was to make sure that a compound model containing AffineTransformation2D (or others like it) propagate the `standard_broadcasting = False` flag. If one model in a compound model has this limitation, then it must, for now, be applied to the entire model. In the near future I hope to do something a little less brute-force here. I envision models being able to provide some sort of broadcast_hints data structure, which I think can be fairly simple. Most models wouldn't need to use this (the defaults are fine). But this could explain to the code exactly what inputs and parameters are used with each other, and how they should fit with each other. (Obviously reimplementing prepare_inputs/outputs could do this too, but I think the realistic cases are still limited enough that this can be done in a declarative manner and avoid having to reimplement lots of code over and over again). In the meantime, the fix I will add shortly should be good, I think, for most cases we care about?

embray · 2014-12-22T19:04:10Z

I see. Instead of making a PR against my compound modeling branch it would be fine to just send a PR against the main Astropy--while I agree this affects compound models it's still a more general problem. And include a functional test of some sort.

I think this is a reasonable workaround for now, though I think maybe this could be done in a way that's a little more clear about what's happening; it's pretty unclear from the code here why or where an IndexError might be expected (though this is admittedly dangerous ground to begin with). I'll see if I can think of a way to explicitly check for this case. Though I agree with your approach for the end result, for lack of a better way to specify how the outputs should be formatted.

embray · 2014-12-22T19:13:21Z

I'm working on some tests for this.

embray · 2014-12-22T19:51:23Z

I have a PR with a test that I'll go ahead and submit. It takes the same approach this this fix, but instead tries to make it more explicit that this is supported at least in the basic sense. I tested the above example against the fix too and it works as expected.

they have inputs, at least in the most basic cases. This sort of works the same as how models with more inputs than they have outputs work, which is that each input is not necessarily directly tied to each output; instead all outputs are just assumed to have the same dimensions, which are determined from the result of broadcasting all inputs with all parameters. In the future there may be more ways for individual model implementations to control this (besides outright overriding prepare_inputs and/or prepare_outputs). This provides an alternate to this PR: #12

Fix #12

embray added 30 commits December 17, 2014 12:27

Fix handling of __init__ generation in cases where getattr() of

b167cb0

parameter descriptors does not work yet.

WIP: Initial commit. Working prototype for at least single input/sing…

194b186

…le output models.

Add a very simple test for model composition

fd8415e

It turns out comparing a single-element ndarray with a scalar returns…

970491c

… a True-like value, so ensure that we actually get a float when a float result is expected.

Properly support compositions with array-like values and with multipl…

faf1be3

…e output models

Rudimentary support for using model *instances* in expressions as wel…

b809e64

…l as classes

Initial support for giving custom names to model classes

9473784

Add initial support for assigning custom names to models, as well as …

99cbe3b

…creating new model classes with custom names.

This is still temporary code, I think. But this fix is needed to prop…

3131a5d

…erly support parameterless models here.

Some improved representation for model classes themselves. There's ro…

807e05e

…om for improvement here but this is a start.

Reformat format_ascii_tree a bit, supporting numerical labeling of le…

dad6f5a

…afs. This method still needs more work though :/

Initial work on better repr for CompoundModel classes.

c0864a8

A better default docstring for __call__ -- this can still be greatly …

fd35bcc

…improved, later.

Fix basic composition of models with different numbers of outputs

b8628b1

Allow a name to be specified on compound model instances

355b34a

Don't repr ExpressionTrees as an ascii tree for now, at least not unt…

938479a

…il the enhancements to the tree display are complete.

Fixed some model repr-related tests that broke as a result of 569acf9

d0a1c4e

Updated so that when combining two model *instances* in an expression…

676fde9

… a new instance is returned, rather than the class itself. If the class is for some reason needed it can still be gotten at, but in most cases it seems to be clearer if we just return an instance.

A bit of reorg--just moving the old composite model stuff to the bott…

9781537

…om so that it's out of the way while I do a little more reorg.

A couple fixes to improve debugging of non-concrete models or model c…

0febe98

…lasses.

Break off initialization of the ._parameters array and ._param_metric…

8375c69

…s into a separate method; this might prove useful later.

Improve indexing of CompoundModel classes so that submodels can also …

2e2dc05

…be looked up by their names. Now we also need to add this capability for CompoundModel instances.

Gets the .submodel_names attribute working on compound model instance…

98aa74b

…s as well (though indexing does not work yet on instances; that's going to take some more reworking

Fix minor formatting issue for Model.__repr__

9904bd1

Fixed the order of operators and added several tests for expression f…

81704b2

…ormatting

embray and others added 20 commits December 17, 2014 17:12

Use the list class directly (since calling list() returns a new empty…

60b5130

… list) rather than a lambda--turns out lambdas will create problems for pickling)

Get rid of no longer useful Rotation2D.__init__ (which in fact was pr…

0dda7ee

…eventing adding a name to Rotation2D)

Removing the ExpressionTree.format_tree_ascii method entirely for now…

6a24f59

…, since it's not very useful in its current state. I have a new implementation of it that is much better and is 90% done, but has a few corner cases to be worked out. I will add it back in when it's ready.

fix model slicing corner cases

0d66a05

Merge pull request astropy#11 from nden/slicing2

3021380

fix model slicing corner cases

Replaced an additional bit of existing code with check_for_negative_i…

f878379

…ndex, which is doing the same thing, essentially.

Fix doctest that was failing on Python 2 due to assumption of using t…

c5452e6

…he print() function.

This fixes one of the slicing-related tests that was failing on Pytho…

2112c12

…n 2. There is still a separate bug that seems to affect both Python versions, but this fixes the case that was currently in the test suite.

Fix the isinstancemethod function's doctest to work on Python 2 and 3.

f0f68ed

Fixes handling of the .parameters array for subexpressions (slices) t…

88361d5

…aken from compound models.

Don't bother linking to Parameter.__init__; just linking directly to …

a23161e

…Parameter will do since that's where its __init__ parameters are documented.

Adding a little more info to this test's docstring

204f728

Handle case of n_outputs > n_inputs

859c07e

embray mentioned this pull request Dec 22, 2014

Modeling/handle more out than in astropy/astropy#3250

Merged

embray force-pushed the modeling/compound branch 2 times, most recently from 0ae8a95 to 8bdb9bc Compare December 30, 2014 16:30

embray pushed a commit that referenced this pull request Jan 23, 2015

Use six.iteritems for compatibility with python 3

51c0065

Fix #12

embray closed this Mar 12, 2015

nden deleted the large_n_outputs branch April 30, 2018 12:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle case of n_outputs > n_inputs#12

Handle case of n_outputs > n_inputs#12
nden wants to merge 82 commits into
embray:modeling/compoundfrom
nden:large_n_outputs

nden commented Dec 22, 2014

Uh oh!

embray commented Dec 22, 2014

Uh oh!

embray commented Dec 22, 2014

Uh oh!

embray commented Dec 22, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nden commented Dec 22, 2014

Uh oh!

embray commented Dec 22, 2014

Uh oh!

embray commented Dec 22, 2014

Uh oh!

embray commented Dec 22, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants