Add argmax_param "axis" to maximise output along the specified axis #3069

timmeinhardt · 2015-09-15T13:44:20Z

This PR adds an option to the ArgMaxLayer which lets the user specify an axis along which the layer should maximise the output. This now works similar to e.g. numpy.argmax and might be useful for deploying segmentation task networks.
The default option is axis = 0, which computes the argmax of the flattened bottom blob per image. So existing deployments of the layer work as expected. Negative indexing is also possible, but eventually the axis value must be between 0 and 3.
If axis != 0 and out_max_val is set to true the layer outputs max_val instead of max_ind. Specifying an axis and outputting max_val and max_ind at the same time was not possible due to the general architecture of a blob.

seanbell · 2015-09-15T14:40:26Z

Wouldn't axis = 1 make more sense as a default? By convention in caffe, axis = 0 is the dimension that indexes across images in a batch. Typically, axis = 1 is the "channel" that separates classes.

timmeinhardt · 2015-09-15T15:12:15Z

In order to argmax a classification task output as (10,1000,1,1) with 10 images and 1000 classes you could either set axis = 0 or axis = 1. The result would be the same, but in the first case the bottom blob is treated as a flattened array and in the second the width and height of the blob are considered.
I went for axis = 0 as default option because that's how the layer worked before and processing the flattened input usually is the default option for argmax implementations.

seanbell · 2015-09-16T03:14:30Z

OK I see what you've done. This is a great improvement to the layer, but I think the convention you've chosen for the axis parameter is confusing and can be improved.

If axis > 0, then your proposed layer is computing what you expect (argmax along that axis). If axis == 0, then the layer reverts to the old behavior (which is not actually computing argmax along axis 0). I think it's unintuitive and surprising to do it this way. For example, I would expect that for a (N, C) shape input, computing argmax along axis 0 should give an output shape of (top_k, C). I would be surprised to discover that it has actually given me an output of shape (N, top_k) and this would seem like a bug to me.

The current ArgmaxLayer didn't get updated during the 4D --> ND conversion in #1970, and has extra unnecessary dimensions of size 1 in the output. Also, we have easy efficient reshaping as of #2217. So the automatic reshaping that you get with this layer isn't really worth trying to preserve.

I think a better way to preserve backwards compatibility would be: if axis is not specified (check with has_axis), perform the old behavior. Otherwise, if it is set, compute the argmax along that actual axis, rather than making axis: 0 do something special that's not actually argmax along axis 0.

Or, you could have a flag (e.g. flatten with default true) that enables automatic flattening of the blob, and make the default axis: 1. This would also preserve the legacy behavior of this layer.

timmeinhardt · 2015-09-16T14:31:48Z

I got your point. Basically I was already using axis = 0 as if no axis was specified so I changed that to use has_axis instead. Now one can even argmax across multiple images using axis = 0. Perhaps this might be useful for someone but it is definitely more intuitiv. Thank you! Sometimes you do not see the wood for the trees when you are in the middle of something...
So now if no axis is specified the maximisation is computed along the flattened bottom blob per image and if axis is set it maximises along the specified axis.

shelhamer · 2015-09-24T03:13:23Z

src/caffe/layers/argmax_layer.cpp

This should check bottom[0]->num_axes() for generality now that blobs are N-D.

shelhamer · 2015-09-24T03:19:34Z

Thanks for contributing this @timmeinhardt and thanks for guiding the development @seanbell! This looks good in general but I'd like to see it done in N-D instead of the fixed 4 dimensions since it's nearly there. You could look at ConcatLayer or other layers with an axis arg.

timmeinhardt · 2015-09-25T16:21:27Z

I adjusted my commits and now it should work for N-D blobs as well. I updated the documentation as well. I did not know that you made this update to the blob architecture 👍

Add argmax_param "axis" to maximise output along the specified axis

shelhamer · 2015-10-01T00:43:11Z

Thanks @timmeinhardt !

AndKo1201 · 2015-10-28T12:53:19Z

@timmeinhardt could you please check the ArgMaxLayer::Reshape.

I'm getting a assertion when i work with debug lvl == 2 while executing the caffe unit tests.
It crashes in NetTest, TestSkipPropagateDown.

In this test the num_axes of bottom[0] is 2 and the value of has_axis_ is false.
But within the else you are trying to change the value of shape[2], which is an OutOfBounds access.

timmeinhardt force-pushed the argmax branch from c1da114 to b40a237 Compare September 15, 2015 14:59

timmeinhardt changed the title ~~Add argmax_param axis to maximise output along the specified axis~~ Add argmax_param "axis" to maximise output along the specified axis Sep 15, 2015

timmeinhardt force-pushed the argmax branch from b40a237 to 9d59979 Compare September 16, 2015 13:55

timmeinhardt force-pushed the argmax branch from 9d59979 to f95bf35 Compare September 22, 2015 14:10

shelhamer reviewed Sep 24, 2015
View reviewed changes

src/caffe/layers/argmax_layer.cpp Outdated

Copy link

Member

shelhamer Sep 24, 2015

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should check bottom[0]->num_axes() for generality now that blobs are N-D.

timmeinhardt force-pushed the argmax branch from f95bf35 to f9cad0a Compare September 24, 2015 08:35

timmeinhardt added 5 commits September 25, 2015 12:05

Add argmax_param axis

6c02c8b

Implement ArgMaxLayer forward_cpu and reshape for axis param

c77d5e5

Update ArgMaxLayer documentation for axis param

9b2d267

Generalise ArgMaxLayerTest bottom blob shape

a2a5e22

Implement ArgMaxLayerTest for axis param

def3d3c

timmeinhardt force-pushed the argmax branch from f9cad0a to def3d3c Compare September 25, 2015 10:06

shelhamer added a commit that referenced this pull request Oct 1, 2015

Merge pull request #3069 from timmeinhardt/argmax

01e15d0

Add argmax_param "axis" to maximise output along the specified axis

shelhamer merged commit 01e15d0 into BVLC:master Oct 1, 2015

timmeinhardt deleted the argmax branch October 1, 2015 16:47

AndKo1201 mentioned this pull request Nov 3, 2015

ArgMaxLayer Reshape missing checks for legacy path #3274

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add argmax_param "axis" to maximise output along the specified axis #3069

Add argmax_param "axis" to maximise output along the specified axis #3069

Uh oh!

timmeinhardt commented Sep 15, 2015

Uh oh!

seanbell commented Sep 15, 2015

Uh oh!

timmeinhardt commented Sep 15, 2015

Uh oh!

seanbell commented Sep 16, 2015

Uh oh!

timmeinhardt commented Sep 16, 2015

Uh oh!

shelhamer Sep 24, 2015

Uh oh!

shelhamer commented Sep 24, 2015

Uh oh!

timmeinhardt commented Sep 25, 2015

Uh oh!

shelhamer commented Oct 1, 2015

Uh oh!

AndKo1201 commented Oct 28, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add argmax_param "axis" to maximise output along the specified axis #3069

Add argmax_param "axis" to maximise output along the specified axis #3069

Uh oh!

Conversation

timmeinhardt commented Sep 15, 2015

Uh oh!

seanbell commented Sep 15, 2015

Uh oh!

timmeinhardt commented Sep 15, 2015

Uh oh!

seanbell commented Sep 16, 2015

Uh oh!

timmeinhardt commented Sep 16, 2015

Uh oh!

shelhamer Sep 24, 2015

Choose a reason for hiding this comment

Uh oh!

shelhamer commented Sep 24, 2015

Uh oh!

timmeinhardt commented Sep 25, 2015

Uh oh!

shelhamer commented Oct 1, 2015

Uh oh!

AndKo1201 commented Oct 28, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants