@shelhamer
Member

The sigmoid cross-entropy loss was passed over when the GPU implementations were written; this PR completes GPU mode by adding the GPU forward pass. This resolves a (perhaps surprisingly severe) performance penalty caused by communication overhead: without a GPU forward, the loss falls back to the CPU forward, so the data has to make a round trip between device and host on every iteration. When the target is high-dimensional, say a whole image for an FCN, that round trip could slow iterations by more than 10x. Not anymore!
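For reference, here is a minimal CPU sketch of the per-element quantity a GPU forward pass would compute in parallel, one element per thread. It is not the code in this PR: the function names `sigmoid_ce_element` and `sigmoid_ce_loss` are hypothetical, and dividing by the element count is an assumed normalization chosen only to keep the example small. It uses the standard numerically stable form of the loss, which avoids overflow in `exp` for large logits.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Per-element sigmoid cross-entropy for logit x and target t in [0, 1],
// in a numerically stable form:
//   loss = max(x, 0) - x * t + log(1 + exp(-|x|))
// This equals -[t * log(s(x)) + (1 - t) * log(1 - s(x))], with s the sigmoid,
// but never exponentiates a large positive number.
// (Hypothetical helper name, for illustration only.)
inline double sigmoid_ce_element(double x, double t) {
  return std::fmax(x, 0.0) - x * t + std::log1p(std::exp(-std::fabs(x)));
}

// Mean of the per-element losses. Each term is independent of the others,
// which is why the work maps naturally onto one GPU thread per element.
// Assumption: normalizing by the element count; other normalizations exist.
double sigmoid_ce_loss(const std::vector<double>& logits,
                       const std::vector<double>& targets) {
  assert(logits.size() == targets.size());
  if (logits.empty()) return 0.0;
  double sum = 0.0;
  for (std::size_t i = 0; i < logits.size(); ++i) {
    sum += sigmoid_ce_element(logits[i], targets[i]);
  }
  return sum / static_cast<double>(logits.size());
}
```

Calling `sigmoid_ce_loss` on a whole-image target touches every pixel once. On the GPU the same elementwise work stays on the device, so the logits and targets never need to be copied back to the host.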

Closes #3004.

@shelhamer force-pushed the sigce-gpu branch 2 times, most recently from b375432 to 3b5a352, on October 27, 2016 08:04
@shelhamer merged commit 6491504 into BVLC:master on Oct 27, 2016
@shelhamer deleted the sigce-gpu branch on October 27, 2016 20:11

Development

Successfully merging this pull request may close these issues.

SigmoidCrossEntropyLossLayer::Forward_gpu() is not implemented
