Skip to content

Conversation

@ducha-aiki
Copy link
Contributor

This layer could be used as building brick for custom learnable activation functions. PR is ready for review.

@siddharthm83
Copy link

@ducha-aiki , as per #3229 (comment)
Attached

  1. Sample output when this layer is used in CPU and GPU
  2. Prototxt file used (caffenet +bn +ea)

eltwiseaffine_gpu_bug.txt
train_val_prototxt.txt

@ducha-aiki
Copy link
Contributor Author

@siddharthm83 Thanks, I will take a look on it.

@siddharthm83
Copy link

@ducha-aiki , the log link seems broken. When you say restarting training, it is gone, what do you exactly mean (are you restarting with a saved solverstate?)

@ducha-aiki
Copy link
Contributor Author

@siddharthm83 sorry, filenames changed after repo update.

But it does`t matter - I have done 100 restarts in script and it seems code is broken - it recovers only 2 times :( So time for debugging for me
By restart I mean start from same initialization - I initialize net in python and save weights.

@siddharthm83
Copy link

@ducha-aiki , any luck in identifying the issue? No hurry, just checking.

@ducha-aiki
Copy link
Contributor Author

@siddharthm83 sorry, ICLR rebuttal deadline. Going to fix next week.

@itijyou
Copy link

itijyou commented Jan 11, 2016

@ducha-aiki , I think here might be the problem, not CblasTrans but CblasNoTrans.

// Gradient with respect to bias
for (int n = 0; n < num; ++n) {
  caffe_gpu_gemv<Dtype>(CblasNoTrans, channels, height * width, (Dtype)1.,
      top_diff + top[0]->offset(n), bias_multiplier_.gpu_data(), (Dtype)1.,
      bias_diff);
}

@ducha-aiki
Copy link
Contributor Author

@wuzifeng thanks! Looks like it was the cause. I will finish training caffenet with it and update PR.

@siddharthm83
Copy link

awesome, thanks @ducha-aiki and @wuzifeng

@ducha-aiki
Copy link
Contributor Author

Travis crashes before it tries to compile anything, on stage of downloading packages.

@jeffdonahue
Copy link
Contributor

Replaced and merged in #3591. Thanks again for this work and your benchmarks @ducha-aiki!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants