Segfault using cudnn v3 #30

@maingoh

Hello,

I am getting a segfault with versions 0.13.1 and 0.13.2 when using cuDNN v3. Everything works fine when I disable cuDNN or use the latest BVLC version. The code I am using:

    Caffe::set_mode(Caffe::GPU);
    Caffe::SetDevice(0);

    Net<float> net("deploy.prototxt", caffe::TEST);
    net.CopyTrainedLayersFrom("test.caffemodel");

Here is the trace from gdb:

    #0  0x00007ffff780ef27 in caffe::caffe_rng_rand() () from ./build/lib/libcaffe-nv.so.0
    #1  0x00007ffff78fa210 in caffe::DataTransformer<float>::InitRand() () from ./build/lib/libcaffe-nv.so.0
    #2  0x00007ffff78a9914 in caffe::BaseDataLayer<float>::LayerSetUp(std::vector<caffe::Blob<float>*, std::allocator<caffe::Blob<float>*> > const&, std::vector<caffe::Blob<float>*, std::allocator<caffe::Blob<float>*> > const&) () from ./build/lib/libcaffe-nv.so.0
    #3  0x00007ffff7918fc1 in caffe::Net<float>::Init(caffe::NetParameter const&) () from ./build/lib/libcaffe-nv.so.0
    #4  0x00007ffff791a807 in caffe::Net<float>::Net(std::string const&, caffe::Phase) () from ./build/lib/libcaffe-nv.so.0
    #5  0x000000000040179e in main () at test.cpp:10

Debugging a little further led me to this line in Caffe::rng_stream():

    Get().random_generator_.reset(new RNG());
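For readers unfamiliar with the pattern on that line, here is a minimal, self-contained sketch of a per-thread singleton that lazily allocates its random generator through a shared pointer. It is an illustration only, not the library's actual code: `Context`, the use of `thread_local` and `std::shared_ptr` in place of the Boost types seen in the traces, the `std::mt19937` generator, and the fixed seed are all assumptions made for this example. It does not reproduce or diagnose the crash.

```cpp
#include <cassert>
#include <memory>
#include <random>

// Hypothetical stand-in for a per-thread context object (not Caffe's class).
class Context {
 public:
  // Returns the calling thread's instance, constructing it on first use.
  static Context& Get() {
    thread_local Context instance;
    return instance;
  }

  // Returns this thread's generator, allocating it lazily on first call.
  static std::mt19937& rng_stream() {
    Context& ctx = Get();
    if (!ctx.random_generator_) {
      // Fixed seed chosen for the example only (assumption).
      ctx.random_generator_.reset(new std::mt19937(12345u));
    }
    assert(ctx.random_generator_ != nullptr);
    return *ctx.random_generator_;
  }

 private:
  Context() = default;
  std::shared_ptr<std::mt19937> random_generator_;
};

// Draws one value from the calling thread's generator.
unsigned int rng_rand() {
  return static_cast<unsigned int>(Context::rng_stream()());
}
```

In this sketch, calling `rng_rand()` twice on the same thread reuses one generator; the second call skips the allocation because the pointer is already set.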

I also get a similar segfault when I only set the mode with Caffe::set_mode(Caffe::GPU) and do nothing else. Here is the stack:

    #0  0x00007ffff7806f05 in boost::detail::shared_count::~shared_count() () from ./build/lib/libcaffe-nv.so.0
    #1  0x00007ffff7903e85 in caffe::Caffe::~Caffe() () from ./build/lib/libcaffe-nv.so.0
    #2  0x00007ffff7905ed1 in boost::thread_specific_ptr<caffe::Caffe>::delete_data::operator()(void*) () from ./build/lib/libcaffe-nv.so.0
    #3  0x00007fffef8b2b21 in boost::detail::set_tss_data(void const*, boost::shared_ptr<boost::detail::tss_cleanup_function>, void*, bool) ()
       from /usr/lib/x86_64-linux-gnu/libboost_thread.so.1.54.0
    #4  0x00007ffff7905bee in boost::thread_specific_ptr<caffe::Caffe>::~thread_specific_ptr() () from ./build/lib/libcaffe-nv.so.0
    #5  0x00007ffff6fc25ea in __cxa_finalize (d=0x7ffff7dd7c08) at cxa_finalize.c:56
    #6  0x00007ffff77b0a23 in __do_global_dtors_aux () from ./build/lib/libcaffe-nv.so.0
    #7  0x00007fffffffdfe0 in ?? ()
    #8  0x00007ffff7dea73a in _dl_fini () at dl-fini.c:252

Thank you for your help!
Best regards,
Hugo
