* Add loading of decoder layers in Model
* Multiply by hypernet_layer_cnt for ctx_size on model load
* Add decoder layers to py conversion script
* Fix wrong and reversed tensor sizes for decoder
* Add decoder transformer implementation
* Add decoder hypernet and iou prediction mlps
* Add transpose convolution operation and unit test
* Finish mask decoder and write the decoder output in the model state
* Output masks to png after removing padding and upsampling to original size
  - Also filter based on the iou threshold
  - Additionally filtering based on the stability score and crop boxes should be done
* Add stb image write in order to output masks from SAM
* Add transpose convolution 2d name and symbol to ggml ops static arrays
* Comment out debug print in transpose convolution test to fix compilation

ggml-ci
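For readers unfamiliar with the two mask filters mentioned in the commit message, here is a minimal sketch of how an IoU-threshold check and a stability-score check could be combined. It is not the code from this PR: `stability_score` and `keep_mask` are hypothetical helper names, the flat `std::vector<float>` of mask logits is an assumed layout, and the stability score follows the definition used in the reference SAM implementation as I understand it (pixels above `threshold + offset` divided by pixels above `threshold - offset`).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not from the PR): stability score of a single mask.
// It is the ratio of pixels above (threshold + offset) to pixels above
// (threshold - offset); a value near 1 means the binary mask barely changes
// when the cutoff moves.
float stability_score(const std::vector<float> & logits, float threshold, float offset) {
    assert(offset >= 0.0f);
    std::size_t n_high = 0; // pixels that pass the stricter cutoff
    std::size_t n_low  = 0; // pixels that pass the looser cutoff
    for (float v : logits) {
        if (v > threshold + offset) n_high++;
        if (v > threshold - offset) n_low++;
    }
    // An empty mask has no stable region, so report 0 instead of dividing by zero.
    return n_low == 0 ? 0.0f : (float) n_high / (float) n_low;
}

// Hypothetical helper (not from the PR): keep a mask only if the predicted IoU
// and the stability score both reach their cutoffs.
bool keep_mask(const std::vector<float> & logits, float iou_pred,
               float iou_threshold, float stability_threshold,
               float mask_threshold = 0.0f, float offset = 1.0f) {
    if (iou_pred < iou_threshold) {
        return false;
    }
    return stability_score(logits, mask_threshold, offset) >= stability_threshold;
}
```

A call such as `keep_mask(logits, iou_pred, 0.88f, 0.95f)` would run both checks; the two cutoffs here are illustrative values only and would need tuning against the actual decoder output.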
|
Add a README.md with instructions for obtaining and converting the model, and then we can merge it. We can continue optimizations from
|
Additionally I think we should:
Added all those next steps to the README, and I am going to start working on them |
|
@ggerganov I just stumbled upon your SAM code and this comment: |
|
@cmp-nct Ah good to know - thanks for looking into this |
Initial version: #418 (comment)
PTH tensors for ViT-B