Shenyuan Gao

Results 40 comments of Shenyuan Gao

Hi. I think the following code may help you to solve this issue. In my case, the CPU occupation can be reduced by inserting these code, and the inference speed...

None, but from their [code](https://github.com/Stability-AI/generative-models/blob/main/sgm/modules/video_attention.py#L266-L276), I think such cues are fed to temporal blocks via frame index embedding.

I think the minor difference may come from the loss of autoencoder.

Yes! I also think it will be amazing if its generation quality can be extended to much longer sequences. BTW, I guess the temporal-aware deflickering decoder may also affect the...

Yes, I have also tried the image decoder, but it doesn't help. The temporal-aware decoder can greatly eliminate the jittering but doesn't affect the content. To enable identical preservation, it...

For SD project, set `use_checkpoint` as `False` can solve this issue. It won't affect the performance, but may increase your GPU memory requirement a little bit.

> I do implement the multi-gpu version for V2X-ViT, and the results actually raise some points. For people who want to use, please leave your email here. Thanks a lot...

As far as I know, there is no official complete code currently open-sourced online. You need to implement the remaining functions according to the paper by yourself.

Hi. I am also curious about how you made it. Did you apply any other techniques to ensure the identity of the first frame, or did you simply fix the...

The public code is not perfectly organized with several missing parts, but it should be easy to reproduce their reported performance even you directly ignore and bypass those uncompleted implementations....