[WIP] Word level timestamp for long-form generation #28984
patrickvonplaten wants to merge 2 commits into main
|
It's actually much harder to do this than I thought, and I sadly won't have time to finish this PR, so I'll leave it in this form. We're facing the following problem here.
If anybody in the community is willing to give this PR a try, feel free to use any or all of my code. cc @sanchit-gandhi as well |
|
I will be taking over this issue, since I found that no one else is working on it. |
|
What does this PR do?
Fixes: #28977
We haven't added word-level timestamps for long-form generation yet. It's definitely possible, but it'll require some more changes in `generate`. Happy to take a closer look here in the next few days.

With the PR in its current state, one can retrieve word-level timestamps, but they are not correct because `_postprocess_outputs` is not correct. Test it with: