giganttheo/mmmmmm

♨️ mmmmmm toolkit: presentations Made Manageable & Memorable with MultiModal Models

A set of tools for understanding and processing multimodal presentations with deep learning models. Whether you need to navigate a complex presentation, extract key information, or analyze content, this toolkit aims to make presentations more manageable and memorable.


[===WIP===]

♨️ Be patient, it's cooking! ♨️


Tools

  • transcriber: automatic speech recognition with word-level timestamps
  • slide extractor: lightweight methods to extract the slides from a video recording
  • interleaver: create and visualize an interleaved slide-transcript representation of multimodal presentations
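
To give a rough idea of what an interleaved slide-transcript representation can look like, here is a minimal sketch. It is not the toolkit's actual API: the `Word` and `Slide` classes and the `interleave` function are hypothetical names introduced for illustration. It assumes the transcriber yields words with start and end times in seconds, and that the slide extractor yields the time at which each slide first appears.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass
class Word:
    # Hypothetical shape of a transcribed word with word-level timestamps.
    text: str
    start: float  # seconds
    end: float    # seconds


@dataclass
class Slide:
    # Hypothetical shape of an extracted slide.
    index: int
    start: float  # time at which the slide first appears, in seconds


def interleave(slides, words):
    """Attach each word to the slide that is on screen when the word starts."""
    if not slides:
        return []
    slides = sorted(slides, key=lambda s: s.start)
    starts = [s.start for s in slides]
    buckets = [[] for _ in slides]
    for word in words:
        # Index of the last slide whose start time is <= the word's start time.
        i = bisect_right(starts, word.start) - 1
        # Words spoken before the first slide appears go to the first slide.
        buckets[max(i, 0)].append(word.text)
    return [(s.index, " ".join(b)) for s, b in zip(slides, buckets)]


slides = [Slide(0, 0.0), Slide(1, 5.0)]
words = [
    Word("hello", 0.5, 0.9),
    Word("world", 1.0, 1.4),
    Word("next", 5.2, 5.6),
]
segments = interleave(slides, words)
```

With this input, the words "hello" and "world" are grouped under slide 0 and "next" under slide 1. Assigning a word by its start time is one simple choice among several; a word that straddles a slide change could equally be assigned by its midpoint.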

[WIP]

  • Generate an abstractive summary
  • Extract important slides and figures
  • Generate a cheat sheet
