Skip to content

Similar to Whisper PR#228, this adds -ocsv, aka --output-csv, writing CSV file containing millisecond timestamps#340

Merged
ggerganov merged 1 commit intoggml-org:masterfrom
NielsMayer:master
Dec 29, 2022
Merged

Similar to Whisper PR#228, this adds -ocsv, aka --output-csv, writing CSV file containing millisecond timestamps#340
ggerganov merged 1 commit intoggml-org:masterfrom
NielsMayer:master

Conversation

@NielsMayer
Copy link
Copy Markdown
Contributor

For efficiency in storage, parsing and accuracy from representing timestamps in milliseconds (rather than floating-point numbers), this PR, like openai/whisper#228 adds feature to output CSV file containing lines formatted like:
<startTime-in-integer-milliseconds>, <endTime-in-integer-milliseconds>, "<transcript-line-including-commas>"

The CSV file is created when option -ocsv, or --output-csv is provided to the 'main' command-line.

Similar to the output file naming conventions for --output_vtt or --output_srt, the resulting CSV file for an input file named <mediaBaseName>.wav is named <mediaBaseName>.wav.csv

…feature to examples/main, which outputs a CSV file containing lines formatted as follows <startTime-in-integer-milliseconds>, <endTime-in-integer-milliseconds>, "<transcript-line-including-commas>".
@NielsMayer
Copy link
Copy Markdown
Contributor Author

One of the big advantages of whisper.cpp is that when you're processing a 31 HOUR video file, it doesn't just die with "KILLED" and no other debug info, like whisper does on the exact same 16k .wav file (alongside medium model, 16 threads, 4.45Ghz 8 core AMD 4750g CPU):

image

I also like how whisper.cpp correctly transcribes a C++ header file as "child.h" and not, say "child dot h" . (original source video: https://www.youtube.com/watch?v=8jLOx1hD3_o ) :-)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants