Add Whisper benchmark script #17043

Merged
kunal-vaishnavi merged 19 commits into microsoft:main from kunal-vaishnavi:kvaishnavi/add-whisper-scripts
Aug 23, 2023

@kunal-vaishnavi
Contributor

Description

This PR adds a benchmark script for Whisper. It is a follow-up to this PR (#17020) that adds the LLaMA scripts.

Motivation and Context

This PR enables benchmarking Whisper across various configurations.
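
As a rough illustration of what benchmarking across configurations can look like, here is a minimal sketch of a latency loop with warm-up runs. It is not the script added in this PR: `benchmark`, `fake_inference`, and the `batch_size` configurations are hypothetical names chosen for the example, and the workload is a pure-Python stand-in rather than a Whisper model.

```python
import statistics
import time


def benchmark(fn, warmup_runs=2, num_runs=10):
    """Return (mean, population std dev) of fn's latency in seconds."""
    # Warm-up runs are executed but not timed.
    for _ in range(warmup_runs):
        fn()
    latencies = []
    for _ in range(num_runs):
        start = time.perf_counter()
        fn()
        latencies.append(time.perf_counter() - start)
    return statistics.mean(latencies), statistics.pstdev(latencies)


def fake_inference(batch_size):
    # Hypothetical stand-in workload (assumption): replace with a real
    # model call when adapting this sketch.
    return sum(i * i for i in range(10_000 * batch_size))


# Hypothetical configurations to sweep over (assumption).
configs = [{"batch_size": 1}, {"batch_size": 4}]

results = {}
for config in configs:
    mean_s, std_s = benchmark(lambda: fake_inference(**config))
    results[config["batch_size"]] = mean_s
    print(f"batch_size={config['batch_size']}: "
          f"{mean_s * 1000:.3f} ms (std {std_s * 1000:.3f} ms)")
```

Separating warm-up runs from timed runs keeps one-time costs such as lazy initialization out of the reported average.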

@kunal-vaishnavi
Contributor Author

Splitting changes for LLaMA and Whisper into separate PRs as discussed offline

Comment on lines +271 to +274
# def get_logits(inputs):
#     # Inference pass without decoding
#     outputs = model(**inputs)
#     return outputs

Check notice

Code scanning / CodeQL

Commented-out code

This comment appears to contain commented-out code.
@cloudhan
Contributor

#16895 is needed as a workaround for the ROCm EP to work. Otherwise, a lot of patching is needed for Optimum.

kunal-vaishnavi added a commit that referenced this pull request Aug 23, 2023
### Description
This PR adds the following scripts for LLaMA:
- LLaMA conversion (support for TorchScript and Dynamo exporters)
- LLaMA parity
- LLaMA benchmark
- LLaMA quantization
- LLaMA integration with [Hugging Face
Optimum](https://github.com/huggingface/optimum)



### Motivation and Context
This PR adds scripts for using LLaMA. There is a [follow-up
PR](#17043) for adding
scripts for Whisper.
@kunal-vaishnavi kunal-vaishnavi merged commit 4b3477f into microsoft:main Aug 23, 2023
tianleiwu pushed a commit that referenced this pull request Oct 31, 2023
tianleiwu pushed a commit that referenced this pull request Oct 31, 2023
### Description
This PR adds benchmark scripts for Whisper. It is a follow-up to [this
PR](#17020) that adds the
LLaMA scripts.



### Motivation and Context
This PR enables benchmarking Whisper across various configurations.
kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024
kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024

4 participants