CPU time optimization for GraphInputMatcher by JackCaoG · Pull Request #7895 · pytorch/xla

JackCaoG · 2024-08-20T23:16:17Z

A couple of optimizations I made in this PR for GraphInputMatcher:

cache the `seed_info_id` instead of fetching it every time
cache the `arg_idxs`; they should be the same for all calls, so there is no need to calculate them on every execution
pre-allocate `real_input` instead of calling `append` over and over
move the `if tensor_id == self.seed_info_id` check inside the `if arg_idx is None` branch, since it is an uncommon case

With all of these, I was able to reduce the runtime of this function from 0.3 ms to 0.2 ms for the Llama 3 8B case with vLLM.
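For readers without the diff open, the four changes described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual torch_xla code: the class name `InputMatcherSketch` and the `get_seed_info_id` and `make_seed_value` callables are hypothetical stand-ins for the real seed lookup and seed tensor creation. It also assumes the seed tensor is never passed in as a regular argument, which is what makes it safe to check for it only when `arg_idx` is `None`.

```python
from typing import Any, Callable, Dict, List, Optional, Sequence


class InputMatcherSketch:
    """Hypothetical stand-in for GraphInputMatcher (illustration only)."""

    def __init__(
        self,
        tensor_id_to_arg_idx: Dict[int, int],
        graph_input_tensor_ids: Sequence[int],
        graph_input_values: Sequence[Any],
        get_seed_info_id: Callable[[], int],      # assumed stand-in for the seed id lookup
        make_seed_value: Callable[[Any], Any],    # assumed stand-in for building a fresh seed input
    ) -> None:
        self.graph_input_tensor_ids = list(graph_input_tensor_ids)
        self.graph_input_values = list(graph_input_values)
        self.make_seed_value = make_seed_value
        # (1) Look up the seed id once instead of on every call.
        self.seed_info_id = get_seed_info_id()
        # (2) Resolve each graph input's argument index once; the mapping
        #     is assumed to be identical for every execution.
        self.arg_idxs: List[Optional[int]] = [
            tensor_id_to_arg_idx.get(tensor_id)
            for tensor_id in self.graph_input_tensor_ids
        ]

    def __call__(self, args: Sequence[Any]) -> List[Any]:
        # (3) Pre-allocate the output list instead of appending repeatedly.
        real_input: List[Any] = [None] * len(self.arg_idxs)
        for i, arg_idx in enumerate(self.arg_idxs):
            if arg_idx is None:
                # (4) The seed comparison only runs on this uncommon path.
                if self.graph_input_tensor_ids[i] == self.seed_info_id:
                    real_input[i] = self.make_seed_value(self.graph_input_values[i])
                else:
                    # Not a user argument: reuse the value captured at trace time.
                    real_input[i] = self.graph_input_values[i]
            else:
                # Common case: the input is one of the call arguments.
                real_input[i] = args[arg_idx]
        return real_input
```

The per-call loop then does a single list index in the common case, with the dictionary lookup and the seed id fetch paid only once at construction time.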

CPU time optimization for GraphInputMatcher

4cd841e

JackCaoG added the dynamo label Aug 20, 2024

JackCaoG marked this pull request as ready for review August 21, 2024 01:12

JackCaoG requested review from alanwaketan and lsy323 August 21, 2024 02:16

qihqi approved these changes Aug 21, 2024

View reviewed changes

JackCaoG merged commit ae28308 into master Aug 21, 2024

JackCaoG deleted the JackCaoG/dynamo_input_matcher_optimization branch August 21, 2024 16:53

yitongh pushed a commit to AlibabaPAI/xla that referenced this pull request Oct 11, 2024

CPU time optimization for GraphInputMatcher (pytorch#7895)

e5f1c7b

yitongh pushed a commit to AlibabaPAI/xla that referenced this pull request Dec 11, 2024

CPU time optimization for GraphInputMatcher (pytorch#7895)

624a3d5

yitongh pushed a commit to AlibabaPAI/xla that referenced this pull request Dec 11, 2024

CPU time optimization for GraphInputMatcher (pytorch#7895)

10b55d7

JackCaoG merged 1 commit into master from JackCaoG/dynamo_input_matcher_optimization
