26 Comments
Alejandro Aboy:

Amazing, Jenny! Some extras you might find interesting: RTK for token compression, and the claude-hud plugin to track usage and other artifacts in the terminal. They are really cool tools!

Jenny Ouyang:

Great to learn about those, Alejandro! Thanks for sharing them!

Bianca Schulz:

Have you ever thought about using a local model on strong hardware, or open-source frameworks for AI agents with a cheaper model subscription? I guess there must be a reason why you stay with Claude. What have your thoughts been?

Jenny Ouyang:

I did try different models, especially locally hosted ones. My argument to myself was that if I want to stay on top of the latest changes and really take advantage of Claude’s instruction-following capabilities, this is the price I’m willing to pay.

Of course, now I’m really paying the price 😂

Patrick Schaber:

Wow, Jenny! There were a few in here that I did not know about. Thank you so much for putting this together. I think you saved me a ton of money!

Jenny Ouyang:

Glad it helps, Patrick!

Abby Keul:

This is really helpful, Jenny. I have been burning quickly through my Claude Pro usage. I fed your advice into Claude and it helped me customize my token reduction strategy even further.

Jenny Ouyang:

I’m so happy to hear that, Abby! Glad it’s of practical use to you :)

My other self:

Thank you, thank you, thank you!

Jenny Ouyang:

You are very welcome!

Hope this saves you something :)

Karen Spinner:

All great tips! Thank you for sharing the surprise bill from hell and explaining how to avoid it!

Jenny Ouyang:

Thank you for reading it! I hope no one gets the same kind of bill :)

Karen Spinner:

💯

Julia | Taking you global:

Great article, Jenny!

Jenny Ouyang:

Thank you, Julia 🤗

AI Meets Girlboss:

Nothing teaches prompt discipline quite like a $1600 bill hitting my inbox. 🫣Loved how practical this was, especially the token-saving tips.🩷🦩

Jenny Ouyang:

I learned this the hard way, and I hope anyone reading this won’t get hit by it the same way 😅

Thank you Pinkie!

David Richard:

I just started using (discovered?) the /advisor setting for my coding work. Have you explored it?

Jenny Ouyang:

I haven’t used it before; it looks like a very useful setting. I’ve been eagerly exploring it now :)

David Richard:

This just happened on my end:

⏺ Now I have everything I need. Let me get the advisor's perspective before writing a complex new file.

⏺ Advising using Opus 4.7

⎿  ✔ Advisor has reviewed the conversation and will apply the feedback

So I guess Claude is getting a second opinion to check what Sonnet came up with. Kinda cool.

Peter Simmons:

That is a context-memory manager designed to let you save everything important without filling up the actual context, retrieving only what you need.

Jenny Ouyang:

Great work Peter!

Luc B. Perussault-Diallo:

The tool output accumulation insight is underrated. It's not just that the output is big. It's that a 2,000-line raw test log dumped into context doesn't actually help the model reason about the failure any better than a 20-line structured summary would. Volume and usefulness are different things, and most optimization advice only addresses the first one.

Subagent delegation helps. But the other option is giving the model pre-analyzed context (what calls what, what's in scope for this change) so it doesn't need to discover relationships through expensive tool calls in the first place. That's the angle I'm exploring with Sense (https://luuuc.github.io/sense).
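The summary-over-dump idea from the comment above can be sketched concretely. This is a minimal, hypothetical example (the function name, regex patterns, and log format are my own assumptions, not from the article): it condenses a raw test log into a few structured lines before anything reaches the model's context.

```python
import re
from collections import Counter

def summarize_test_log(raw_log: str, max_failures: int = 5) -> str:
    """Condense a raw test log into a short structured summary.

    Instead of dumping thousands of raw log lines into context,
    keep only pass/fail counts and the first few failure lines.
    NOTE: the PASS/FAIL patterns below are illustrative; adapt
    them to your actual test runner's output.
    """
    lines = raw_log.splitlines()
    counts = Counter()
    failures = []
    for line in lines:
        if re.search(r"\bPASS(ED)?\b", line):
            counts["passed"] += 1
        elif re.search(r"\bFAIL(ED)?\b", line):
            counts["failed"] += 1
            if len(failures) < max_failures:
                failures.append(line.strip())
    summary = [
        f"{counts['passed']} passed, {counts['failed']} failed "
        f"({len(lines)} raw log lines)"
    ]
    summary += [f"  - {f}" for f in failures]
    return "\n".join(summary)
```

A thousand-line log collapses to at most `max_failures + 1` lines, which is usually all the model needs to reason about what broke.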