Here's the output after some agentic iteration, so it wont output correct tool calling instead it outputs xml style tool_call
<think>
.....
<tool_call> <function=execute_bash_command> <parameter=command> node --check export.js </parameter> </function> </tool_call> </think>
After some frustrating time looking for solution, I thought maybe its the cache ? Since I reset the messages sent to the LLM if it call certain tool_name, and thats the answer
After I add just --no-cache_prompt, it works fine - but very slowly, understandable since so cache exists
Maybe my resetting the messages data, still have some similarity but different with the cached messages or something I'm not sure
9b is q6, 4b is q8
4b is less frequent (thats why I tried using 9b)
Here's the output after some agentic iteration, so it wont output correct tool calling instead it outputs xml style tool_call
After some frustrating time looking for solution, I thought maybe its the cache ? Since I reset the messages sent to the LLM if it call certain tool_name, and thats the answer
After I add just --no-cache_prompt, it works fine - but very slowly, understandable since so cache exists
Maybe my resetting the messages data, still have some similarity but different with the cached messages or something I'm not sure
9b is q6, 4b is q8
4b is less frequent (thats why I tried using 9b)