Background Description
The llama-server takes around 20-30s to compile, making the development experience not very pleasant.
By abstracting the HTTP layer, it can be compiled independently from the main server logic.
As a side effect, this may allow downstream project to replace the HTTP layer with their own implementation.
Possible Refactor Approaches
No response