Add async store and pipelining code#4
Conversation
|
A general comment on the design choices in coding: This has at least 2 benefits:
Note that this is less about the runtime performance but more related to the readability and extendability of the code. |
|
Also, we need to have the updated Dockerfile and the deploy instructions (now it does not include installation of |
* [Add] optimized NIXL backend (WIP) * [Add] better async primitives * update cache engine log message Signed-off-by: ApostaC <yihua98@uchicago.edu> * [stash] debug works Signed-off-by: ApostaC <yihua98@uchicago.edu> * [Fix] performance and correctness bugs Signed-off-by: ApostaC <yihua98@uchicago.edu> * fix format checker issues Signed-off-by: ApostaC <yihua98@uchicago.edu> * [fix] isort errors Signed-off-by: ApostaC <yihua98@uchicago.edu> * fix format * fix format again * disable debug logs * update Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com> * updated Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com> * merge conflict Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com> * updated Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com> * updated Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com> * updated Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com> --------- Signed-off-by: ApostaC <yihua98@uchicago.edu> Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com> Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by: ApostaC <yihua98@uchicago.edu> Co-authored-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
add full_lookup in api_server
Add async store and pipelining code
Add async store and pipelining code
…request the current log belongs to (LMCache#4) LMCache#2812 Signed-off-by: baoloongmao <baoloongmao@tencent.com>
No description provided.