llama-server: Where can I find detailed explanations for the timing
sub-items in the chat completions response?
#14095
Unanswered
dev-jonghoonpark
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
hello.
Where can I find detailed explanations for the
timing
sub-items in the chat completions response?I couldn't find them at https://github.com/ggml-org/llama.cpp/tree/master/tools/server, and they don't seem to appear in search results under that repository either.
If they don't exist, could they be added?
While I can roughly infer their meanings, having a clear explanation documented would be helpful.
Thanks.
Beta Was this translation helpful? Give feedback.
All reactions