This repository was archived by the owner on Jun 5, 2025. It is now read-only.

Add FIM functionality for VLLM provider #132

Merged
merged 1 commit into main on Nov 29, 2024

Conversation

Contributor

@aponcedeleonch commented Nov 29, 2024

This enables FIM completion for VLLM. Tested with Continue:

```json
"tabAutocompleteModel": {
       "title": "CodeGate - Stacklok Hosted",
       "provider": "openai",
       "model": "Qwen/Qwen2.5-Coder-14B",
       "apiKey": "$token",
       "apiBase": "http://localhost:8989/vllm"
},
```

Comment on lines +19 to +21
```python
completion_handler = LiteLLmShim(
    stream_generator=sse_stream_generator, fim_completion_func=atext_completion
)
```
Contributor Author

This is to tell litellm to use atext_completion instead of acompletion when the request is FIM. Continue gives us a prompt instead of messages, but atext_completion is able to handle it.
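
For context, a minimal sketch of how a shim like this might dispatch between the two litellm entry points; the class and method names below are illustrative, not CodeGate's actual implementation:

```python
import litellm


class LiteLLmShimSketch:
    """Illustrative only: route FIM requests to atext_completion, chat to acompletion."""

    def __init__(self, stream_generator,
                 completion_func=litellm.acompletion,
                 fim_completion_func=litellm.atext_completion):
        self._stream_generator = stream_generator
        self._completion_func = completion_func
        self._fim_completion_func = fim_completion_func

    async def execute(self, request: dict, is_fim_request: bool):
        if is_fim_request:
            # Continue sends FIM requests with a plain "prompt" field,
            # which atext_completion accepts directly.
            return await self._fim_completion_func(**request)
        # Chat requests carry a "messages" list with roles instead.
        return await self._completion_func(**request)
```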

Contributor Author

A downside of using atext_completion is that it returns a TextCompletionResponse instead of a ModelResponse like our regular acompletion. I checked and the parameters of both objects are almost identical; hopefully it doesn't cause too many problems.

Would that have an impact on the pipeline processing, @jhrozek?

Contributor

I will do some testing. The pipeline does expect a ModelResponse; maybe we could normalize TextCompletionResponse into ModelResponse.

Contributor

OK, I had another look and the biggest difference between atext_completion and acompletion in liteLLM is that acompletion receives a conversation with multiple prompts and roles, while atext_completion just receives a single prompt.

If this is OK for you then let's go ahead. I'm not sure if FIM will use any system prompts or such.
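
Roughly, the two call shapes differ like this (a sketch only; the model string, api_base, api_key, and FIM tokens are placeholders, with the FIM prompt assuming Qwen2.5-Coder's fill-in-the-middle tokens):

```python
import asyncio
import litellm


async def main():
    # Chat-style: acompletion receives a conversation with roles.
    chat = await litellm.acompletion(
        model="openai/Qwen/Qwen2.5-Coder-14B",   # placeholder provider/model string
        api_base="http://localhost:8000/v1",     # placeholder vLLM endpoint
        api_key="dummy",                         # placeholder key
        messages=[{"role": "user", "content": "Write a hello-world in Python."}],
    )
    print(chat.choices[0].message.content)

    # FIM-style: atext_completion just receives a prompt string, which is what
    # Continue sends for tab autocompletion.
    fim = await litellm.atext_completion(
        model="openai/Qwen/Qwen2.5-Coder-14B",
        api_base="http://localhost:8000/v1",
        api_key="dummy",
        prompt="<|fim_prefix|>def add(a, b):\n    return <|fim_suffix|>\n<|fim_middle|>",
    )
    print(fim.choices[0].text)


asyncio.run(main())
```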

Contributor Author

I will merge and attempt to do the normalization of TextCompletionResponse to ModelResponse in a separate PR. I agree it's better if we try to keep things as normalized as possible.
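
For reference, one possible shape of that normalization, sketched against the OpenAI-style completion and chat schemas; this is not the follow-up PR's actual code:

```python
from litellm import ModelResponse


def text_completion_to_model_response(text_resp) -> ModelResponse:
    """Map a TextCompletionResponse-like object onto the chat-style ModelResponse
    the pipeline expects. Sketch only: field names follow the OpenAI schemas."""
    return ModelResponse(
        id=text_resp.id,
        created=text_resp.created,
        model=text_resp.model,
        object="chat.completion",
        choices=[
            {
                "index": choice.index,
                "finish_reason": choice.finish_reason,
                # The completed text becomes the assistant message content.
                "message": {"role": "assistant", "content": choice.text},
            }
            for choice in text_resp.choices
        ],
        usage=getattr(text_resp, "usage", None),
    )
```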

@aponcedeleonch aponcedeleonch merged commit de91231 into main Nov 29, 2024
2 checks passed