generated from kubernetes/kubernetes-template-project
-
Notifications
You must be signed in to change notification settings - Fork 232
Open
Labels
kind/bugCategorizes issue or PR as related to a bug.Categorizes issue or PR as related to a bug.needs-triageIndicates an issue or PR lacks a `triage/foo` label and requires one.Indicates an issue or PR lacks a `triage/foo` label and requires one.
Description
What happened:
In the discussion #2133 (comment), we found that "tools" in the chat completion API are not considered in the prefix matching.
What you expected to happen:
"tools" should be considered to match vllm's logic. "tools" are often considered the first part of the chat template, example. so for the prefix scorer, we should merge "tools" field as first part of the prefix string.
How to reproduce it (as minimally and precisely as possible):
Anything else we need to know?:
Environment:
- Kubernetes version (use
kubectl version): - Inference extension version (use
git describe --tags --dirty --always): - Cloud provider or hardware configuration:
- Install tools:
- Others:
Metadata
Metadata
Assignees
Labels
kind/bugCategorizes issue or PR as related to a bug.Categorizes issue or PR as related to a bug.needs-triageIndicates an issue or PR lacks a `triage/foo` label and requires one.Indicates an issue or PR lacks a `triage/foo` label and requires one.