RedHatAI
/

Qwen3-VL-235B-A22B-Instruct-speculator.eagle3

Model card Files Files and versions

ekurtic commited on Apr 8

Commit

be00b3b

·

verified ·

1 Parent(s): 92a6f45

Update README.md

Files changed (1) hide show

README.md +13 -1

README.md CHANGED Viewed

@@ -120,5 +120,17 @@ guidellm benchmark \
   --rate-type sweep \
   --max-seconds 600 \
   --output-path "Qwen235B-HumanEval.json" \
 </details>

   --rate-type sweep \
   --max-seconds 600 \
   --output-path "Qwen235B-HumanEval.json" \
+```
+GuideLLM interface changed, so for compatibility with the latest version (v0.6.0), please use the following command:
+```bash
+GUIDELLM__PREFERRED_ROUTE="chat_completions" \
+guidellm benchmark \
+  --target "http://localhost:8000/v1" \
+  --data "RedHatAI/speculator_benchmarks" \
+  --data-args '{"data_files": "HumanEval.jsonl"}' \
+  --profile sweep \
+  --max-seconds 1800 \
+  --output-path "my_output.json" \
+  --backend-args '{"extras": {"body": {"temperature":0.6, "top_p":0.95, "top_k":20}}}'
+```
 </details>