How to use

To use Eagle3 with SGLang, first replace the qwen3.py file in SGLang's models directory (sglang/python/sglang/srt/models/) with the qwen3.py file from this project.
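The replacement step can be sketched as a small shell helper that keeps a backup of the original file before overwriting it. This is only a sketch: the function name `replace_qwen3` and both of its arguments are hypothetical, and the paths depend on where SGLang is checked out or installed on your machine.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of SGLang or this project):
# back up SGLang's qwen3.py once, then copy in this project's version.
replace_qwen3() {
  local models_dir="$1"   # assumed: path to sglang/python/sglang/srt/models
  local new_file="$2"     # assumed: path to this project's qwen3.py

  if [ ! -f "$new_file" ]; then
    echo "replacement file not found: $new_file" >&2
    return 1
  fi
  if [ ! -d "$models_dir" ]; then
    echo "models directory not found: $models_dir" >&2
    return 1
  fi

  # Keep the original so the change can be reverted; never overwrite
  # an existing backup on a second run.
  if [ -f "$models_dir/qwen3.py" ] && [ ! -e "$models_dir/qwen3.py.bak" ]; then
    cp "$models_dir/qwen3.py" "$models_dir/qwen3.py.bak"
  fi
  cp "$new_file" "$models_dir/qwen3.py"
}

# Example call (placeholder paths, adjust to your layout):
# replace_qwen3 sglang/python/sglang/srt/models ./qwen3.py
```

To revert, copy qwen3.py.bak back over qwen3.py in the same directory.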

The launch command for using Eagle3 with SGLang is:

python3 -m sglang.launch_server \
    --model Qwen/Qwen3-4B-Instruct-2507 \
    --speculative-algorithm EAGLE3 \
    --speculative-draft-model-path Tengyunw/qwen3_4b_eagle3 \
    --speculative-num-steps 6 \
    --speculative-eagle-topk 10 \
    --speculative-num-draft-tokens 32 \
    --mem-fraction 0.9 \
    --cuda-graph-max-bs 2 \
    --dtype bfloat16

Model size: 0.2B params (Safetensors; tensor types I64, BF16, BOOL)