fix: update base model link
Browse files
README.md
CHANGED
|
@@ -15,7 +15,7 @@ datasets:
|
|
| 15 |
|
| 16 |
## Model Description
|
| 17 |
|
| 18 |
-
StableVicuna-13B is a [Vicuna-13B v0](https://
|
| 19 |
|
| 20 |
### Apply Delta Weights
|
| 21 |
|
|
|
|
| 15 |
|
| 16 |
## Model Description
|
| 17 |
|
| 18 |
+
StableVicuna-13B is a [Vicuna-13B v0](https://huggingface.co/lmsys/vicuna-13b-delta-v0) model fine-tuned using reinforcement learning from human feedback (RLHF) via Proximal Policy Optimization (PPO) on various conversational and instructional datasets.
|
| 19 |
|
| 20 |
### Apply Delta Weights
|
| 21 |
|