---

# Model Card for Model ID

A 4-bit double-quantized version of [ernestoBocini/Phi3-mini-DPO-Tuned](https://huggingface.co/ernestoBocini/Phi3-mini-DPO-Tuned).

## Model Details

This is a Phi-3-mini-4k-instruct model fine-tuned with SFT and DPO on STEM domains, then quantized to 4-bit precision, to serve as an AI university tutor.

Quantization config used:

```python
BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16
)
```
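For reference, a minimal sketch of applying this config when loading the checkpoint with `transformers` (assumptions: `bitsandbytes` and a CUDA-capable GPU are available, the base repo linked above is used as the model id, and the prompt is purely illustrative):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Same NF4 double-quantization config as shown in the card above.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_id = "ernestoBocini/Phi3-mini-DPO-Tuned"  # base checkpoint linked above
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,  # weights are quantized to NF4 on load
    device_map="auto",
)

# Illustrative prompt, in the spirit of the AI-tutor use case described above.
prompt = "Explain gradient descent in two sentences."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=80)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

With `bnb_4bit_use_double_quant=True`, the quantization constants themselves are quantized, saving roughly an extra 0.4 bits per parameter; `bfloat16` is used as the compute dtype for the dequantized matmuls.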