Andyrasika/TweetSumm-tuned
Viewer • Updated • 1.1k • 31 • 1
How to use Dikshant182004/t5-base-lora-finetune-tweetsumm-1759926273 with PEFT:
from peft import PeftModel
from transformers import AutoModelForSeq2SeqLM
base_model = AutoModelForSeq2SeqLM.from_pretrained("google-t5/t5-base")
model = PeftModel.from_pretrained(base_model, "Dikshant182004/t5-base-lora-finetune-tweetsumm-1759926273")How to use Dikshant182004/t5-base-lora-finetune-tweetsumm-1759926273 with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("Dikshant182004/t5-base-lora-finetune-tweetsumm-1759926273", dtype="auto")This model is a fine-tuned version of google-t5/t5-base on the Andyrasika/TweetSumm-tuned dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len | F1 | Precision | Recall |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2.0742 | 1.0 | 110 | 1.8740 | 0.4377 | 0.2001 | 0.361 | 0.4021 | 49.3273 | 0.8863 | 0.8819 | 0.8909 |
| 1.8448 | 2.0 | 220 | 1.8243 | 0.4569 | 0.2143 | 0.3752 | 0.4211 | 47.3818 | 0.8916 | 0.8886 | 0.8949 |
| 1.5995 | 3.0 | 330 | 1.8100 | 0.4551 | 0.2106 | 0.3742 | 0.4167 | 47.5636 | 0.8917 | 0.8879 | 0.8958 |
Base model
google-t5/t5-base