Instructions to use HannaAbiAkl/flan-t5-small-wordnet with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use HannaAbiAkl/flan-t5-small-wordnet with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="HannaAbiAkl/flan-t5-small-wordnet")

# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("HannaAbiAkl/flan-t5-small-wordnet")
model = AutoModelForSeq2SeqLM.from_pretrained("HannaAbiAkl/flan-t5-small-wordnet")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use HannaAbiAkl/flan-t5-small-wordnet with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "HannaAbiAkl/flan-t5-small-wordnet"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "HannaAbiAkl/flan-t5-small-wordnet",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/HannaAbiAkl/flan-t5-small-wordnet

SGLang

How to use HannaAbiAkl/flan-t5-small-wordnet with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "HannaAbiAkl/flan-t5-small-wordnet" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "HannaAbiAkl/flan-t5-small-wordnet",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "HannaAbiAkl/flan-t5-small-wordnet" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "HannaAbiAkl/flan-t5-small-wordnet",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use HannaAbiAkl/flan-t5-small-wordnet with Docker Model Runner:
```
docker model run hf.co/HannaAbiAkl/flan-t5-small-wordnet
```

FLAN-T5 small-WordNet

This model is a fine-tuned version of flan-t5-small on the WordNet dataset.

Model description

The model is trained to classify terms into one of four term types: noun, verb, adjective or adverb. The types themselves are learned and then generated by the model with no more than one type associated with a specific term.

The model also works well as part of a Retrieval-and-Generation (RAG) pipeline by leveraging an external knowledge source, specifically Wordnet Semantic Primes.

Intended uses and limitations

This model is intended to be used to generate a type (class) for an input term.

Training and evaluation data

The training and evaluation data can be found here.

The train size is 40559.

The test size is 9470.

Example

Here's an example of the model capabilities:

input:
- Lexical Term L: question
- Sentence Containing L (Optional): there was a question about my training
output:
- Type: noun
input:
- Lexical Term L: lodge
- Sentence Containing L (Optional): Where are you lodging in Paris?
output:
- Type: verb
input:
- Lexical Term L: genus equisetum
- Sentence Containing L (Optional):
output:
- Type: noun

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 4
eval_batch_size: 4
seed: 42
num_epochs: 5

Training results

Training Loss	Epoch	Step	Validation Loss
0.1725	1.0	1000	0.0640
0.1250	2.0	2000	0.0535
0.1040	3.0	3000	0.0469
0.0917	4.0	4000	0.0421
0.0830	5.0	5000	0.0384

@misc{akl2024dstillms4ol2024task,
      title={DSTI at LLMs4OL 2024 Task A: Intrinsic versus extrinsic knowledge for type classification}, 
      author={Hanna Abi Akl},
      year={2024},
      eprint={2408.14236},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2408.14236}, 
}

Downloads last month: 12

Safetensors

Model size

77M params

Tensor type

F32

Paper for HannaAbiAkl/flan-t5-small-wordnet

DSTI at LLMs4OL 2024 Task A: Intrinsic versus extrinsic knowledge for type classification

Paper • 2408.14236 • Published Aug 26, 2024 • 5