m2v-e5-small-european
A Model2Vec static embedding model distilled from intfloat/multilingual-e5-small (118M params), pruned to European languages only.
Pruning removed 36.5% of the vocabulary: tokens in non-European scripts, including Chinese, Japanese, Korean, Arabic, Hebrew, Thai, and Devanagari.
| | Before pruning | After pruning |
|---|---|---|
| Vocabulary | 249,999 tokens | 158,843 tokens |
| Embedding dim | 256 | 256 |
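The card does not say how tokens were selected for removal. The sketch below shows one plausible approach, not necessarily the one used here: keep a token only if all of its letters are Latin, Greek, or Cyrillic. The script list, the `is_european_token` helper, and the toy vocabulary are all illustrative assumptions. The last lines recompute the pruned share from the table figures.

```python
import unicodedata

# Assumption: "European" is approximated by these three scripts.
EUROPEAN_SCRIPT_PREFIXES = ("LATIN", "GREEK", "CYRILLIC")


def is_european_token(token: str) -> bool:
    """Return True if every letter in the token belongs to an allowed script."""
    for ch in token:
        if not ch.isalpha():
            # Digits, punctuation, and word-boundary markers such as "▁" are kept.
            continue
        # Unicode character names start with the script name, e.g. "LATIN SMALL LETTER A".
        name = unicodedata.name(ch, "")
        if not name.startswith(EUROPEAN_SCRIPT_PREFIXES):
            return False
    return True


# Toy vocabulary for illustration only; not taken from the real tokenizer.
toy_vocab = ["▁shower", "▁Duschgel", "▁дезодорант", "▁水", "▁한국", "123", "▁مرحبا"]
kept = [tok for tok in toy_vocab if is_european_token(tok)]

# Share of the vocabulary removed, using the token counts from the table.
before, after = 249_999, 158_843
pruned_fraction = (before - after) / before
print(kept)
print(f"{pruned_fraction:.1%}")
```

A name-prefix check like this is deliberately crude: it would, for example, reject fullwidth Latin letters, so a real pruning script would likely need a more careful script lookup.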
Usage
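from model2vec import StaticModel
# Load the distilled static model from the Hugging Face Hub
model = StaticModel.from_pretrained("flipbitsnotburgers/m2v-e5-small-european")
# Encode a few product terms; the result holds one 256-dimensional vector per input
embeddings = model.encode(["deodorant", "Duschgel", "shower gel"])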
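A common next step is to compare the resulting vectors with cosine similarity, for instance to check whether "Duschgel" lands closer to "shower gel" than to "deodorant". The sketch below is a minimal pure-Python version. The three-dimensional vectors are made-up stand-ins so the example runs without downloading the model; they are not outputs of this model, and `cosine_similarity` is a helper defined here, not part of Model2Vec.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Made-up 3-dimensional vectors for illustration; real rows of `embeddings`
# from model.encode would have 256 dimensions.
toy_embeddings = {
    "Duschgel": [0.9, 0.1, 0.0],
    "shower gel": [0.8, 0.2, 0.1],
    "deodorant": [0.1, 0.9, 0.2],
}

sim_translation = cosine_similarity(toy_embeddings["Duschgel"], toy_embeddings["shower gel"])
sim_other = cosine_similarity(toy_embeddings["Duschgel"], toy_embeddings["deodorant"])
print(sim_translation > sim_other)
```

With real model output, the same function can be applied to any two rows of `embeddings`, since iterating over a row yields its components.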
License
MIT (same as base model)