WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models
Paper • 2112.06598 • Published • 1
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Quantization made by Richard Erkhov.
gpt2-large-wechsel-ukrainian - bnb 4bits
gpt2-large transferred to Ukrainian using the method from the NAACL2022 paper WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.