Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

mesolitica
/
malaysian-mistral-191M-MLM-512

Feature Extraction
Transformers
Safetensors
Malay
mistral
custom_code
text-embeddings-inference
Model card Files Files and versions
xet
Community

Instructions to use mesolitica/malaysian-mistral-191M-MLM-512 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use mesolitica/malaysian-mistral-191M-MLM-512 with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline
    
    pipe = pipeline("feature-extraction", model="mesolitica/malaysian-mistral-191M-MLM-512", trust_remote_code=True)
    # Load model directly
    from transformers import AutoTokenizer, AutoModel
    
    tokenizer = AutoTokenizer.from_pretrained("mesolitica/malaysian-mistral-191M-MLM-512", trust_remote_code=True)
    model = AutoModel.from_pretrained("mesolitica/malaysian-mistral-191M-MLM-512", trust_remote_code=True)
  • Notebooks
  • Google Colab
  • Kaggle
  • Malaysian Mistral 191M on MLM task using 512 context length

Malaysian Mistral 191M on MLM task using 512 context length

Replicating https://github.com/McGill-NLP/llm2vec using https://huggingface.co/mesolitica/malaysian-mistral-191M-4096, done by https://github.com/aisyahrzk https://twitter.com/aisyahhhrzk

Source code at https://github.com/mesolitica/malaya/tree/master/session/llm2vec

WandB, https://wandb.ai/aisyahrazak/mistral-191M-mlm?nw=nwuseraisyahrazak

Downloads last month
4
Safetensors
Model size
0.2B params
Tensor type
F32
Β·
Inference Providers NEW
Feature Extraction
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Spaces using mesolitica/malaysian-mistral-191M-MLM-512 3

πŸ“š
eduagarcia/multilingual-tokenizer-leaderboard
πŸ•Œ
HalalFoodNLP/flask-halalnlp
πŸŒ–
HalalFoodNLP/halalnlp

Collections including mesolitica/malaysian-mistral-191M-MLM-512

Malaysian MaskLM

Collection
Trained on 17B tokens, 81GB of cleaned texts, able to understand standard Malay, local Malay, local Mandarin, Manglish, and local Tamil. β€’ 7 items β€’ Updated Jun 24, 2025

Malaysian LLM2Vec

Collection
Extending Malaysian CausalLM on non-causal masking training, https://arxiv.org/abs/2404.05961 β€’ 5 items β€’ Updated Jun 24, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs