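config = {
    "d_model": 1180,        # hidden width
    "n_rwkv_layers": 16,    # RWKV blocks
    "n_attn_layers": 4,     # attention blocks
    "n_heads": 10,          # 1180 / 10 = 118 dims per head
    "seq_len": 1024,        # sequence length
    "batch_size": 4,
    "accum_steps": 8,       # gradient accumulation steps
    "lr": 4e-4,
}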
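A few quantities follow directly from these values, and the sketch below works them out. It is only illustrative: the helper name `derived_quantities` is hypothetical, and it assumes that `batch_size` counts sequences per micro-batch on a single device, that `seq_len` is measured in tokens, and that `accum_steps` micro-batches are accumulated before each optimizer step.

```python
# Values copied from the config above.
config = {
    "d_model": 1180,
    "n_rwkv_layers": 16,
    "n_attn_layers": 4,
    "n_heads": 10,
    "seq_len": 1024,
    "batch_size": 4,
    "accum_steps": 8,
    "lr": 4e-4,
}


def derived_quantities(cfg):
    """Hypothetical helper: compute sizes implied by the config.

    Assumes a single device and that batch_size is the micro-batch size.
    """
    if cfg["d_model"] % cfg["n_heads"] != 0:
        raise ValueError("d_model must be divisible by n_heads")

    # Sequences contributing to one optimizer step after accumulation.
    effective_batch = cfg["batch_size"] * cfg["accum_steps"]

    return {
        "head_dim": cfg["d_model"] // cfg["n_heads"],
        "total_layers": cfg["n_rwkv_layers"] + cfg["n_attn_layers"],
        "effective_batch": effective_batch,
        "tokens_per_step": effective_batch * cfg["seq_len"],
    }


if __name__ == "__main__":
    for name, value in derived_quantities(config).items():
        print(f"{name}: {value}")
```

Under those assumptions, each optimizer step sees 32 sequences, so raising `accum_steps` is the way to enlarge the effective batch without increasing per-step memory.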
Model size: 0.4B params (Safetensors, tensor type F32)