Inference Providers
Active filters: RLHF
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
• Updated • 9.97k
• • 247
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
7B • Updated • 5.57k
• 253
mradermacher/OpenBioLLM-Llama3-8B-GGUF
8B • Updated • 195
• 2
NousResearch/Hermes-2-Pro-Llama-3-8B
Text Generation
• 8B • Updated • 20.6k
• • 453
NousResearch/Hermes-2-Theta-Llama-3-8B
Text Generation
• 8B • Updated • 11.1k
• • 207
ruslanmv/Medical-Llama3-v2
Text Generation
• 8B • Updated • 35
• • 14
OpenAssistant/reward-model-deberta-v3-base
Text Classification
• Updated • 11.3k
• • 13
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
• Updated • 40
• 5
OpenAssistant/reward-model-deberta-v3-large
Text Classification
• Updated • 1.74k
• • 26
Text Ranking
• 0.4B • Updated • 11
• 4
nicholasKluge/RewardModelPT
Text Classification
• 0.1B • Updated • 164
nicholasKluge/RewardModel
Text Classification
• 0.1B • Updated • 190
• 2
fb700/chatglm-fitness-RLHF
Updated • 268
fb700/Bofan-chatglm-Best-lora
Updated • 15
• 11
kubernetes-bad/Ligma-L2-13b
Updated • 6
• 3
Text Generation
• 0.4B • Updated • 224
• 209
berkeley-nest/Starling-LM-7B-alpha
Text Generation
• 7B • Updated • 1.87k
• • 560
berkeley-nest/Starling-RM-7B-alpha
Updated • 67
• 105
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2
Text Generation
• Updated • 3
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2
Text Generation
• Updated • 7
• 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2
Text Generation
• Updated • 7
• 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2
Text Generation
• Updated • 6
• 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2
Text Generation
• Updated • 6
• 2
TheBloke/Starling-LM-7B-alpha-GGUF
7B • Updated • 1.1k
• 94
TheBloke/Starling-LM-7B-alpha-AWQ
Text Generation
• 7B • Updated • 9
• 9
second-state/Starling-LM-7B-alpha-GGUF
Text Generation
• 7B • Updated • 220
• 3
TheBloke/Starling-LM-7B-alpha-GPTQ
Text Generation
• 7B • Updated • 17
• 10
bartowski/Starling-LM-7B-alpha-old-exl2
Text Generation
• Updated tastypear/chatglm-fitness-RLHF-GGML
CallComply/Starling-LM-11B-alpha
Text Generation
• 11B • Updated • 101
• 15