causal reward modeling

Team
university

Members: Harman Singh, Pragya Srivastava

models 2

causal-rewards/llama-3.1-8b-sft_ultrachat_200k

Text Generation • 8B • Updated Sep 16 • 6

causal-rewards/gemma2-9b_rm

9B • Updated Apr 21 • 8

datasets 4

causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new

Viewer • Updated Sep 21 • 847 • 20

causal-rewards/ultrafeedback_60658_pref_dataset_original_plus_filtered_improved_degraded_attimp_threshold0p2

Viewer • Updated Jul 3 • 920k • 20

causal-rewards/ultrafeedback_60658_preference_dataset_original_neutrals_filtered_improve-degrade_filtered0p2

Viewer • Updated Apr 22 • 218k • 9

causal-rewards/ultrafeedback-binarized-preferences-cleaned-neutral

Viewer • Updated Apr 16 • 60.9k • 15