Companionship Leaderboard 🥇: AI companionship leaderboard based on the INTIMA benchmark
Training language models to be warm and empathetic makes them less reliable and more sycophantic (Paper • 2507.21919 • Published Jul 29)