Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs Paper • 2511.17220 • Published 16 days ago • 16
view article Article Agent2Agent and MCP: An End-to-End Tutorial for a complete Agentic Pipeline Apr 29 • 12
emrecan/bert-base-turkish-cased-mean-nli-stsb-tr Sentence Similarity • Updated Jan 24, 2022 • 21.1k • • 46
view article Article Guided Decoding and Its Critical Role in Retrieval-Augmented Generation: A Deep Dive into Structured LLM Outputs Sep 8 • 16
view article Article Theoretical Limitations of Embedding Models and Their Applications in Turkish: An In-Depth Look Sep 4 • 15
view article Article Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications Aug 29 • 27
view post Post 3325 QwQ-32B is amazing!It ranks below o1-preview, but beats DeepSeek v3 and all Gemini models. onekq-ai/WebApp1K-models-leaderboardNow we have such a powerful model that can fit into a single GPU, can someone finetune a web app model to push SOTA of my leaderboard? 🤗 See translation 1 reply · 👍 11 11 + Reply
Running Featured 547 Open Source Ai Year In Review 2024 😻 547 What happened in open-source AI this year, and what’s next?