Celestine Floquet
Celestine-floquet
AI & ML interests
cute voice models, anime waifus
Recent Activity
upvoted a paper about 13 hours ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
upvoted a paper 5 days ago
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
liked a Space 8 days ago
ipepe/nomic-embeddings