Yueyi Sun

bitersun


AI & ML interests

None yet

Recent Activity


upvoted a paper about 19 hours ago

Improved Large Language Diffusion Models

Paper • 2606.25331 • Published 2 days ago • 30

liked a model 4 days ago

MSALab/PerceptionDLM

Image-Text-to-Text • 9B • Updated 7 days ago • 49 • 7

submitted a paper to Daily Papers 4 days ago

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Paper • 2606.19534 • Published 9 days ago • 61

commented on a paper 6 days ago

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Paper • 2606.19534 • Published 9 days ago • 61

authored a paper 6 days ago

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Paper • 2606.19534 • Published 9 days ago • 61

upvoted a paper 7 days ago

PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models

Paper • 2606.19534 • Published 9 days ago • 61

upvoted a collection 7 days ago

PerceptionDLM Model Zoo

Hugging Face Model Zoo for PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models • 6 items • Updated 6 days ago • 4

liked a model 7 days ago

MSALab/PerceptionDLM-Base

Image-Text-to-Text • 9B • Updated 7 days ago • 56 • 4

upvoted a collection 2 months ago

DeepSeek-V4

4 items • Updated Apr 24 • 693

liked 2 datasets 4 months ago

nohurry/Opus-4.6-Reasoning-3000x-filtered

Viewer • Updated Mar 31 • 2.33k • 1.81k • 623

TeichAI/claude-4.5-opus-high-reasoning-250x

Viewer • Updated Nov 28, 2025 • 250 • 919 • 397

upvoted 3 papers 8 months ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published Jan 7, 2025 • 50

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 56

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

updated a dataset 8 months ago

bitersun/Sa2VA-finetune-example

Viewer • Updated Oct 20, 2025 • 10 • 33 • 1

published a dataset 8 months ago

bitersun/Sa2VA-finetune-example

Viewer • Updated Oct 20, 2025 • 10 • 33 • 1

liked a model 10 months ago

ByteDance/Sa2VA-4B

Image-Text-to-Text • 4B • Updated Sep 8, 2025 • 1.76k • 98

upvoted 2 collections 10 months ago

Sa2VA Model Zoo

Hugging Face Model Zoo for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos, by ByteDance Seed CV Research • 12 items • Updated Nov 27, 2025 • 46

Multimodal LLM

370 items • Updated Feb 7 • 51

upvoted a paper 12 months ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10, 2025 • 51