AlekseyCalvin's Collections: Papers Pertinent or Protuberant
The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in
Text-to-Image Models
Paper
• 2507.23313
• Published
• 1
SonicMaster: Towards Controllable All-in-One Music Restoration and
Mastering
Paper
• 2508.03448
• Published
• 6
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with
Learnable Advisor
Paper
• 2508.01311
• Published
• 2
Normalized Attention Guidance: Universal Negative Guidance for Diffusion
Model
Paper
• 2505.21179
• Published
• 13
Learning to Detect Multi-class Anomalies with Just One Normal Image
Prompt
Paper
• 2505.09264
• Published
• 5
How to Reduce Change Detection to Semantic Segmentation
Paper
• 2206.07557
• Published
Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly
Detection
Paper
• 2504.14221
• Published
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
Paper
• 2505.09926
• Published
• 6
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning
Paper
• 2505.09265
• Published
• 5
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with
Verifiable Rewards
Paper
• 2508.04632
• Published
• 2
Reasoning Language Models for Root Cause Analysis in 5G Wireless
Networks
Paper
• 2507.21974
• Published
• 5
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding
Paper
• 2508.01197
• Published
• 5
Sculptor: Empowering LLMs with Cognitive Agency via Active Context
Management
Paper
• 2508.04664
• Published
• 13
IAUNet: Instance-Aware U-Net
Paper
• 2508.01928
• Published
• 9
Position: The Current AI Conference Model is Unsustainable! Diagnosing
the Crisis of Centralized AI Conference
Paper
• 2508.04586
• Published
• 12
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding
Paper
• 2508.02215
• Published
• 12
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D
Synthesis
Paper
• 2507.23785
• Published
• 18
LaTCoder: Converting Webpage Design to Code with Layout-as-Thought
Paper
• 2508.03560
• Published
• 24
Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web
Agents
Paper
• 2508.01858
• Published
• 20
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper
• 2508.03680
• Published
• 137
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
Paper
• 2508.01191
• Published
• 238
Attention Basin: Why Contextual Position Matters in Large Language
Models
Paper
• 2508.05128
• Published
• 4
Unlocking the Potential of MLLMs in Referring Expression Segmentation
via a Light-weight Mask Decoder
Paper
• 2508.04107
• Published
• 4
Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during
Multi-Hop Analysis
Paper
• 2508.04699
• Published
• 2
RPCANet++: Deep Interpretable Robust PCA for Sparse Object Segmentation
Paper
• 2508.04190
• Published
• 1
I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating
Linguistic Shibboleth Detection in LLM Hiring Evaluations
Paper
• 2508.04939
• Published
• 2
REINA: Regularized Entropy Information-Based Loss for Efficient
Simultaneous Speech Translation
Paper
• 2508.04946
• Published
• 1
I2CR: Intra- and Inter-modal Collaborative Reflections for Multimodal
Entity Linking
Paper
• 2508.02243
• Published
• 2
Learning to Reason for Factuality
Paper
• 2508.05618
• Published
• 6
Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast
Image Compression
Paper
• 2508.04979
• Published
• 5
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance
Paper
• 2508.01650
• Published
• 6
MOSEv2: A More Challenging Dataset for Video Object Segmentation in
Complex Scenes
Paper
• 2508.05630
• Published
• 9
Can Large Multimodal Models Actively Recognize Faulty Inputs? A
Systematic Evaluation Framework of Their Input Scrutiny Ability
Paper
• 2508.04017
• Published
• 11
Are We on the Right Way for Assessing Document Retrieval-Augmented
Generation?
Paper
• 2508.03644
• Published
• 25
A Practical Guide to Fine-tuning Language Models with Limited Data
Paper
• 2411.09539
• Published
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for
Low-Resource Language Tasks
Paper
• 2412.12499
• Published
• 1
Development of Pre-Trained Transformer-based Models for the Nepali
Language
Paper
• 2411.15734
• Published
Extending LLMs to New Languages: A Case Study of Llama and Persian
Adaptation
Paper
• 2412.13375
• Published
Facilitating large language model Russian adaptation with Learned
Embedding Propagation
Paper
• 2412.21140
• Published
• 18
BayLing 2: A Multilingual Large Language Model with Efficient Language
Alignment
Paper
• 2411.16300
• Published