Papers Pertinent or Protuberant - a AlekseyCalvin Collection

AlekseyCalvin 's Collections

Papers Pertinent or Protuberant

updated 2 days ago

Upvote

The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models

Paper • 2507.23313 • Published Jul 31, 2025 • 1
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

Paper • 2508.03448 • Published Aug 5, 2025 • 6
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor

Paper • 2508.01311 • Published Aug 2, 2025 • 2
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model

Paper • 2505.21179 • Published May 27, 2025 • 13
Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt

Paper • 2505.09264 • Published May 14, 2025 • 5
How to Reduce Change Detection to Semantic Segmentation

Paper • 2206.07557 • Published Jun 15, 2022
Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection

Paper • 2504.14221 • Published Apr 19, 2025
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection

Paper • 2505.09926 • Published May 15, 2025 • 6
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning

Paper • 2505.09265 • Published May 14, 2025 • 5
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Paper • 2508.04632 • Published Aug 6, 2025 • 2
Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks

Paper • 2507.21974 • Published Jul 29, 2025 • 5
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding

Paper • 2508.01197 • Published Aug 2, 2025 • 5
Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management

Paper • 2508.04664 • Published Aug 6, 2025 • 13
IAUNet: Instance-Aware U-Net

Paper • 2508.01928 • Published Aug 3, 2025 • 9
Position: The Current AI Conference Model is Unsustainable! Diagnosing the Crisis of Centralized AI Conference

Paper • 2508.04586 • Published Aug 6, 2025 • 11
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding

Paper • 2508.02215 • Published Aug 4, 2025 • 12
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Paper • 2507.23785 • Published Jul 31, 2025 • 18
LaTCoder: Converting Webpage Design to Code with Layout-as-Thought

Paper • 2508.03560 • Published Aug 5, 2025 • 24
Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents

Paper • 2508.01858 • Published Aug 3, 2025 • 20
Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 141
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2, 2025 • 240
Attention Basin: Why Contextual Position Matters in Large Language Models

Paper • 2508.05128 • Published Aug 7, 2025 • 4
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode

Paper • 2508.04107 • Published Aug 6, 2025 • 4
Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis

Paper • 2508.04699 • Published Aug 6, 2025 • 2
RPCANet++: Deep Interpretable Robust PCA for Sparse Object Segmentation

Paper • 2508.04190 • Published Aug 6, 2025 • 1
I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating Linguistic Shibboleth Detection in LLM Hiring Evaluations

Paper • 2508.04939 • Published Aug 6, 2025 • 2
REINA: Regularized Entropy Information-Based Loss for Efficient Simultaneous Speech Translation

Paper • 2508.04946 • Published Aug 7, 2025 • 1
I2CR: Intra- and Inter-modal Collaborative Reflections for Multimodal Entity Linking

Paper • 2508.02243 • Published Aug 4, 2025 • 2
Learning to Reason for Factuality

Paper • 2508.05618 • Published Aug 7, 2025 • 7
Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression

Paper • 2508.04979 • Published Aug 7, 2025 • 5
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance

Paper • 2508.01650 • Published Aug 3, 2025 • 6
MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

Paper • 2508.05630 • Published Aug 7, 2025 • 9
Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability

Paper • 2508.04017 • Published Aug 6, 2025 • 11
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?

Paper • 2508.03644 • Published Aug 5, 2025 • 25
A Practical Guide to Fine-tuning Language Models with Limited Data

Paper • 2411.09539 • Published Nov 14, 2024
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks

Paper • 2412.12499 • Published Dec 17, 2024 • 2
Development of Pre-Trained Transformer-based Models for the Nepali Language

Paper • 2411.15734 • Published Nov 24, 2024
Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation

Paper • 2412.13375 • Published Dec 17, 2024
Facilitating large language model Russian adaptation with Learned Embedding Propagation

Paper • 2412.21140 • Published Dec 30, 2024 • 18
BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Paper • 2411.16300 • Published Nov 25, 2024
Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers

Paper • 2601.22139 • Published Jan 29 • 1
Mirroring the Mind: Distilling Human-Like Metacognitive Strategies into Large Language Models

Paper • 2602.22508 • Published Feb 26
Contextual Drag: How Errors in the Context Affect LLM Reasoning

Paper • 2602.04288 • Published Feb 4 • 2
Knowledge Integration Decay in Search-Augmented Reasoning of Large Language Models

Paper • 2602.09517 • Published Feb 10 • 1
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65
IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 53
CREATE: Testing LLMs for Associative Creativity

Paper • 2603.09970 • Published Mar 10 • 15
Language of Thought Shapes Output Diversity in Large Language Models

Paper • 2601.11227 • Published Jan 16 • 10
What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance

Paper • 2602.20300 • Published Feb 23 • 4
No One Size Fits All: QueryBandits for Hallucination Mitigation

Paper • 2602.20332 • Published Feb 23 • 3
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Paper • 2602.14234 • Published Feb 15 • 28
Cognitive Models and AI Algorithms Provide Templates for Designing Language Agents

Paper • 2602.22523 • Published Feb 26 • 1
Agentic Artificial Intelligence (AI): Architectures, Taxonomies, and Evaluation of Large Language Model Agents

Paper • 2601.12560 • Published Jan 18
Shared Nature, Unique Nurture: PRISM for Pluralistic Reasoning via In-context Structure Modeling

Paper • 2602.21317 • Published Feb 24 • 4
DIVERGE: Diversity-Enhanced RAG for Open-Ended Information Seeking

Paper • 2602.00238 • Published Jan 30
dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153
CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models

Paper • 2601.02236 • Published Jan 5
Autoregressive Models Rival Diffusion Models at ANY-ORDER Generation

Paper • 2601.13228 • Published Jan 19
Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?

Paper • 2602.23225 • Published Feb 26
Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training

Paper • 2603.02208 • Published Mar 2 • 4
Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning

Paper • 2601.15160 • Published Jan 21 • 1
Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification

Paper • 2601.22642 • Published Jan 30 • 9
Structured Reasoning for Large Language Models

Paper • 2601.07180 • Published Jan 12 • 1
Milestones over Outcome: Unlocking Geometric Reasoning with Sub-Goal Verifiable Reward

Paper • 2601.05073 • Published Jan 8
P2S: Probabilistic Process Supervision for General-Domain Reasoning Question Answering

Paper • 2601.20649 • Published Jan 28
VERGE: Formal Refinement and Guidance Engine for Verifiable LLM Reasoning

Paper • 2601.20055 • Published Jan 27 • 7
LLM-Guided Quantified SMT Solving over Uninterpreted Functions

Paper • 2601.04675 • Published Jan 8
Decompose-and-Formalise: Recursively Verifiable Natural Language Inference

Paper • 2601.19605 • Published Jan 27
Agentic Proposing: Enhancing Large Language Model Reasoning via Compositional Skill Synthesis

Paper • 2602.03279 • Published Feb 3 • 1
LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval

Paper • 2603.01425 • Published Mar 2 • 7
Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Paper • 2601.21358 • Published Jan 29 • 7
Latent Thoughts Tuning: Bridging Context and Reasoning with Fused Information in Latent Tokens

Paper • 2602.10229 • Published Feb 10 • 5
Beyond Dense States: Elevating Sparse Transcoders to Active Operators for Latent Reasoning

Paper • 2602.01695 • Published Feb 2
OpenAutoNLU: Open Source AutoML Library for NLU

Paper • 2603.01824 • Published Mar 2 • 50
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Paper • 2603.02138 • Published Mar 2 • 151
Transformers converge to invariant algorithmic cores

Paper • 2602.22600 • Published Feb 26 • 3
Spilled Energy in Large Language Models

Paper • 2602.18671 • Published Feb 21 • 12
Humans and LLMs Diverge on Probabilistic Inferences

Paper • 2602.23546 • Published Feb 26 • 13
PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference

Paper • 2603.02479 • Published Mar 3 • 20
MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning

Paper • 2603.03379 • Published Mar 3 • 32
Distribution-Conditioned Transport

Paper • 2603.04736 • Published Mar 5 • 3
SageBwd: A Trainable Low-bit Attention

Paper • 2603.02170 • Published Mar 2 • 19
Large Multimodal Models as General In-Context Classifiers

Paper • 2602.23229 • Published Feb 26 • 26
nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space

Paper • 2603.04948 • Published Mar 5 • 2
Mario: Multimodal Graph Reasoning with Large Language Models

Paper • 2603.05181 • Published Mar 5 • 10
Reasoning Models Struggle to Control their Chains of Thought

Paper • 2603.05706 • Published Mar 5 • 38
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

Paper • 2603.05438 • Published Mar 5 • 40
ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer

Paper • 2603.03583 • Published Mar 3 • 2
Distilling Token-Trained Models into Byte-Level Models

Paper • 2602.01007 • Published Feb 1
Proxy Compression for Language Modeling

Paper • 2602.04289 • Published Feb 4 • 3
Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training

Paper • 2603.07223 • Published Mar 7 • 13
Believe Your Model: Distribution-Guided Confidence Calibration

Paper • 2603.03872 • Published Mar 4 • 40
Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

Paper • 2603.09221 • Published Mar 10
ReasonCACHE: Teaching LLMs To Reason Without Weight Updates

Paper • 2602.02366 • Published Feb 2 • 1
Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning

Paper • 2603.10377 • Published Mar 11 • 3
Lost in Backpropagation: The LM Head is a Gradient Bottleneck

Paper • 2603.10145 • Published Mar 10 • 13
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published Feb 2 • 32
SimpleGPT: Improving GPT via A Simple Normalization Strategy

Paper • 2602.01212 • Published Feb 1 • 3
Prism-Δ: Differential Subspace Steering for Prompt Highlighting in Large Language Models

Paper • 2603.10705 • Published Mar 11 • 11
Spectral Attention Steering for Prompt Highlighting

Paper • 2603.01281 • Published Mar 1 • 7
YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation

Paper • 2601.08441 • Published Jan 13 • 8
ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Paper • 2603.10160 • Published Mar 10 • 26
LLM2Vec-Gen: Generative Embeddings from Large Language Models

Paper • 2603.10913 • Published Mar 11 • 44
OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 154
How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback

Paper • 2501.15378 • Published Jan 26, 2025
Millions of GeAR-s: Extending GraphRAG to Millions of Documents

Paper • 2507.17399 • Published Jul 23, 2025
RAG vs. GraphRAG: A Systematic Evaluation and Key Insights

Paper • 2502.11371 • Published Feb 17, 2025
PROPEX-RAG: Enhanced GraphRAG using Prompt-Driven Prompt Execution

Paper • 2511.01802 • Published Nov 3, 2025 • 2
HELP: HyperNode Expansion and Logical Path-Guided Evidence Localization for Accurate and Efficient GraphRAG

Paper • 2602.20926 • Published Feb 24 • 3
PolyG: Effective and Efficient GraphRAG with Adaptive Graph Traversal

Paper • 2504.02112 • Published Apr 2, 2025 • 2
GraphRAG-R1: Graph Retrieval-Augmented Generation with Process-Constrained Reinforcement Learning

Paper • 2507.23581 • Published Jul 31, 2025
Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration

Paper • 2601.11144 • Published Jan 16 • 3
Enhancing Startup Success Predictions in Venture Capital: A GraphRAG Augmented Multivariate Time Series Method

Paper • 2408.09420 • Published Aug 18, 2024
NerVE: Nonlinear Eigenspectrum Dynamics in LLM Feed-Forward Networks

Paper • 2603.06922 • Published Mar 6 • 2
Divergent-Convergent Thinking in Large Language Models for Creative Problem Generation

Paper • 2512.23601 • Published Dec 29, 2025
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

Paper • 2603.11076 • Published Mar 10 • 5
Training Language Models via Neural Cellular Automata

Paper • 2603.10055 • Published Mar 9 • 8
Tiny Aya: Bridging Scale and Multilingual Depth

Paper • 2603.11510 • Published Mar 12 • 8
Attention Sinks Are Provably Necessary in Softmax Transformers: Evidence from Trigger-Conditional Tasks

Paper • 2603.11487 • Published Mar 12 • 2
WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora

Paper • 2602.02053 • Published Feb 2 • 41
Fine-Grained Activation Steering: Steering Less, Achieving More

Paper • 2602.04428 • Published Feb 4
A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs

Paper • 2505.23006 • Published May 29, 2025 • 1
XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published Mar 12 • 33
mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 326
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Paper • 2512.23988 • Published Dec 30, 2025 • 19
Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data

Paper • 2601.22141 • Published Jan 29 • 4
Linear representations in language models can change dramatically over a conversation

Paper • 2601.20834 • Published Jan 28 • 21
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 550
AutoRefine: From Trajectories to Reusable Expertise for Continual LLM Agent Refinement

Paper • 2601.22758 • Published Jan 30 • 1
Spectral Surgery: Training-Free Refinement of LoRA via Gradient-Guided Singular Value Reweighting

Paper • 2603.03995 • Published Mar 4
Poem Meter Classification of Recited Arabic Poetry: Integrating High-Resource Systems for a Low-Resource Task

Paper • 2504.12172 • Published Apr 16, 2025
Alignment Makes Language Models Normative, Not Descriptive

Paper • 2603.17218 • Published Mar 17 • 46
Delta-K: Boosting Multi-Instance Generation via Cross-Attention Augmentation

Paper • 2603.10210 • Published Mar 10

Upvote

Collection guide
Browse collections