MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models Paper • 2508.17467 • Published Aug 24
PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference Paper • 2509.04377 • Published Sep 4
LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference Paper • 2509.02753 • Published Sep 2
ImageNet-Think-250K: A Large-Scale Synthetic Dataset for Multimodal Reasoning for Vision Language Models Paper • 2510.01582 • Published Oct 2
BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models Paper • 2406.11675 • Published Jun 17, 2024 • 1
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models Paper • 2405.21050 • Published May 31, 2024
SINE: SINgle Image Editing with Text-to-Image Diffusion Models Paper • 2212.04489 • Published Dec 8, 2022
DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction Paper • 2308.09223 • Published Aug 18, 2023 • 1
MLLM-as-a-Judge for Image Safety without Human Labeling Paper • 2501.00192 • Published Dec 31, 2024 • 31
Building Trust: Foundations of Security, Safety and Transparency in AI Paper • 2411.12275 • Published Nov 19, 2024 • 11
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models Paper • 2203.07259 • Published Mar 14, 2022 • 4
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment Paper • 2405.03594 • Published May 6, 2024 • 7
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published Nov 4, 2024 • 51
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published Nov 4, 2024 • 51
MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence Paper • 2405.15593 • Published May 24, 2024 • 1
Panza: A Personalized Text Writing Assistant via Data Playback and Local Fine-Tuning Paper • 2407.10994 • Published Jun 24, 2024 • 2