Fairness in Streaming Submodular Maximization over a Matroid Constraint Paper • 2305.15118 • Published May 24, 2023
GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance Paper • 2505.07004 • Published May 11 • 7
LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging Paper • 2406.12837 • Published Jun 18, 2024