Unsupervised Welding Defect Detection Using Audio And Video Paper • 2409.02290 • Published Sep 3, 2024 • 3
Inference Performance Optimization for Large Language Models on CPUs Paper • 2407.07304 • Published Jul 10, 2024 • 53
Shears: Unstructured Sparsity with Neural Low-rank Adapter Search Paper • 2404.10934 • Published Apr 16, 2024
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs Paper • 2306.16601 • Published Jun 28, 2023 • 4