Red Hat AI

company

Verified

https://www.redhat.com/en/products/ai

RedHat_AI

AI & ML interests

OpenSource and AI

Recent Activity

alexmarques updated a model 37 minutes ago

RedHatAI/Qwen3-Next-80B-A3B-Instruct-FP8

alexmarques updated a model 39 minutes ago

RedHatAI/Qwen3-Next-80B-A3B-Instruct-FP8

krishnateja95 authored a paper 2 months ago

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models

View all activity

alexmarques

updated a model 37 minutes ago

RedHatAI/Qwen3-Next-80B-A3B-Instruct-FP8

Text Generation • 81B • Updated 37 minutes ago • 207

krishnateja95

authored 4 papers 2 months ago

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models

Paper • 2508.17467 • Published Aug 24

PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference

Paper • 2509.04377 • Published Sep 4

LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference

Paper • 2509.02753 • Published Sep 2

ImageNet-Think-250K: A Large-Scale Synthetic Dataset for Multimodal Reasoning for Vision Language Models

Paper • 2510.01582 • Published Oct 2

ligongh

authored 6 papers 12 months ago

BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models

Paper • 2406.11675 • Published Jun 17, 2024 • 1

Implicit In-context Learning

Paper • 2405.14660 • Published May 23, 2024

Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models

Paper • 2405.21050 • Published May 31, 2024

SINE: SINgle Image Editing with Text-to-Image Diffusion Models

Paper • 2212.04489 • Published Dec 8, 2022

DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction

Paper • 2308.09223 • Published Aug 18, 2023 • 1

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published Dec 31, 2024 • 31

HuaminChen

authored a paper about 1 year ago

Building Trust: Foundations of Security, Safety and Transparency in AI

Paper • 2411.12275 • Published Nov 19, 2024 • 11

markurtz

authored 4 papers about 1 year ago

How Well Do Sparse Imagenet Models Transfer?

Paper • 2111.13445 • Published Nov 26, 2021 • 1

The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models

Paper • 2203.07259 • Published Mar 14, 2022 • 4

Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

Paper • 2405.03594 • Published May 6, 2024 • 7

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 51

alexmarques

authored a paper about 1 year ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 51

ekurtic

authored 3 papers about 1 year ago

Error Feedback Can Accurately Compress Preconditioners

Paper • 2306.06098 • Published Jun 9, 2023

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence

Paper • 2405.15593 • Published May 24, 2024 • 1

Panza: A Personalized Text Writing Assistant via Data Playback and Local Fine-Tuning

Paper • 2407.10994 • Published Jun 24, 2024 • 2