Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Krishna Teja Chitty-Venkata's picture
Building on HF
4

Krishna Teja Chitty-Venkata

krishnateja95
RedHatAI
memani's profile picture sraskar's profile picture 21world's profile picture
·
https://krishnateja95.github.io/
  • krishnateja95
  • kt95

AI & ML interests

LLM Optimization, Neural Architecture Search, Quantization, Pruning

Recent Activity

updated a collection about 19 hours ago
Qwen3-Next-80B-A3B Quantized Models
updated a collection about 19 hours ago
Qwen3-Next-80B-A3B Quantized Models
updated a collection about 19 hours ago
Qwen3-Next-80B-A3B Quantized Models
View all activity

Organizations

Argonne National Laboratory's profile picture NM Testing's profile picture Red Hat AI's profile picture Argonne National Laboratory's profile picture Inference Optimization's profile picture

authored 4 papers 2 months ago

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models

Paper • 2508.17467 • Published Aug 24

PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference

Paper • 2509.04377 • Published Sep 4

LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference

Paper • 2509.02753 • Published Sep 2

ImageNet-Think-250K: A Large-Scale Synthetic Dataset for Multimodal Reasoning for Vision Language Models

Paper • 2510.01582 • Published Oct 2
authored a paper about 1 year ago

A Survey of Techniques for Optimizing Transformer Inference

Paper • 2307.07982 • Published Jul 16, 2023
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs