2 9

Mahdi Nikdan

mnikdan97

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers

upvoted a paper about 2 months ago

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

authored a paper 2 months ago

ECO: Quantized Training without Full-Precision Master Weights

View all activity

Organizations

None yet

upvoted 2 papers about 2 months ago

DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers

Paper • 2602.02016 • Published Feb 2 • 12

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 60

authored a paper 2 months ago

ECO: Quantized Training without Full-Precision Master Weights

Paper • 2601.22101 • Published Jan 29 • 6

submitted a paper to Daily Papers 2 months ago

ECO: Quantized Training without Full-Precision Master Weights

Paper • 2601.22101 • Published Jan 29 • 6

updated a model 6 months ago

mnikdan97/gpt2-xlarge-sft-unnatural-new

Updated Oct 13, 2025

published a model 6 months ago

mnikdan97/gpt2-xlarge-sft-unnatural-new

Updated Oct 13, 2025

updated a model 6 months ago

mnikdan97/gpt2-xlarge-sft-unnatural

Updated Oct 12, 2025 • 1

published a model 6 months ago

mnikdan97/gpt2-xlarge-sft-unnatural

Updated Oct 12, 2025 • 1

updated a model 6 months ago

mnikdan97/gpt2-base-sft-unnatural

Updated Oct 11, 2025 • 1

published a model 6 months ago

mnikdan97/gpt2-base-sft-unnatural

Updated Oct 11, 2025 • 1

upvoted a paper 6 months ago

Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Paper • 2509.23202 • Published Sep 27, 2025 • 29

upvoted a paper 8 months ago

The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Paper • 2507.18553 • Published Jul 24, 2025 • 41

upvoted a paper 10 months ago

Unified Scaling Laws for Compressed Representations

Paper • 2506.01863 • Published Jun 2, 2025 • 19

authored a paper 10 months ago

Efficient Data Selection at Scale via Influence Distillation

Paper • 2505.19051 • Published May 25, 2025 • 4

commented a paper 10 months ago

Efficient Data Selection at Scale via Influence Distillation

Paper • 2505.19051 • Published May 25, 2025 • 4 •

upvoted a paper 10 months ago

SVD-Free Low-Rank Adaptive Gradient Optimization for Large Language Models

Paper • 2505.17967 • Published May 23, 2025 • 17

authored a paper 10 months ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20, 2025 • 78

upvoted a paper 10 months ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20, 2025 • 78

authored 2 papers about 1 year ago

RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation

Paper • 2401.04679 • Published Jan 9, 2024 • 2

SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks

Paper • 2302.04852 • Published Feb 9, 2023

Mahdi Nikdan

AI & ML interests

Recent Activity

Organizations

mnikdan97's activity