arxiv:2409.00492
Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated a model 4 days ago: google/gemma-4-E4B-it-qat-mobile-ct
updated a model 4 days ago: google/gemma-4-E2B-it-qat-mobile-ct
published a model 6 days ago: google/gemma-4-E4B-it-qat-mobile-ct