Dongwon Jo

dongwonjo

AI & ML interests

Efficient AI, Model Compression, Quantization, Pruning, Generative Model, Large Language Model, Diffusion

dongwonjo's activity

commented on a paper 5 months ago

Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models

Paper • 2406.12311 • Published Jun 18