Marchisio, K., Dash, S., Chen, H., Aumiller, D., Üstün, A., Hooker, S., & Ruder, S. (2024). How Does Quantization Affect Multilingual LLMs? arXiv preprint arXiv:2407.03211. https://arxiv.org/abs/2407.03211