BitNet: Scaling 1-bit Transformers for Large Language Models https://arxiv.org/abs/2310.11453 Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Huaijie Wang, Lingxiao Ma, Fan Yang, Ruiping Wang, Yi Wu, Furu Wei BitNet