Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456 Paper page - Qwen2 Technical Report
https://huggingface.co/spaces/Qwen/Qwen2-72B-Instruct\n","updatedAt":"2024-07-16T03:30:01.012Z","author":{"_id":"61e4c4ca1ab24785ac11ba69","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61e4c4ca1ab24785ac11ba69/1Q1zhhyGSJ9RJG9MzwxVv.jpeg","fullname":"Binyuan Hui","name":"huybery","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":74,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.775611937046051},"editors":["huybery"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/61e4c4ca1ab24785ac11ba69/1Q1zhhyGSJ9RJG9MzwxVv.jpeg"],"reactions":[{"reaction":"❤️","users":["clem","tnlin","marinaretik","RuPeng"],"count":4}],"isReport":false}},{"id":"66971f10e8ec15fafe09be7d","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":318,"isUserFollowing":false},"createdAt":"2024-07-17T01:32:00.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"This is an automated message from the [Librarian Bot](https://huggingface.co/librarian-bots). I found the following papers similar to this paper. \n\nThe following papers were recommended by the Semantic Scholar API \n\n* [YuLan: An Open-source Large Language Model](https://huggingface.co/papers/2406.19853) (2024)\n* [GEB-1.3B: Open Lightweight Large Language Model](https://huggingface.co/papers/2406.09900) (2024)\n* [A Teacher Is Worth A Million Instructions](https://huggingface.co/papers/2406.19112) (2024)\n* [Aya 23: Open Weight Releases to Further Multilingual Progress](https://huggingface.co/papers/2405.15032) (2024)\n* [A Survey on Large Language Models for Code Generation](https://huggingface.co/papers/2406.00515) (2024)\n\n\n Please give a thumbs up to this comment if you found it helpful!\n\n If you want recommendations for any Paper on Hugging Face checkout [this](https://huggingface.co/spaces/librarian-bots/recommend_similar_papers) Space\n\n You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: `@librarian-bot recommend`","html":"
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
\n
The following papers were recommended by the Semantic Scholar API
Please give a thumbs up to this comment if you found it helpful!
\n
If you want recommendations for any Paper on Hugging Face checkout this Space
\n
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: \n\n@librarian-bot\n\t recommend
\n","updatedAt":"2024-07-17T01:32:00.255Z","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":318,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7473025321960449},"editors":["librarian-bot"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2407.10671","authors":[{"_id":"6695e0de321386ed51decbda","user":{"_id":"62088594a5943c8a8fc94560","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1644733028938-62088594a5943c8a8fc94560.png","isPro":false,"fullname":"An Yang","user":"yangapku","type":"user"},"name":"An Yang","status":"claimed_verified","statusLastChangedAt":"2024-09-23T16:29:54.776Z","hidden":false},{"_id":"6695e0de321386ed51decbdb","user":{"_id":"64b0a77df12b47366663884c","avatarUrl":"/avatars/a212ea862abb5966060e439dd0e7656f.svg","isPro":false,"fullname":"Baosong Yang","user":"Baosong","type":"user"},"name":"Baosong Yang","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:12:42.572Z","hidden":false},{"_id":"6695e0de321386ed51decbdc","user":{"_id":"61e4c4ca1ab24785ac11ba69","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61e4c4ca1ab24785ac11ba69/1Q1zhhyGSJ9RJG9MzwxVv.jpeg","isPro":false,"fullname":"Binyuan Hui","user":"huybery","type":"user"},"name":"Binyuan Hui","status":"claimed_verified","statusLastChangedAt":"2024-07-16T20:16:02.548Z","hidden":false},{"_id":"6695e0de321386ed51decbdd","user":{"_id":"62c695ad5aae1c624ca992e2","avatarUrl":"/avatars/20d10fb3338e4bd4dc59e88a18cb2617.svg","isPro":false,"fullname":"Bo Zheng","user":"bzheng","type":"user"},"name":"Bo Zheng","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:12:57.275Z","hidden":false},{"_id":"6695e0de321386ed51decbde","name":"Bowen Yu","hidden":false},{"_id":"6695e0de321386ed51decbdf","user":{"_id":"622892774323cef93a956a4a","avatarUrl":"/avatars/e57ef5c3b0c4289988ccd42f14e54336.svg","isPro":false,"fullname":"chang zhou","user":"jiemizc","type":"user"},"name":"Chang Zhou","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:13:44.162Z","hidden":false},{"_id":"6695e0de321386ed51decbe0","user":{"_id":"65294b334d7cf551ac50d6a6","avatarUrl":"/avatars/75d21e20b711b871616ef3850bb900b7.svg","isPro":false,"fullname":"ChengpengLi","user":"ChengpengLi","type":"user"},"name":"Chengpeng Li","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:13:51.135Z","hidden":false},{"_id":"6695e0de321386ed51decbe1","name":"Chengyuan Li","hidden":false},{"_id":"6695e0de321386ed51decbe2","user":{"_id":"6434d4989bd5a84b5dd0b0f5","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6434d4989bd5a84b5dd0b0f5/0Elf9qbfG9Hkgypm9pTGm.jpeg","isPro":false,"fullname":"Dayiheng Liu","user":"Losin94","type":"user"},"name":"Dayiheng Liu","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:14:16.139Z","hidden":false},{"_id":"6695e0de321386ed51decbe3","user":{"_id":"635b8b6a37c6a2c12e2cce00","avatarUrl":"/avatars/229fb72180529141515d1df797b33709.svg","isPro":false,"fullname":"Fei Huang","user":"hzhwcmhf","type":"user"},"name":"Fei Huang","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:14:50.471Z","hidden":false},{"_id":"6695e0de321386ed51decbe4","user":{"_id":"61cd4b833dd34ba1985e0753","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61cd4b833dd34ba1985e0753/BfHfrwotoMESpXZOHiIe4.png","isPro":false,"fullname":"KABI","user":"dongguanting","type":"user"},"name":"Guanting Dong","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:15:09.576Z","hidden":false},{"_id":"6695e0de321386ed51decbe5","user":{"_id":"628b430afc0078a72e38b04a","avatarUrl":"/avatars/b4958c184be06534645f2284635d850e.svg","isPro":false,"fullname":"Haoran Wei","user":"whr94621","type":"user"},"name":"Haoran Wei","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:16:15.069Z","hidden":false},{"_id":"6695e0de321386ed51decbe6","name":"Huan Lin","hidden":false},{"_id":"6695e0de321386ed51decbe7","user":{"_id":"63281d05ac205d01918b5fc7","avatarUrl":"/avatars/fc3e0f7285bb2869a92670f764dfc535.svg","isPro":false,"fullname":"Jialong Tang","user":"Jialong","type":"user"},"name":"Jialong Tang","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:16:46.155Z","hidden":false},{"_id":"6695e0de321386ed51decbe8","user":{"_id":"6634979161776e1d8d35b16c","avatarUrl":"/avatars/32a1fac0016445959c2a062c1ab76ab9.svg","isPro":false,"fullname":"jialinwang","user":"jialinwangpku","type":"user"},"name":"Jialin Wang","status":"claimed_verified","statusLastChangedAt":"2025-02-21T10:00:48.017Z","hidden":false},{"_id":"6695e0de321386ed51decbe9","user":{"_id":"64ccb9bfead94891d12aef42","avatarUrl":"/avatars/c54809d43d93d3f0766bd2555cecc4e3.svg","isPro":false,"fullname":"Yang Jian","user":"CSJianYang","type":"user"},"name":"Jian Yang","status":"claimed_verified","statusLastChangedAt":"2024-10-14T19:05:31.557Z","hidden":false},{"_id":"6695e0de321386ed51decbea","user":{"_id":"654bead777401b47e6424f88","avatarUrl":"/avatars/7bcbdbb051c93b004f0dc3ad36c4a0ce.svg","isPro":false,"fullname":"Jianhong Tu","user":"JianhongTu","type":"user"},"name":"Jianhong Tu","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:18:35.197Z","hidden":true},{"_id":"6695e0de321386ed51decbeb","name":"Jianwei Zhang","hidden":false},{"_id":"6695e0de321386ed51decbec","name":"Jianxin Ma","hidden":true},{"_id":"6695e0de321386ed51decbed","name":"Jin Xu","hidden":false},{"_id":"6695e0de321386ed51decbee","user":{"_id":"602f88f5e8149a962412a667","avatarUrl":"/avatars/b78f0e583df8e5d5e3365934fe5f4900.svg","isPro":false,"fullname":"Zhou","user":"Jingren","type":"user"},"name":"Jingren Zhou","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:19:28.484Z","hidden":false},{"_id":"6695e0de321386ed51decbef","user":{"_id":"60113fad51e116b62cd0a30e","avatarUrl":"/avatars/469357d0a4a5d2e104ae5e32801b395d.svg","isPro":false,"fullname":"Jinze Bai","user":"Jinze","type":"user"},"name":"Jinze Bai","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:19:35.233Z","hidden":false},{"_id":"6695e0de321386ed51decbf0","name":"Jinzheng He","hidden":false},{"_id":"6695e0de321386ed51decbf1","user":{"_id":"620760a26e3b7210c2ff1943","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/620760a26e3b7210c2ff1943/VC-rKqimF6yxGESNVlPoR.jpeg","isPro":false,"fullname":"Junyang Lin","user":"JustinLin610","type":"user"},"name":"Junyang Lin","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:19:49.891Z","hidden":false},{"_id":"6695e0de321386ed51decbf2","name":"Kai Dang","hidden":false},{"_id":"6695e0de321386ed51decbf3","user":{"_id":"6453fa96ed6d7fede94408e0","avatarUrl":"/avatars/e8c9025ef24cec958c87a1008bb54fd7.svg","isPro":false,"fullname":"Keming Lu","user":"keminglu","type":"user"},"name":"Keming Lu","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:20:12.506Z","hidden":false},{"_id":"6695e0de321386ed51decbf4","name":"Keqin Chen","hidden":false},{"_id":"6695e0de321386ed51decbf5","user":{"_id":"65b0b3957e5d5a4ecc750de0","avatarUrl":"/avatars/e0d79d3265ca4ad5c5411feb01043fb4.svg","isPro":false,"fullname":"Kexin Yang","user":"dawn0929","type":"user"},"name":"Kexin Yang","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:20:39.738Z","hidden":false},{"_id":"6695e0de321386ed51decbf6","name":"Mei Li","hidden":false},{"_id":"6695e0de321386ed51decbf7","user":{"_id":"5f8946925d083370c711f296","avatarUrl":"/avatars/14246aae3b1f8b7ad050f8ff2c8b260e.svg","isPro":false,"fullname":"Mingfeng Xue","user":"mingfengxue","type":"user"},"name":"Mingfeng Xue","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:20:47.889Z","hidden":false},{"_id":"6695e0de321386ed51decbf8","name":"Na Ni","hidden":false},{"_id":"6695e0de321386ed51decbf9","name":"Pei Zhang","hidden":false},{"_id":"6695e0de321386ed51decbfa","name":"Peng Wang","hidden":false},{"_id":"6695e0de321386ed51decbfb","user":{"_id":"6687b233586426849536faff","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6687b233586426849536faff/q7EBRrWlk2eYidsKCPC9h.jpeg","isPro":false,"fullname":"Ru Peng","user":"RuPeng","type":"user"},"name":"Ru Peng","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:21:03.708Z","hidden":false},{"_id":"6695e0de321386ed51decbfc","user":{"_id":"6209bb200436d7d6f27cbeea","avatarUrl":"/avatars/0b8a72a8b66ef7b36780fe2ccc343f78.svg","isPro":false,"fullname":"Iurnem","user":"Iurnem","type":"user"},"name":"Rui Men","status":"claimed_verified","statusLastChangedAt":"2024-09-23T16:29:56.514Z","hidden":false},{"_id":"6695e0de321386ed51decbfd","user":{"_id":"6629ed94aabce1b25c3db90c","avatarUrl":"/avatars/cbc39db81c8e8f950d3bd2c2e03f71c8.svg","isPro":false,"fullname":"Ruize Gao","user":"gaoruize","type":"user"},"name":"Ruize Gao","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:21:20.781Z","hidden":false},{"_id":"6695e0de321386ed51decbfe","user":{"_id":"649a52e5de0fb7f3f499e583","avatarUrl":"/avatars/25f6106fa168ae57ad5cd8ef55c70d31.svg","isPro":false,"fullname":"Runji Lin","user":"RunjiLin","type":"user"},"name":"Runji Lin","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:21:27.836Z","hidden":false},{"_id":"6695e0de321386ed51decbff","user":{"_id":"6472321922016353ae3ab2e9","avatarUrl":"/avatars/93d9e397ae6079ea0672d6b54234f388.svg","isPro":false,"fullname":"Shijie Wang","user":"simonJJJ","type":"user"},"name":"Shijie Wang","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:22:16.009Z","hidden":false},{"_id":"6695e0de321386ed51decc00","user":{"_id":"63451cf0a05b51f7ded25505","avatarUrl":"/avatars/dec4bbee4a82b773fc58dfc2dce9dbeb.svg","isPro":false,"fullname":"shuai bai","user":"ShuaiBai623","type":"user"},"name":"Shuai Bai","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:22:43.791Z","hidden":false},{"_id":"6695e0de321386ed51decc01","user":{"_id":"6337a9bb0267ebcf026ad110","avatarUrl":"/avatars/12a170b28ade8df979067077828d719c.svg","isPro":false,"fullname":"Sinan Tan","user":"tinytangent","type":"user"},"name":"Sinan Tan","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:22:58.805Z","hidden":false},{"_id":"6695e0de321386ed51decc02","name":"Tianhang Zhu","hidden":false},{"_id":"6695e0de321386ed51decc03","user":{"_id":"64abb96bd691b1c2482e7c19","avatarUrl":"/avatars/ff3a429f985a52c9c5ea4f64872599f2.svg","isPro":false,"fullname":"litianhao","user":"litianhao","type":"user"},"name":"Tianhao Li","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:23:54.179Z","hidden":false},{"_id":"6695e0de321386ed51decc04","name":"Tianyu Liu","hidden":false},{"_id":"6695e0de321386ed51decc05","user":{"_id":"634d06c6f0a69955f662e641","avatarUrl":"/avatars/5a0af8af0a21d2a93192f4a3c430fc60.svg","isPro":false,"fullname":"Wenbin Ge","user":"gewenbin292","type":"user"},"name":"Wenbin Ge","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:24:46.979Z","hidden":false},{"_id":"6695e0de321386ed51decc06","name":"Xiaodong Deng","hidden":false},{"_id":"6695e0de321386ed51decc07","user":{"_id":"668f5875b5b3081d776e4094","avatarUrl":"/avatars/8c763393f25afbe5fb8b132f775e746a.svg","isPro":false,"fullname":"Xiaohuan Zhou","user":"XiaohuanZhou","type":"user"},"name":"Xiaohuan Zhou","status":"claimed_verified","statusLastChangedAt":"2025-03-04T08:53:46.525Z","hidden":false},{"_id":"6695e0de321386ed51decc08","user":{"_id":"645c82c4081be4b32049633a","avatarUrl":"/avatars/e5a08cf0ec5a04bd9d66111382ce0508.svg","isPro":false,"fullname":"xzhren","user":"xingzhang","type":"user"},"name":"Xingzhang Ren","status":"claimed_verified","statusLastChangedAt":"2025-01-03T14:06:34.985Z","hidden":false},{"_id":"6695e0de321386ed51decc09","name":"Xinyu Zhang","hidden":false},{"_id":"6695e0de321386ed51decc0a","name":"Xipin Wei","hidden":false},{"_id":"6695e0de321386ed51decc0b","name":"Xuancheng Ren","hidden":false},{"_id":"6695e0de321386ed51decc0c","name":"Yang Fan","hidden":false},{"_id":"6695e0de321386ed51decc0d","name":"Yang Yao","hidden":false},{"_id":"6695e0de321386ed51decc0e","name":"Yichang Zhang","hidden":false},{"_id":"6695e0de321386ed51decc0f","name":"Yu Wan","hidden":false},{"_id":"6695e0de321386ed51decc10","user":{"_id":"62c6a751a71b40cf26f359a8","avatarUrl":"/avatars/49abd2e71946035452c316d703baaac6.svg","isPro":false,"fullname":"Yunfei Chu","user":"faychu","type":"user"},"name":"Yunfei Chu","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:25:36.195Z","hidden":false},{"_id":"6695e0de321386ed51decc11","user":{"_id":"643fb14a18afbc4d1f3ebfb4","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/643fb14a18afbc4d1f3ebfb4/2qeL_qPSB9_MTzZf46ynJ.png","isPro":false,"fullname":"czy yente","user":"cyente","type":"user"},"name":"Zeyu Cui","status":"claimed_verified","statusLastChangedAt":"2024-09-23T14:22:32.235Z","hidden":false},{"_id":"6695e0de321386ed51decc12","user":{"_id":"64704e973601bb7b06643e98","avatarUrl":"/avatars/52e51f4d1be6769e4397b8be2799cf32.svg","isPro":false,"fullname":"Zhenru Zhang","user":"Zhenru","type":"user"},"name":"Zhenru Zhang","status":"admin_assigned","statusLastChangedAt":"2024-07-18T09:27:56.417Z","hidden":false},{"_id":"6695e0de321386ed51decc13","name":"Zhihao Fan","hidden":false}],"publishedAt":"2024-07-15T12:35:42.000Z","submittedOnDailyAt":"2024-07-16T01:24:43.373Z","title":"Qwen2 Technical Report","submittedOnDailyBy":{"_id":"61e4c4ca1ab24785ac11ba69","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61e4c4ca1ab24785ac11ba69/1Q1zhhyGSJ9RJG9MzwxVv.jpeg","isPro":false,"fullname":"Binyuan Hui","user":"huybery","type":"user"},"summary":"This report introduces the Qwen2 series, the latest addition to our large\nlanguage models and large multimodal models. We release a comprehensive suite\nof foundational and instruction-tuned language models, encompassing a parameter\nrange from 0.5 to 72 billion, featuring dense models and a Mixture-of-Experts\nmodel. Qwen2 surpasses most prior open-weight models, including its predecessor\nQwen1.5, and exhibits competitive performance relative to proprietary models\nacross diverse benchmarks on language understanding, generation, multilingual\nproficiency, coding, mathematics, and reasoning.\n The flagship model, Qwen2-72B, showcases remarkable performance: 84.2 on\nMMLU, 37.9 on GPQA, 64.6 on HumanEval, 89.5 on GSM8K, and 82.4 on BBH as a base\nlanguage model. The instruction-tuned variant, Qwen2-72B-Instruct, attains 9.1\non MT-Bench, 48.1 on Arena-Hard, and 35.7 on LiveCodeBench. Moreover, Qwen2\ndemonstrates robust multilingual capabilities, proficient in approximately 30\nlanguages, spanning English, Chinese, Spanish, French, German, Arabic, Russian,\nKorean, Japanese, Thai, Vietnamese, and more, underscoring its versatility and\nglobal reach.\n To foster community innovation and accessibility, we have made the Qwen2\nmodel weights openly available on Hugging Face1 and ModelScope2, and the\nsupplementary materials including example code on GitHub3. These platforms also\ninclude resources for quantization, fine-tuning, and deployment, facilitating a\nwide range of applications and research endeavors.","upvotes":168,"discussionId":"6695e0df321386ed51decc39","githubRepo":"https://github.com/qwenlm/qwen2","githubRepoAddedBy":"auto","ai_summary":"The Qwen2 series, comprising 0.5 to 72 billion parameter models, surpasses prior open models across language understanding, generation, multilingualism, coding, math, and reasoning, with exceptional performance in benchmarks like MMLU, GPQA, HumanEval, GSM8K, BBH, MT-Bench, Arena-Hard, and LiveCodeBench.","ai_keywords":["Mixture-of-Experts","language models","multimodal models","MMLU","GPQA","HumanEval","GSM8K","BBH","MT-Bench","Arena-Hard","LiveCodeBench"],"githubStars":26635},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"662620578c67281c9eb17f7b","avatarUrl":"/avatars/0e9c26d2234a9f1898a380f183b1645a.svg","isPro":false,"fullname":"Li Zhaodonghui","user":"LZ12DH","type":"user"},{"_id":"668cd4bbe990292e5f6974d3","avatarUrl":"/avatars/d1747b2372e94500ecb5fb56809b482d.svg","isPro":false,"fullname":"Jinyeong Kim","user":"rubatoyeong","type":"user"},{"_id":"62627a439517ea567fb916f2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/62627a439517ea567fb916f2/nx3P1FdnLzaAxazhOS_4u.jpeg","isPro":false,"fullname":"Léo Hunout","user":"hunoutl","type":"user"},{"_id":"63107b18e87051f3e3e0f598","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63107b18e87051f3e3e0f598/R9onir4Y0MZuq1jEWCZ2-.jpeg","isPro":false,"fullname":"Unchun Yang","user":"ucyang","type":"user"},{"_id":"626237d9bbcbd1c34f1bb231","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/626237d9bbcbd1c34f1bb231/EJrOjvAL-68qMCYdnvOrq.png","isPro":true,"fullname":"Ali El Filali","user":"alielfilali01","type":"user"},{"_id":"641aef7b1911d3be67425338","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/641aef7b1911d3be67425338/CmCbWWB6NxkAaus59q31w.jpeg","isPro":false,"fullname":"Qi Liu (SJTU & SII)","user":"purewhite42","type":"user"},{"_id":"612ee6a7b960e78c6d2319d4","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/612ee6a7b960e78c6d2319d4/2Hu9BaAyXbyh1vt0v1Qui.jpeg","isPro":false,"fullname":"Qian Liu","user":"SivilTaram","type":"user"},{"_id":"637f0eb22438d7485b8ef5d7","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/637f0eb22438d7485b8ef5d7/70h7dekqj7LuBobOXckmJ.jpeg","isPro":false,"fullname":"Ming Li","user":"limingcv","type":"user"},{"_id":"61cd4b833dd34ba1985e0753","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61cd4b833dd34ba1985e0753/BfHfrwotoMESpXZOHiIe4.png","isPro":false,"fullname":"KABI","user":"dongguanting","type":"user"},{"_id":"61e4c4ca1ab24785ac11ba69","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61e4c4ca1ab24785ac11ba69/1Q1zhhyGSJ9RJG9MzwxVv.jpeg","isPro":false,"fullname":"Binyuan Hui","user":"huybery","type":"user"},{"_id":"63d9d68c1cae35c27bf7a6a7","avatarUrl":"/avatars/b5ad98cf269ae5f1fe90861fb4170fae.svg","isPro":false,"fullname":"Bowen Yu","user":"Tigerph","type":"user"},{"_id":"63ef22b2bfe4ead22ca9e1e4","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1676616348535-noauth.jpeg","isPro":false,"fullname":"Phú Võ","user":"phuvo","type":"user"}],"acceptLanguages":["*"],"dailyPaperRank":1}">
The Qwen2 series, comprising 0.5 to 72 billion parameter models, surpasses prior open models across language understanding, generation, multilingualism, coding, math, and reasoning, with exceptional performance in benchmarks like MMLU, GPQA, HumanEval, GSM8K, BBH, MT-Bench, Arena-Hard, and LiveCodeBench.
AI-generated summary
This report introduces the Qwen2 series, the latest addition to our large
language models and large multimodal models. We release a comprehensive suite
of foundational and instruction-tuned language models, encompassing a parameter
range from 0.5 to 72 billion, featuring dense models and a Mixture-of-Experts
model. Qwen2 surpasses most prior open-weight models, including its predecessor
Qwen1.5, and exhibits competitive performance relative to proprietary models
across diverse benchmarks on language understanding, generation, multilingual
proficiency, coding, mathematics, and reasoning.
The flagship model, Qwen2-72B, showcases remarkable performance: 84.2 on
MMLU, 37.9 on GPQA, 64.6 on HumanEval, 89.5 on GSM8K, and 82.4 on BBH as a base
language model. The instruction-tuned variant, Qwen2-72B-Instruct, attains 9.1
on MT-Bench, 48.1 on Arena-Hard, and 35.7 on LiveCodeBench. Moreover, Qwen2
demonstrates robust multilingual capabilities, proficient in approximately 30
languages, spanning English, Chinese, Spanish, French, German, Arabic, Russian,
Korean, Japanese, Thai, Vietnamese, and more, underscoring its versatility and
global reach.
To foster community innovation and accessibility, we have made the Qwen2
model weights openly available on Hugging Face1 and ModelScope2, and the
supplementary materials including example code on GitHub3. These platforms also
include resources for quantization, fine-tuning, and deployment, facilitating a
wide range of applications and research endeavors.