Paper page - Qwen2 Technical Report

https://huggingface.co/spaces/Qwen/Qwen2-72B-Instruct

arXiv:2407.10671

Qwen2 Technical Report

Published on Jul 15, 2024 · Submitted by Binyuan Hui on Jul 16, 2024 · #1 Paper of the day
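Authors: An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Zhou, Chengpeng Li, Chengyuan Li, Dayiheng Liu, Fei Huang, Guanting Dong, Haoran Wei, Huan Lin, Jialong Tang, Jialin Wang, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Ma, Jin Xu, Jingren Zhou, Jinze Bai, Jinzheng He, Junyang Lin, Kai Dang, Keming Lu, Keqin Chen, Kexin Yang, Mei Li, Mingfeng Xue, Na Ni, Pei Zhang, Peng Wang, Ru Peng, Rui Men, Ruize Gao, Runji Lin, Shijie Wang, Shuai Bai, Sinan Tan, Tianhang Zhu, Tianhao Li, Tianyu Liu, Wenbin Ge, Xiaodong Deng, Xiaohuan Zhou, Xingzhang Ren, Xinyu Zhang, Xipin Wei, Xuancheng Ren, Yang Fan, Yang Yao, Yichang Zhang, Yu Wan, Yunfei Chu, Zeyu Cui, Zhenru Zhang, Zhihao Fan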

Abstract

AI-generated summary: The Qwen2 series, comprising models from 0.5 to 72 billion parameters, surpasses most prior open-weight models across language understanding, generation, multilingual proficiency, coding, mathematics, and reasoning, with strong results on benchmarks such as MMLU, GPQA, HumanEval, GSM8K, BBH, MT-Bench, Arena-Hard, and LiveCodeBench.

This report introduces the Qwen2 series, the latest addition to our large language models and large multimodal models. We release a comprehensive suite of foundational and instruction-tuned language models, encompassing a parameter range from 0.5 to 72 billion, featuring dense models and a Mixture-of-Experts model. Qwen2 surpasses most prior open-weight models, including its predecessor Qwen1.5, and exhibits competitive performance relative to proprietary models across diverse benchmarks on language understanding, generation, multilingual proficiency, coding, mathematics, and reasoning.

The flagship model, Qwen2-72B, showcases remarkable performance: 84.2 on MMLU, 37.9 on GPQA, 64.6 on HumanEval, 89.5 on GSM8K, and 82.4 on BBH as a base language model. The instruction-tuned variant, Qwen2-72B-Instruct, attains 9.1 on MT-Bench, 48.1 on Arena-Hard, and 35.7 on LiveCodeBench. Moreover, Qwen2 demonstrates robust multilingual capabilities, proficient in approximately 30 languages, spanning English, Chinese, Spanish, French, German, Arabic, Russian, Korean, Japanese, Thai, Vietnamese, and more, underscoring its versatility and global reach.

To foster community innovation and accessibility, we have made the Qwen2 model weights openly available on Hugging Face and ModelScope, and the supplementary materials including example code on GitHub. These platforms also include resources for quantization, fine-tuning, and deployment, facilitating a wide range of applications and research endeavors.

Community

Paper author Paper submitter

Qwen2 Technical Report


This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

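The following papers were recommended by the Semantic Scholar API:

* YuLan: An Open-source Large Language Model (2024) https://huggingface.co/papers/2406.19853
* GEB-1.3B: Open Lightweight Large Language Model (2024) https://huggingface.co/papers/2406.09900
* A Teacher Is Worth A Million Instructions (2024) https://huggingface.co/papers/2406.19112
* Aya 23: Open Weight Releases to Further Multilingual Progress (2024) https://huggingface.co/papers/2405.15032
* A Survey on Large Language Models for Code Generation (2024) https://huggingface.co/papers/2406.00515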

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any paper on Hugging Face, check out this Space: https://huggingface.co/spaces/librarian-bots/recommend_similar_papers

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend


Models citing this paper 1,000+


Datasets citing this paper 2

Spaces citing this paper 31,466

Collections including this paper 46