Paper page - Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

https://huggingface.co/microsoft/Phi-3-mini-4k-instruct

www3077665332 (chengjianling), 2024-10-08 (reply):

When did mini3.5-gguf release the new mobile phone processing and it can run smoothly?

seven89 (seven zhang), 2024-04-23:

G

h2m (Maksym Huczynski), 2024-04-23:

pocket-size gpt-3.5?
14B matching GPT-4-0314 MT-Bench?
<3

ZappY-AI (Mohd Zeeshan), 2024-04-23:

Weights or it didn't happen. Also, please make it Apache 2.0.

zolicsaki (Zoltan Csaki), 2024-04-23:

Weights?

adamm-hf (Adam Molnar), 2024-04-23:

Very cool! Would be awesome to increase visibility + experimentation by sharing the weights as well 🤗

siberparsomen (Arda Tuğsat), 2024-04-23:

This is incredible. When will we see the weights?

thomwolf (Thomas Wolf), 2024-04-23:

So great to see the successor of Phi-1.5/2 – Looking forward to being able to play with the model and embed it locally everywhere!

razvanab (Razvan), 2024-04-23:

Weights Please 🙏

Alex805 (A Bugayev), 2024-04-23:

These SLMs keep getting better. It would be cool to get an APK that actually runs them on mobile devices without Termux. The two existing apps that I know of have limited model support. And GPT-3.5-level model quality is a good occasion to wrap it.

maveriq (Haris Jabbar), 2024-04-23, in reply to Alex805:

I agree that SLMs probably need more focus and have the potential to make great strides on multiple fronts, be it accessibility, deployability, inference speed, or new use cases. Of course, it means putting more effort into dataset curation and maybe even the architecture. The Phi series is proof that focused data curation alone can improve performance quite a bit.

ajibawa-2023 (Feynman Innovations), 2024-04-23:

Given recent events, I don't think the weights will be made available, and forget about the dataset. Even if the weights are released, they will be taken down the next day for some testing or alignment or some other stuff, never to return. Great job guys!!

ThreeBlessings (Volodymyr), 2024-04-23, in reply to ajibawa-2023:

I'm not sure what recent events you're referring to. I'll wait for the official statement before jumping to conclusions.

InvidFlower (Shawn Fumo), 2024-05-22, in reply to ajibawa-2023:

I'm pretty sure that was just related to a fine-tune that was done on top of Llama 3. It doesn't have anything to do with Phi 3, which is trained from scratch. And in fact the weights for the 14b Medium Phi-3 model and a vision model were just released yesterday.

MouhuAI (Mouhu), 2024-04-23:

When will the model be released?

edmond (Edmond Jacoupeau), 2024-04-23:

Llama was never competitive, Llama 2 was beaten by Mistral within a few weeks, and Llama 3 was beaten by Phi 3 within a few days?
Damn, if this is true, Zuck might start to get seriously mad... (even if Phi is using Llama 2)

Jaward (Jaward Sesay), 2024-04-23:

Here's a quick walkthrough of the paper: https://huggingface.co/posts/Jaward/284702584639894

mikelabs (Mike Young), 2024-04-23:

I saw that the weights are coming tomorrow (on Twitter, hopefully it's legit!). In any case, there's a plain-English rewrite of this paper available here if you want: https://www.aimodels.fyi/papers/arxiv/phi-3-technical-report-highly-capable-language

lukestanley (Luke Stanley), 2024-04-23:

Surely the first reference needs fixing to say 2024 and to use capital letters in the right places?
Currently says: "References
[AI23] Meta AI. Introducing meta llama 3: The most capable openly available llm to date, 2023."

Surely it should be: "[AI23] Meta AI. Introducing Meta Llama 3: The most capable openly available LLM to date, 2024."?
@gugarosa
So looking forward to playing with this, well done all!

deshwalmahesh (Mahesh Deshwal), 2024-04-23:

How did you "filter" data for Phase-1 and 2? Was it manual? How did you ensure if it was automated?

Also, what were the criteria for "inducing reasoning" on the dataset and web?

mishig (Mishig Davaadorj), 2024-04-23:

it is now available on hugging chat 🔥 https://huggingface.co/chat/models/microsoft/Phi-3-mini-4k-instruct

retteghy (Retteghy), 2024-04-23, in reply to mishig:

For some reason the one on hugging chat gives me crappy answers, e.g.:
Q: what files are needed for a chrome extension, what are their names?
A: To add unit tests for the URLAnalyzer class, you'll need to set up a testing framework like Jest. Here's an example of how you might write tests for the waitForClassPresence and analyzeUrl met [...] (and so on, completely unrelated junk)

I tried the GGUF Q4 version in GPT4All and got much better results; the only issue is with the stop token.

\n","updatedAt":"2024-04-24T03:43:31.301Z","author":{"_id":"658430e5c5599ad89a350e95","avatarUrl":"/avatars/49d5d69dca622797a1378c50cf460377.svg","fullname":"Retteghy","name":"retteghy","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":1,"isUserFollowing":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.9022683501243591},"editors":["retteghy"],"editorAvatarUrls":["/avatars/49d5d69dca622797a1378c50cf460377.svg"],"reactions":[],"isReport":false,"parentCommentId":"6627d3764db3629b5361365a"}},{"id":"662985dac9d9396ee27ccf29","author":{"_id":"5f17f0a0925b9863e28ad517","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5f17f0a0925b9863e28ad517/fXIY5i9RLsIa1v3CCuVtt.jpeg","fullname":"Victor Mustar","name":"victor","type":"user","isPro":true,"isHf":true,"isHfAdmin":true,"isMod":false,"followerCount":5157,"isUserFollowing":false},"createdAt":"2024-04-24T22:21:14.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"> for some reason the one on hugging chat gives me crappy answers , e.g.:\n\ncc @nsarrazin who is working on it - should be better soon ๐Ÿ”ฅ","html":"

> for some reason the one on hugging chat gives me crappy answers, e.g.:

cc @nsarrazin who is working on it - should be better soon 🔥

\n","updatedAt":"2024-04-24T22:21:14.040Z","author":{"_id":"5f17f0a0925b9863e28ad517","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5f17f0a0925b9863e28ad517/fXIY5i9RLsIa1v3CCuVtt.jpeg","fullname":"Victor Mustar","name":"victor","type":"user","isPro":true,"isHf":true,"isHfAdmin":true,"isMod":false,"followerCount":5157,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8766080737113953},"editors":["victor"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/5f17f0a0925b9863e28ad517/fXIY5i9RLsIa1v3CCuVtt.jpeg"],"reactions":[],"isReport":false,"parentCommentId":"6627d3764db3629b5361365a"}}]},{"id":"6627d389c09f335afc48ca50","author":{"_id":"5e67bdd61009063689407479","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg","fullname":"Clem ๐Ÿค—","name":"clem","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":2868,"isUserFollowing":false},"createdAt":"2024-04-23T15:28:09.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"The weights just dropped - MIT license!\n\nhttps://huggingface.co/microsoft/Phi-3-mini-128k-instruct \nhttps://huggingface.co/microsoft/Phi-3-mini-4k-instruct","html":"

The weights just dropped - MIT license!

https://huggingface.co/microsoft/Phi-3-mini-128k-instruct
https://huggingface.co/microsoft/Phi-3-mini-4k-instruct

\n","updatedAt":"2024-04-23T15:28:09.192Z","author":{"_id":"5e67bdd61009063689407479","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg","fullname":"Clem ๐Ÿค—","name":"clem","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":2868,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6164886951446533},"editors":["clem"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg"],"reactions":[{"reaction":"๐Ÿš€","users":["ThreeBlessings","Jaward","AdinaY","Vezora","colannino","KrishnaKaasyap"],"count":6},{"reaction":"๐Ÿ”ฅ","users":["Jaward","AdinaY","Vezora"],"count":3},{"reaction":"๐Ÿค—","users":["KrishnaKaasyap"],"count":1}],"isReport":false}},{"id":"6627edd82c3877cd9db67680","author":{"_id":"64aed48cfec303c461d06242","avatarUrl":"/avatars/236c771e6c5a25ef6ed5e1bc061e30b8.svg","fullname":"Krishna Kaasyap","name":"KrishnaKaasyap","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":6,"isUserFollowing":false},"createdAt":"2024-04-23T17:20:24.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Would love to see all models in this family on LMSYS arena!\n\nArena is like double blind peer review ++ randomized controlled trials in science! The golden standard to judge something. I hope some API provider like Together API would provide inference services for these family of models to us all and also for Arena!","html":"

Would love to see all models in this family on LMSYS arena!

Arena is like double-blind peer review plus randomized controlled trials in science: the gold standard for judging something. I hope an API provider such as Together API will offer inference for this family of models, both for all of us and for the Arena!

\n","updatedAt":"2024-04-23T17:20:24.977Z","author":{"_id":"64aed48cfec303c461d06242","avatarUrl":"/avatars/236c771e6c5a25ef6ed5e1bc061e30b8.svg","fullname":"Krishna Kaasyap","name":"KrishnaKaasyap","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":6,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9594466090202332},"editors":["KrishnaKaasyap"],"editorAvatarUrls":["/avatars/236c771e6c5a25ef6ed5e1bc061e30b8.svg"],"reactions":[],"isReport":false},"replies":[{"id":"6627ef58c09f335afc4fe9bb","author":{"_id":"5e67bdd61009063689407479","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg","fullname":"Clem ๐Ÿค—","name":"clem","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":2868,"isUserFollowing":false},"createdAt":"2024-04-23T17:26:48.000Z","type":"comment","data":{"edited":true,"hidden":false,"latest":{"raw":"you can try the models with transformers (https://github.com/huggingface/transformers) or TGI already ( https://github.com/huggingface/text-generation-inference) cc @Narsil @lysandre","html":"

You can already try the models with transformers (https://github.com/huggingface/transformers) or TGI (https://github.com/huggingface/text-generation-inference). cc @Narsil @lysandre

\n","updatedAt":"2024-04-23T17:27:09.091Z","author":{"_id":"5e67bdd61009063689407479","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg","fullname":"Clem ๐Ÿค—","name":"clem","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":2868,"isUserFollowing":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.7363807559013367},"editors":["clem"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg"],"reactions":[{"reaction":"โค๏ธ","users":["KrishnaKaasyap"],"count":1}],"isReport":false,"parentCommentId":"6627edd82c3877cd9db67680"}}]},{"id":"6627f34809826aaa250d1b01","author":{"_id":"615b8a9c23f3c5e91441a387","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/615b8a9c23f3c5e91441a387/b7wAb09b-doTD5zf4gZOc.jpeg","fullname":"Carlos Escobar","name":"Broomva","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2024-04-23T17:43:36.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Does it have any tuning for function calling? What dataset was used or how to fine tune it for agent applications?","html":"

Does it have any tuning for function calling? What dataset was used, and how can it be fine-tuned for agent applications?

\n","updatedAt":"2024-04-23T17:43:36.247Z","author":{"_id":"615b8a9c23f3c5e91441a387","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/615b8a9c23f3c5e91441a387/b7wAb09b-doTD5zf4gZOc.jpeg","fullname":"Carlos Escobar","name":"Broomva","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9594060182571411},"editors":["Broomva"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/615b8a9c23f3c5e91441a387/b7wAb09b-doTD5zf4gZOc.jpeg"],"reactions":[],"isReport":false}},{"id":"6627fe02ddab753c4a19a818","author":{"_id":"60c8d264224e250fb0178f77","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/60c8d264224e250fb0178f77/i8fbkBVcoFeJRmkQ9kYAE.png","fullname":"Adam Lee","name":"Abecid","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":7,"isUserFollowing":false},"createdAt":"2024-04-23T18:29:22.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Wow people are actually starting to use hf comments now really cool ๐Ÿ˜Ž","html":"

Wow, people are actually starting to use HF comments now, really cool 😎

\n","updatedAt":"2024-04-23T18:29:22.801Z","author":{"_id":"60c8d264224e250fb0178f77","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/60c8d264224e250fb0178f77/i8fbkBVcoFeJRmkQ9kYAE.png","fullname":"Adam Lee","name":"Abecid","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":7,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9734535813331604},"editors":["Abecid"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/60c8d264224e250fb0178f77/i8fbkBVcoFeJRmkQ9kYAE.png"],"reactions":[{"reaction":"๐Ÿ˜Ž","users":["clem"],"count":1},{"reaction":"๐Ÿ”ฅ","users":["victor"],"count":1}],"isReport":false}},{"id":"662809bad48d5daa797416a3","author":{"_id":"639c751c8a34ed9a404d1627","avatarUrl":"/avatars/ad21cfa15f2ac6dfbbd7ad60c28266f6.svg","fullname":"Damodharan","name":"damojay","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2024-04-23T19:19:22.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"To be able to finetune it for json mode and the ability to use it in mobile will have very nice impact!\nOpens so many opportunities for agents in GPU poor devices","html":"

Being able to fine-tune it for JSON mode and run it on mobile will have a very nice impact!
It opens up so many opportunities for agents on GPU-poor devices.

\n","updatedAt":"2024-04-23T19:19:22.180Z","author":{"_id":"639c751c8a34ed9a404d1627","avatarUrl":"/avatars/ad21cfa15f2ac6dfbbd7ad60c28266f6.svg","fullname":"Damodharan","name":"damojay","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9369449019432068},"editors":["damojay"],"editorAvatarUrls":["/avatars/ad21cfa15f2ac6dfbbd7ad60c28266f6.svg"],"reactions":[],"isReport":false}},{"id":"66285f875d246621f331a128","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":318,"isUserFollowing":false},"createdAt":"2024-04-24T01:25:27.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"This is an automated message from the [Librarian Bot](https://huggingface.co/librarian-bots). I found the following papers similar to this paper. 
\n\nThe following papers were recommended by the Semantic Scholar API \n\n* [Stable LM 2 1.6B Technical Report](https://huggingface.co/papers/2402.17834) (2024)\n* [Nemotron-4 15B Technical Report](https://huggingface.co/papers/2402.16819) (2024)\n* [Latxa: An Open Language Model and Evaluation Suite for Basque](https://huggingface.co/papers/2403.20266) (2024)\n* [Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws](https://huggingface.co/papers/2404.05405) (2024)\n* [LAB: Large-Scale Alignment for ChatBots](https://huggingface.co/papers/2403.01081) (2024)\n\n\n Please give a thumbs up to this comment if you found it helpful!\n\n If you want recommendations for any Paper on Hugging Face checkout [this](https://huggingface.co/spaces/librarian-bots/recommend_similar_papers) Space\n\n You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: `@librarian-bot recommend`","html":"

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

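The following papers were recommended by the Semantic Scholar API

* Stable LM 2 1.6B Technical Report (https://huggingface.co/papers/2402.17834) (2024)
* Nemotron-4 15B Technical Report (https://huggingface.co/papers/2402.16819) (2024)
* Latxa: An Open Language Model and Evaluation Suite for Basque (https://huggingface.co/papers/2403.20266) (2024)
* Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws (https://huggingface.co/papers/2404.05405) (2024)
* LAB: Large-Scale Alignment for ChatBots (https://huggingface.co/papers/2403.01081) (2024)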

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any paper on Hugging Face, check out this Space: https://huggingface.co/spaces/librarian-bots/recommend_similar_papers

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

\n","updatedAt":"2024-04-24T01:25:27.541Z","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":318,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7284889221191406},"editors":["librarian-bot"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg"],"reactions":[],"isReport":false}},{"id":"662c2ff78082c634bba7ecff","author":{"_id":"5f353bb37e58354338621655","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1639773384591-5f353bb37e58354338621655.jpeg","fullname":"Nicholas Broad","name":"nbroad","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":117,"isUserFollowing":false},"createdAt":"2024-04-26T22:51:35.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"phi-4 will run on a toaster","html":"

phi-4 will run on a toaster

\n","updatedAt":"2024-04-26T22:51:35.868Z","author":{"_id":"5f353bb37e58354338621655","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1639773384591-5f353bb37e58354338621655.jpeg","fullname":"Nicholas Broad","name":"nbroad","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":117,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7404853105545044},"editors":["nbroad"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1639773384591-5f353bb37e58354338621655.jpeg"],"reactions":[{"reaction":"๐Ÿš€","users":["mrfakename","m-ric","florentgbelidji","CkMad","InvidFlower","jayebaku"],"count":6}],"isReport":false}},{"id":"662e8a46a364f7df3995db19","author":{"_id":"648a210e9da3cc3506961585","avatarUrl":"/avatars/808e9d7ac99837fe79169d0b8d49c366.svg","fullname":"Ajith V Prabhakar","name":"ajithprabhakar","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"createdAt":"2024-04-28T17:41:26.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Here is my blog showcasing this paper : https://ajithp.com/2024/04/28/the-miniature-language-model-with-massive-potential-introducing-phi-3/","html":"

Here is my blog post showcasing this paper: https://ajithp.com/2024/04/28/the-miniature-language-model-with-massive-potential-introducing-phi-3/

\n","updatedAt":"2024-04-28T17:41:26.653Z","author":{"_id":"648a210e9da3cc3506961585","avatarUrl":"/avatars/808e9d7ac99837fe79169d0b8d49c366.svg","fullname":"Ajith V Prabhakar","name":"ajithprabhakar","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7513186931610107},"editors":["ajithprabhakar"],"editorAvatarUrls":["/avatars/808e9d7ac99837fe79169d0b8d49c366.svg"],"reactions":[],"isReport":false}},{"id":"662fc89872779ad1f13569b8","author":{"_id":"63d10d4e8eaa4831005e92b5","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63d10d4e8eaa4831005e92b5/7p7-OmWM6PqqCs7ZStPGD.jpeg","fullname":"Aymeric Roucher","name":"m-ric","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":1889,"isUserFollowing":false},"createdAt":"2024-04-29T16:19:36.000Z","type":"comment","data":{"edited":true,"hidden":false,"latest":{"raw":"This very high performance on some benchmarks (the paper claims a performance than Mixtral 8x7B) seems suspicious, given that the model scores way lower on Chatbot Arena: it has an ELO 1064 as of now, so it's good but below Mistral 7B-Instruct-0.2 (1073), and far below Mixtral (1114).","html":"

This very high performance on some benchmarks (the paper claims performance higher than Mixtral 8x7B) seems suspicious, given that the model scores much lower on Chatbot Arena: it has an Elo of 1064 as of now, so it's good, but below Mistral 7B-Instruct-0.2 (1073) and far below Mixtral (1114).
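To put those Arena numbers in perspective, here is a rough sketch that applies the classic Elo expected-score formula to the ratings quoted above. It assumes the Arena ratings can be read like standard Elo with a 400-point scale, which is an assumption here and not something stated in the thread; the function and variable names are illustrative.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score of A against B under the classic Elo model (assumed scale)."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Ratings as quoted in the comment above.
phi3_mini = 1064
mistral_7b_instruct = 1073
mixtral_8x7b = 1114

vs_mistral = expected_score(phi3_mini, mistral_7b_instruct)
vs_mixtral = expected_score(phi3_mini, mixtral_8x7b)

print(f"vs Mistral 7B-Instruct-0.2: {vs_mistral:.3f}")  # 0.487
print(f"vs Mixtral: {vs_mixtral:.3f}")  # 0.429
```

Under that assumption, a 9-point gap is close to a coin flip, while a 50-point gap corresponds to winning roughly 43% of head-to-head votes.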

\n","updatedAt":"2024-04-29T16:33:16.979Z","author":{"_id":"63d10d4e8eaa4831005e92b5","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63d10d4e8eaa4831005e92b5/7p7-OmWM6PqqCs7ZStPGD.jpeg","fullname":"Aymeric Roucher","name":"m-ric","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":1889,"isUserFollowing":false}},"numEdits":3,"identifiedLanguage":{"language":"en","probability":0.9493606090545654},"editors":["m-ric"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/63d10d4e8eaa4831005e92b5/7p7-OmWM6PqqCs7ZStPGD.jpeg"],"reactions":[],"isReport":false}},{"id":"66303f85b6d15534200c23e0","author":{"_id":"6366313c361a96184dbadff8","avatarUrl":"/avatars/9b83c5aedc02267d9596b19c20fbe593.svg","fullname":"HAN JUNGU","name":"JUNGU","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":4,"isUserFollowing":false},"createdAt":"2024-04-30T00:47:01.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"I've never seen so many comments on an HF paper before.","html":"

I've never seen so many comments on an HF paper before.

\n","updatedAt":"2024-04-30T00:47:01.431Z","author":{"_id":"6366313c361a96184dbadff8","avatarUrl":"/avatars/9b83c5aedc02267d9596b19c20fbe593.svg","fullname":"HAN JUNGU","name":"JUNGU","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":4,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9923465251922607},"editors":["JUNGU"],"editorAvatarUrls":["/avatars/9b83c5aedc02267d9596b19c20fbe593.svg"],"reactions":[{"reaction":"โค๏ธ","users":["edmond","clem","jayebaku"],"count":3}],"isReport":false},"replies":[{"id":"6630e9c95a9e326bb340764e","author":{"_id":"62af665424488e6adfa9b8e2","avatarUrl":"/avatars/2bdb4a26fde4cbe5b4673e53e0d44540.svg","fullname":"Edmond Jacoupeau","name":"edmond","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false},"createdAt":"2024-04-30T12:53:29.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Yeah, people dont want to understand that we dont want big open source LLMs but decent sized ones... ","html":"

Yeah, people don't want to understand that we don't want big open-source LLMs but decent-sized ones...

\n","updatedAt":"2024-04-30T12:53:29.413Z","author":{"_id":"62af665424488e6adfa9b8e2","avatarUrl":"/avatars/2bdb4a26fde4cbe5b4673e53e0d44540.svg","fullname":"Edmond Jacoupeau","name":"edmond","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":3,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8838304877281189},"editors":["edmond"],"editorAvatarUrls":["/avatars/2bdb4a26fde4cbe5b4673e53e0d44540.svg"],"reactions":[{"reaction":"๐Ÿ”ฅ","users":["jayebaku"],"count":1}],"isReport":false,"parentCommentId":"66303f85b6d15534200c23e0"}},{"id":"664e1a012fb51bcf1e752356","author":{"_id":"6402c5df5caf6d21d680a2e0","avatarUrl":"/avatars/e78b8cd006ddee8ac5f19f809c3ff659.svg","fullname":"Shawn Fumo","name":"InvidFlower","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2024-05-22T16:14:57.000Z","type":"comment","data":{"edited":true,"hidden":false,"latest":{"raw":"You didn't see the bitnet paper ๐Ÿ˜‰","html":"

You didn't see the bitnet paper 😉

\n","updatedAt":"2024-05-22T16:15:13.607Z","author":{"_id":"6402c5df5caf6d21d680a2e0","avatarUrl":"/avatars/e78b8cd006ddee8ac5f19f809c3ff659.svg","fullname":"Shawn Fumo","name":"InvidFlower","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.985586941242218},"editors":["InvidFlower"],"editorAvatarUrls":["/avatars/e78b8cd006ddee8ac5f19f809c3ff659.svg"],"reactions":[],"isReport":false,"parentCommentId":"66303f85b6d15534200c23e0"}}]},{"id":"6664a73b8a270cedd51cbee1","author":{"_id":"6186ddf6a7717cb375090c01","avatarUrl":"/avatars/716b6a7d1094c8036b2a8a7b9063e8aa.svg","fullname":"Julien BLANCHON","name":"blanchon","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":176,"isUserFollowing":false},"createdAt":"2024-06-08T18:47:23.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"# Unleashing Phi-3-mini: Powerful AI on Your Phone \n\nhttps://cdn-uploads.huggingface.co/production/uploads/6186ddf6a7717cb375090c01/cBOSBURURaxRjK1TD6GQ4.mp4 \n\n## Links ๐Ÿ”—:\n๐Ÿ‘‰ Subscribe: https://www.youtube.com/@Arxflix\n๐Ÿ‘‰ Twitter: https://x.com/arxflix\n๐Ÿ‘‰ LMNT (Partner): https://lmnt.com/\n\n\nBy Arxflix\n![9t4iCUHx_400x400-1.jpg](https://cdn-uploads.huggingface.co/production/uploads/6186ddf6a7717cb375090c01/v4S5zBurs0ouGNwYj1GEd.jpeg)","html":"

Unleashing Phi-3-mini: Powerful AI on Your Phone

Links 🔗:

👉 Subscribe: https://www.youtube.com/@Arxflix
👉 Twitter: https://x.com/arxflix
👉 LMNT (Partner): https://lmnt.com/

By Arxflix

\n","updatedAt":"2024-06-08T18:47:23.019Z","author":{"_id":"6186ddf6a7717cb375090c01","avatarUrl":"/avatars/716b6a7d1094c8036b2a8a7b9063e8aa.svg","fullname":"Julien BLANCHON","name":"blanchon","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":176,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.4771801233291626},"editors":["blanchon"],"editorAvatarUrls":["/avatars/716b6a7d1094c8036b2a8a7b9063e8aa.svg"],"reactions":[{"reaction":"โค๏ธ","users":["nguyenbh"],"count":1}],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2404.14219","authors":[{"_id":"66271d7a322746c7bfca83a8","user":{"_id":"649af1e3ca20306aeef590b8","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/649af1e3ca20306aeef590b8/Tvy8yf7zDO2FoNs03tWj6.jpeg","isPro":false,"fullname":"Marah Abdin","user":"marah-abdin","type":"user"},"name":"Marah Abdin","status":"claimed_verified","statusLastChangedAt":"2024-04-23T13:00:35.357Z","hidden":false},{"_id":"66271d7a322746c7bfca83a9","user":{"_id":"651736dd20b18e99b44dd5d5","avatarUrl":"/avatars/c054eb2567e027408340a03a7ca0b29d.svg","isPro":false,"fullname":"Sam Ade Jacobs","user":"samadejacobs","type":"user"},"name":"Sam Ade Jacobs","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:17:35.386Z","hidden":false},{"_id":"66271d7a322746c7bfca83aa","user":{"_id":"6324b8417d6d0cbbe2c65019","avatarUrl":"/avatars/7c1a468daaa984203a600a3b3526298f.svg","isPro":false,"fullname":"Ammar Awan","user":"ammarawan","type":"user"},"name":"Ammar Ahmad Awan","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:17:44.962Z","hidden":false},{"_id":"66271d7a322746c7bfca83ab","user":{"_id":"623e17038972a8c030af23fa","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/623e17038972a8c030af23fa/2ERyhCNAdV2wt4uRy3Wq5.jpeg","isPro":false,"fullname":"Jyoti Aneja","user":"jyotiA","type":"user"},"name":"Jyoti 
Aneja","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:17:51.473Z","hidden":false},{"_id":"66271d7a322746c7bfca83ac","user":{"_id":"6557a41ee74d1065f7d05c51","avatarUrl":"/avatars/b29e57fc61d1d3012637f84b82c14818.svg","isPro":false,"fullname":"Ahmed Awadallah","user":"AhmedAwadallah","type":"user"},"name":"Ahmed Awadallah","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:18:04.741Z","hidden":false},{"_id":"66271d7a322746c7bfca83ad","name":"Hany Awadalla","hidden":false},{"_id":"66271d7a322746c7bfca83ae","user":{"_id":"5f3ec133a4dd343b63a632dd","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1597948137099-noauth.jpeg","isPro":false,"fullname":"Nguyen Bach","user":"nguyenbh","type":"user"},"name":"Nguyen Bach","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:18:30.800Z","hidden":false},{"_id":"66271d7a322746c7bfca83af","user":{"_id":"604cfaaad8a4193fd3a928ef","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1615657716895-604cfaaad8a4193fd3a928ef.jpeg","isPro":false,"fullname":"Amit Bahree","user":"bahree","type":"user"},"name":"Amit Bahree","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:18:40.847Z","hidden":false},{"_id":"66271d7a322746c7bfca83b0","user":{"_id":"6553cda0c81411e2aaf930b1","avatarUrl":"/avatars/291e4e9cef092d649ab734a3f09a3af1.svg","isPro":false,"fullname":"Arash Bakhtiari","user":"arashb","type":"user"},"name":"Arash Bakhtiari","status":"claimed_verified","statusLastChangedAt":"2024-04-24T11:11:57.013Z","hidden":false},{"_id":"66271d7a322746c7bfca83b1","user":{"_id":"63c6aa9e656e7822e2359d9c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63c6aa9e656e7822e2359d9c/4rn_faJ23m_a6bevXEDG_.jpeg","isPro":false,"fullname":"Harkirat Behl","user":"Harkirat","type":"user"},"name":"Harkirat 
Behl","status":"claimed_verified","statusLastChangedAt":"2024-04-23T07:13:54.219Z","hidden":false},{"_id":"66271d7a322746c7bfca83b2","user":{"_id":"65b9b627e7c838136275a681","avatarUrl":"/avatars/22423f3d9a6c4ee34cad3b0894d27d23.svg","isPro":false,"fullname":"Alon Benhaim","user":"alonbenhaim","type":"user"},"name":"Alon Benhaim","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:19:30.493Z","hidden":false},{"_id":"66271d7a322746c7bfca83b3","name":"Misha Bilenko","hidden":false},{"_id":"66271d7a322746c7bfca83b4","user":{"_id":"65fdfbbbc91ba4c08ae76db9","avatarUrl":"/avatars/6a44dcf01a531966806f7de9df94178b.svg","isPro":false,"fullname":"johan bjorck","user":"njb-ms","type":"user"},"name":"Johan Bjorck","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:19:54.353Z","hidden":false},{"_id":"66271d7a322746c7bfca83b5","user":{"_id":"64557408c9c0dcc8c24a7a92","avatarUrl":"/avatars/911ba8fc3c224578d5946dddd94bf4c0.svg","isPro":false,"fullname":"Sebastien Bubeck","user":"sebubeck","type":"user"},"name":"Sรฉbastien Bubeck","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:20:04.354Z","hidden":false},{"_id":"66271d7a322746c7bfca83b6","user":{"_id":"63599df91a0dca1aa2497470","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1666817522152-noauth.png","isPro":false,"fullname":"Martin Cai","user":"cailang","type":"user"},"name":"Martin Cai","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:20:23.447Z","hidden":false},{"_id":"66271d7a322746c7bfca83b7","user":{"_id":"66269c058f7573e6a6488532","avatarUrl":"/avatars/5d47ce0eaba09b385596ad5fd97f8ef7.svg","isPro":false,"fullname":"Caio Mendes","user":"caiomms","type":"user"},"name":"Caio Cรฉsar Teodoro Mendes","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:20:59.498Z","hidden":false},{"_id":"66271d7a322746c7bfca83b8","user":{"_id":"64da876370446182be5b608d","avatarUrl":"/avatars/e412fdc71404ecdf638e416846e3ebfb.svg","isPro":false,"fullname":"Weizhu 
Chen","user":"chenweizhu","type":"user"},"name":"Weizhu Chen","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:21:07.482Z","hidden":false},{"_id":"66271d7a322746c7bfca83b9","user":{"_id":"659c7ac977ac6f1bf5e63d7e","avatarUrl":"/avatars/86a6efde0d483564a67ed5f344d479a0.svg","isPro":false,"fullname":"Vishrav Chaudhary","user":"vishravmsft","type":"user"},"name":"Vishrav Chaudhary","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:21:31.901Z","hidden":false},{"_id":"66271d7a322746c7bfca83ba","user":{"_id":"65fe033542e4b66cd59bbc33","avatarUrl":"/avatars/71a5d01b83f36d2926aef9aed624b953.svg","isPro":false,"fullname":"Parul Chopra","user":"Parul09","type":"user"},"name":"Parul Chopra","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:21:38.415Z","hidden":false},{"_id":"66271d7a322746c7bfca83bb","user":{"_id":"63f5562471a5d395c721cd8e","avatarUrl":"/avatars/ac296b6017e86ea04c73803fe2c44433.svg","isPro":false,"fullname":"Allie Del Giorno","user":"microallie","type":"user"},"name":"Allie Del Giorno","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:21:53.089Z","hidden":false},{"_id":"66271d7a322746c7bfca83bc","user":{"_id":"6157454831624da88210e627","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1666203761402-6157454831624da88210e627.jpeg","isPro":false,"fullname":"Gustavo de Rosa","user":"gugarosa","type":"user"},"name":"Gustavo de Rosa","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:22:00.600Z","hidden":false},{"_id":"66271d7a322746c7bfca83bd","user":{"_id":"6316cbde29411a6864b9a15c","avatarUrl":"/avatars/649e713b17871a18f55ea8f167860ae3.svg","isPro":false,"fullname":"Matthew Dixon","user":"mmdixon","type":"user"},"name":"Matthew 
Dixon","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:22:06.151Z","hidden":false},{"_id":"66271d7a322746c7bfca83be","user":{"_id":"633723a80267ebcf0264c06b","avatarUrl":"/avatars/22bb971597e9f3abfa343280a9d0f65f.svg","isPro":false,"fullname":"Ronen Eldan","user":"roneneldan","type":"user"},"name":"Ronen Eldan","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:22:13.165Z","hidden":false},{"_id":"66271d7a322746c7bfca83bf","user":{"_id":"6604959a5186899ca9d62d27","avatarUrl":"/avatars/32416322dc3fd3b057d2cdd78750cdff.svg","isPro":false,"fullname":"Dan I","user":"daniter-msft","type":"user"},"name":"Dan Iter","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:22:22.938Z","hidden":false},{"_id":"66271d7a322746c7bfca83c0","user":{"_id":"62cdae333529c21a2283a0a1","avatarUrl":"/avatars/cafc2821e522bbd06d49830e36a073e3.svg","isPro":false,"fullname":"Abhishek GOSWAMI","user":"abgoswam","type":"user"},"name":"Abhishek Goswami","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:22:39.305Z","hidden":false},{"_id":"66271d7a322746c7bfca83c1","user":{"_id":"63e1b4f77fbb6ae4d4f36aa4","avatarUrl":"/avatars/8e03bf9143be5e6456cc8e732ed3daaf.svg","isPro":false,"fullname":"Suriya Gunasekar","user":"suriyagunasekar","type":"user"},"name":"Suriya Gunasekar","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:22:47.893Z","hidden":false},{"_id":"66271d7a322746c7bfca83c2","name":"Emman Haider","hidden":false},{"_id":"66271d7a322746c7bfca83c3","user":{"_id":"5f04c4394ec31d33a72116d6","avatarUrl":"/avatars/75d4b9020070e73604b12e5adc1c8201.svg","isPro":false,"fullname":"Junheng Hao","user":"jeffhao","type":"user"},"name":"Junheng Hao","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:22:59.089Z","hidden":false},{"_id":"66271d7a322746c7bfca83c4","name":"Russell J. 
Hewett","hidden":false},{"_id":"66271d7a322746c7bfca83c5","name":"Jamie Huynh","hidden":false},{"_id":"66271d7a322746c7bfca83c6","user":{"_id":"643dbfa31e5be78c66440fcf","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/643dbfa31e5be78c66440fcf/VtLrAlPY_4ncVrM4eL7V-.jpeg","isPro":false,"fullname":"Mojan Javaheripi","user":"mojanjp","type":"user"},"name":"Mojan Javaheripi","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:23:17.966Z","hidden":false},{"_id":"66271d7a322746c7bfca83c7","name":"Xin Jin","hidden":false},{"_id":"66271d7a322746c7bfca83c8","name":"Piero Kauffmann","hidden":false},{"_id":"66271d7a322746c7bfca83c9","user":{"_id":"64d6785b089bc502cea059be","avatarUrl":"/avatars/79a504d2b66aa256327c8d11243d41b0.svg","isPro":false,"fullname":"Nikos Karampatziakis","user":"n17s","type":"user"},"name":"Nikos Karampatziakis","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:23:34.411Z","hidden":false},{"_id":"66271d7a322746c7bfca83ca","user":{"_id":"662476aec8920ec351b8d3d8","avatarUrl":"/avatars/791e40f53073563680ef18f75b3ea95e.svg","isPro":false,"fullname":"Dongwoo Kim","user":"dongwookim-ms","type":"user"},"name":"Dongwoo Kim","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:23:48.587Z","hidden":false},{"_id":"66271d7a322746c7bfca83cb","name":"Mahoud Khademi","hidden":false},{"_id":"66271d7a322746c7bfca83cc","user":{"_id":"635852141d66b442317cbe26","avatarUrl":"/avatars/bcf9164f6ac6393ffecc5f4383d12cbe.svg","isPro":false,"fullname":"Lev Kurilenko","user":"lekurile","type":"user"},"name":"Lev Kurilenko","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:24:00.911Z","hidden":false},{"_id":"66271d7a322746c7bfca83cd","name":"James R. 
Lee","hidden":false},{"_id":"66271d7a322746c7bfca83ce","user":{"_id":"6303297604c75db08b8972e2","avatarUrl":"/avatars/693b2dada2244ba5d97df9518f473ccb.svg","isPro":false,"fullname":"Yin Tat Lee","user":"yintat","type":"user"},"name":"Yin Tat Lee","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:24:21.143Z","hidden":false},{"_id":"66271d7a322746c7bfca83cf","user":{"_id":"641d5a313f04f9bf2da17e8c","avatarUrl":"/avatars/4035f73c69ddb045573ad96db4f13f04.svg","isPro":false,"fullname":"Yuanzhi Li","user":"Uushizhu1234","type":"user"},"name":"Yuanzhi Li","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:24:45.250Z","hidden":false},{"_id":"66271d7a322746c7bfca83d0","user":{"_id":"63e6b5e22d2c508de9001afd","avatarUrl":"/avatars/43cec9e8b8d490bd259e383954846a1e.svg","isPro":false,"fullname":"Chen Liang","user":"cliang1453","type":"user"},"name":"Chen Liang","status":"claimed_verified","statusLastChangedAt":"2024-06-02T18:48:02.336Z","hidden":false},{"_id":"66271d7a322746c7bfca83d1","user":{"_id":"637c3ac41c13d2970bf67625","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/637c3ac41c13d2970bf67625/K_KEPoWjj4k7EndHeU2Jt.png","isPro":false,"fullname":"Weishung Liu ","user":"weish","type":"user"},"name":"Weishung Liu","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:24:59.941Z","hidden":false},{"_id":"66271d7a322746c7bfca83d2","name":"Eric Lin","hidden":false},{"_id":"66271d7a322746c7bfca83d3","name":"Zeqi Lin","hidden":false},{"_id":"66271d7a322746c7bfca83d4","user":{"_id":"66269a329014ef4d10f55d9d","avatarUrl":"/avatars/d4866c32419a7dd07e9aa0660f4bafa9.svg","isPro":false,"fullname":"Piyush Madan","user":"PiyushMadan","type":"user"},"name":"Piyush Madan","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:25:40.367Z","hidden":false},{"_id":"66271d7a322746c7bfca83d5","name":"Arindam 
Mitra","hidden":false},{"_id":"66271d7a322746c7bfca83d6","user":{"_id":"617c70112ee1adbd7739e7d5","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1635545103491-noauth.jpeg","isPro":false,"fullname":"Hardik Modi","user":"hardikmodi","type":"user"},"name":"Hardik Modi","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:26:10.428Z","hidden":false},{"_id":"66271d7a322746c7bfca83d7","user":{"_id":"649bc84833486cdd77c01c66","avatarUrl":"/avatars/36f4e4bb15c337c4391bfbd234051f4c.svg","isPro":false,"fullname":"Nguyen Anh","user":"Anhnguyen","type":"user"},"name":"Anh Nguyen","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:26:19.012Z","hidden":false},{"_id":"66271d7a322746c7bfca83d8","user":{"_id":"63200bb0b45367a05fef3049","avatarUrl":"/avatars/13e7a5c84d253c74112b3a6b058ae260.svg","isPro":false,"fullname":"Brandon Norick","user":"bnorick","type":"user"},"name":"Brandon Norick","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:26:45.406Z","hidden":false},{"_id":"66271d7a322746c7bfca83d9","name":"Barun Patra","hidden":false},{"_id":"66271d7a322746c7bfca83da","name":"Daniel Perez-Becker","hidden":false},{"_id":"66271d7a322746c7bfca83db","user":{"_id":"65c52dad286bf45e79491697","avatarUrl":"/avatars/01ebc7979273df6e53971ae9835b503f.svg","isPro":false,"fullname":"Thomas Portet","user":"thopo","type":"user"},"name":"Thomas Portet","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:34:57.983Z","hidden":false},{"_id":"66271d7a322746c7bfca83dc","user":{"_id":"64e3d38c618cd90997ee0138","avatarUrl":"/avatars/847210a609b156fdb9d824cc815bde27.svg","isPro":false,"fullname":"Reid Pryzant","user":"rpryzant","type":"user"},"name":"Reid Pryzant","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:34:50.855Z","hidden":false},{"_id":"66271d7a322746c7bfca83dd","user":{"_id":"63bc6a611374e3ef9134a24e","avatarUrl":"/avatars/9df037b9cf7d7952a10d919e24a2ffa2.svg","isPro":false,"fullname":"Heyang 
Qin","user":"heyangqin","type":"user"},"name":"Heyang Qin","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:34:43.485Z","hidden":false},{"_id":"66271d7a322746c7bfca83de","name":"Marko Radmilac","hidden":false},{"_id":"66271d7a322746c7bfca83df","user":{"_id":"64ec43147e2ec711a761c594","avatarUrl":"/avatars/e4fc3de141ff026eee6eaa0b91d34ff1.svg","isPro":false,"fullname":"corby","user":"corbyrosset","type":"user"},"name":"Corby Rosset","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:34:30.505Z","hidden":false},{"_id":"66271d7a322746c7bfca83e0","name":"Sambudha Roy","hidden":false},{"_id":"66271d7a322746c7bfca83e1","user":{"_id":"6477a7108ab7e732b6d86701","avatarUrl":"/avatars/3319db398ce8142bd7d61de1d2cd1b4b.svg","isPro":false,"fullname":"Olli Saarikivi","user":"olsaarik","type":"user"},"name":"Olli Saarikivi","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:34:13.500Z","hidden":false},{"_id":"66271d7a322746c7bfca83e2","name":"Amin Saied","hidden":false},{"_id":"66271d7a322746c7bfca83e3","name":"Adil Salim","hidden":false},{"_id":"66271d7a322746c7bfca83e4","user":{"_id":"6626a4371ee17eac05a311ee","avatarUrl":"/avatars/0e0af517b715af3a7a4113855448702a.svg","isPro":false,"fullname":"Michael Santacroce","user":"misantac","type":"user"},"name":"Michael Santacroce","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:33:54.298Z","hidden":false},{"_id":"66271d7a322746c7bfca83e5","user":{"_id":"5fea740dbb3114fa14c82934","avatarUrl":"/avatars/5f9fe9aaf74a3b930b6fcebcf8a4e4cc.svg","isPro":false,"fullname":"Shital Shah","user":"sytelus","type":"user"},"name":"Shital Shah","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:33:36.790Z","hidden":false},{"_id":"66271d7a322746c7bfca83e6","name":"Ning Shang","hidden":false},{"_id":"66271d7a322746c7bfca83e7","name":"Hiteshi 
Sharma","hidden":false},{"_id":"66271d7a322746c7bfca83e8","user":{"_id":"630540f999870e13d3ddb997","avatarUrl":"/avatars/7dc53475142c0c5c3a04d7476682269d.svg","isPro":false,"fullname":"Xia Song","user":"ssaaxx","type":"user"},"name":"Xia Song","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:32:42.218Z","hidden":false},{"_id":"66271d7a322746c7bfca83e9","user":{"_id":"6499d685ea2cdac80992e742","avatarUrl":"/avatars/c32741e7fc57c9a08722fab3877a7b81.svg","isPro":false,"fullname":"Olatunji Ruwase","user":"tjruwase","type":"user"},"name":"Olatunji Ruwase","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:32:31.128Z","hidden":false},{"_id":"66271d7a322746c7bfca83ea","name":"Xin Wang","hidden":false},{"_id":"66271d7a322746c7bfca83eb","user":{"_id":"6626ab758a19c27042ed5e73","avatarUrl":"/avatars/757f1fda0ce88a344d85bde05e468d9c.svg","isPro":false,"fullname":"Rachel Ward","user":"rward314","type":"user"},"name":"Rachel Ward","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:32:20.827Z","hidden":false},{"_id":"66271d7a322746c7bfca83ec","name":"Guanhua Wang","hidden":false},{"_id":"66271d7a322746c7bfca83ed","user":{"_id":"64fcf02401aedd0e86e4b933","avatarUrl":"/avatars/602e8081d396101f537f773d873eb41b.svg","isPro":false,"fullname":"Philipp Witte","user":"philippwitte","type":"user"},"name":"Philipp Witte","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:30:48.497Z","hidden":false},{"_id":"66271d7a322746c7bfca83ee","user":{"_id":"62910c52d1be2630c0d38a72","avatarUrl":"/avatars/40a72a10e3ea17809a09bde77360c194.svg","isPro":false,"fullname":"Michael Wyatt","user":"mwyatt","type":"user"},"name":"Michael Wyatt","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:30:36.125Z","hidden":false},{"_id":"66271d7a322746c7bfca83ef","name":"Can 
Xu","hidden":false},{"_id":"66271d7a322746c7bfca83f0","user":{"_id":"62abdf657b037eafffc48808","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1655430982462-noauth.jpeg","isPro":false,"fullname":"Jiahang Xu","user":"Jiahang","type":"user"},"name":"Jiahang Xu","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:30:06.292Z","hidden":false},{"_id":"66271d7a322746c7bfca83f1","name":"Sonali Yadav","hidden":false},{"_id":"66271d7a322746c7bfca83f2","name":"Fan Yang","hidden":false},{"_id":"66271d7a322746c7bfca83f3","user":{"_id":"646be72031968a60a021ddb3","avatarUrl":"/avatars/79f3766cf404fd180274dbc86ea4dac6.svg","isPro":false,"fullname":"Ziyi Yang","user":"Ziyi-Yang","type":"user"},"name":"Ziyi Yang","status":"claimed_verified","statusLastChangedAt":"2024-07-03T07:41:36.627Z","hidden":false},{"_id":"66271d7a322746c7bfca83f4","user":{"_id":"65b01b8a29ae836e9ed5af24","avatarUrl":"/avatars/a8b78a4b54d3f10858c5925521357001.svg","isPro":false,"fullname":"Donghan Yu","user":"donghanyu","type":"user"},"name":"Donghan Yu","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:28:53.970Z","hidden":false},{"_id":"66271d7a322746c7bfca83f5","user":{"_id":"64646896884f2e3e1ced3cd5","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64646896884f2e3e1ced3cd5/86-t8V8LGMNaPQRXnADiD.png","isPro":false,"fullname":"Zhang","user":"Chengruidong","type":"user"},"name":"Chengruidong Zhang","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:28:36.041Z","hidden":false},{"_id":"66271d7a322746c7bfca83f6","user":{"_id":"63a6a3a93d8cc9b1183c6593","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63a6a3a93d8cc9b1183c6593/UF-9ostqBHiAGDOuEtIw_.png","isPro":false,"fullname":"Cyril Zhang","user":"cyrilzhang","type":"user"},"name":"Cyril 
Zhang","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:28:23.642Z","hidden":false},{"_id":"66271d7a322746c7bfca83f7","user":{"_id":"63601ee38fb9c2420ffbe45d","avatarUrl":"/avatars/56af091aaff1b42dcfbae84a6ee1e7f7.svg","isPro":true,"fullname":"Jianwen Zhang","user":"jianwenzh","type":"user"},"name":"Jianwen Zhang","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:28:15.724Z","hidden":false},{"_id":"66271d7a322746c7bfca83f8","user":{"_id":"62b0009c72043b05d29492b2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/62b0009c72043b05d29492b2/NqRkX2YLhlfOLvYysa7dD.png","isPro":false,"fullname":"Li Lyna Zhang","user":"lynazhang","type":"user"},"name":"Li Lyna Zhang","status":"admin_assigned","statusLastChangedAt":"2024-04-23T07:28:03.688Z","hidden":false},{"_id":"66271d7a322746c7bfca83f9","name":"Yi Zhang","hidden":false},{"_id":"66271d7a322746c7bfca83fa","user":{"_id":"6367f0be856a92b278915bb0","avatarUrl":"/avatars/6b36f53c74f0fa559aca5d4a8e3ed585.svg","isPro":false,"fullname":"yunan","user":"yunan","type":"user"},"name":"Yunan Zhang","status":"claimed_verified","statusLastChangedAt":"2024-04-24T11:11:55.084Z","hidden":false},{"_id":"66271d7a322746c7bfca83fb","user":{"_id":"66ce4c9f864befb39cfc74e9","avatarUrl":"/avatars/ef66398466c470fc1d384c6817d9e461.svg","isPro":false,"fullname":"Xiren Zhou","user":"XirenZhou","type":"user"},"name":"Xiren Zhou","status":"claimed_verified","statusLastChangedAt":"2024-08-28T08:01:22.806Z","hidden":false}],"publishedAt":"2024-04-22T14:32:33.000Z","submittedOnDailyAt":"2024-04-23T01:01:22.853Z","title":"Phi-3 Technical Report: A Highly Capable Language Model Locally on Your\n Phone","submittedOnDailyBy":{"_id":"60f1abe7544c2adfd699860c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674929746905-60f1abe7544c2adfd699860c.jpeg","isPro":false,"fullname":"AK","user":"akhaliq","type":"user"},"summary":"We introduce phi-3-mini, a 3.8 billion parameter language model 
trained on\n3.3 trillion tokens, whose overall performance, as measured by both academic\nbenchmarks and internal testing, rivals that of models such as Mixtral 8x7B and\nGPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite\nbeing small enough to be deployed on a phone. The innovation lies entirely in\nour dataset for training, a scaled-up version of the one used for phi-2,\ncomposed of heavily filtered web data and synthetic data. The model is also\nfurther aligned for robustness, safety, and chat format. We also provide some\ninitial parameter-scaling results with a 7B and 14B models trained for 4.8T\ntokens, called phi-3-small and phi-3-medium, both significantly more capable\nthan phi-3-mini (e.g., respectively 75% and 78% on MMLU, and 8.7 and 8.9 on\nMT-bench).","upvotes":259,"discussionId":"66271d7a322746c7bfca8423","ai_summary":"Phi-3-mini, a compact 3.8 billion parameter language model, achieves competitive performance with larger models through an enhanced training dataset and alignment.","ai_keywords":["language model","MMLU","MT-bench","robustness","safety","chat format","parameter-scaling"],"organization":{"_id":"5e6485f787403103f9f1055e","name":"microsoft","fullname":"Microsoft","avatar":"https://cdn-uploads.huggingface.co/production/uploads/1583646260758-5e64858c87403103f9f1055d.png"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"62e0ef42edb0462c8d51818d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/62e0ef42edb0462c8d51818d/3YM7DUynIWiiRFM6_enpg.jpeg","isPro":false,"fullname":"Ting-En Lin","user":"tnlin","type":"user"},{"_id":"63869d1e81fe8c678a3a9422","avatarUrl":"/avatars/3bb8728057fa2ba0e24f5ceb1600068d.svg","isPro":true,"fullname":"Zach 
Mustafa","user":"Zmu","type":"user"},{"_id":"64747f7e33192631bacd8831","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64747f7e33192631bacd8831/dstkZJ4sHJSeqLesV5cOC.jpeg","isPro":false,"fullname":"Taufiq Dwi Purnomo","user":"taufiqdp","type":"user"},{"_id":"655ac762cb17ec19ef82719b","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/655ac762cb17ec19ef82719b/1kDncYrGLYS_2SR8cNdAL.png","isPro":false,"fullname":"Welcome to matlok","user":"matlok","type":"user"},{"_id":"64639c336c27a7e33b26cbff","avatarUrl":"/avatars/fe27e05e0994e9cf0639ab0afeca85af.svg","isPro":false,"fullname":"Wen Sun","user":"HermitSun","type":"user"},{"_id":"620783f24e28382272337ba4","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/620783f24e28382272337ba4/zkUveQPNiDfYjgGhuFErj.jpeg","isPro":false,"fullname":"GuoLiangTang","user":"Tommy930","type":"user"},{"_id":"631ece54c1a8269da391efe9","avatarUrl":"/avatars/eea8d90e514d8b011d1549d401bd9f9f.svg","isPro":false,"fullname":"Dhruvajyoti Sarma","user":"dhruva-sarma","type":"user"},{"_id":"636cb70fcfb49b46821e0870","avatarUrl":"/avatars/fdf7670a6536e1318ea9d66666c904f3.svg","isPro":false,"fullname":"mark","user":"LeeJo","type":"user"},{"_id":"63b7b2c6bd2d153522821766","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63b7b2c6bd2d153522821766/aHtga-_OUdOrg_TRrXO08.jpeg","isPro":false,"fullname":"Mu Cai","user":"mucai","type":"user"},{"_id":"635cada2c017767a629db012","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1667018139063-noauth.jpeg","isPro":false,"fullname":"Ojasvi Singh Yadav","user":"ojasvisingh786","type":"user"},{"_id":"64b02ec0e5000ae8a572ced5","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64b02ec0e5000ae8a572ced5/6ifLntBU2ICQK7SW8WxKU.png","isPro":false,"fullname":"Lin 
Chen","user":"Lin-Chen","type":"user"},{"_id":"644f10d267a3dd3d072a2669","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/644f10d267a3dd3d072a2669/r7AA6gkm-AuQLHQ77G7d5.png","isPro":false,"fullname":"Neil Van","user":"nvhf","type":"user"}],"acceptLanguages":["*"],"dailyPaperRank":1,"organization":{"_id":"5e6485f787403103f9f1055e","name":"microsoft","fullname":"Microsoft","avatar":"https://cdn-uploads.huggingface.co/production/uploads/1583646260758-5e64858c87403103f9f1055d.png"}}">
Papers
arxiv:2404.14219

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Published on Apr 22, 2024 · Submitted by AK on Apr 23, 2024 · #1 Paper of the day

Abstract

Phi-3-mini, a compact 3.8 billion parameter language model, achieves competitive performance with larger models through an enhanced training dataset and alignment.

AI-generated summary

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset for training, a scaled-up version of the one used for phi-2, composed of heavily filtered web data and synthetic data. The model is also further aligned for robustness, safety, and chat format. We also provide some initial parameter-scaling results with a 7B and 14B models trained for 4.8T tokens, called phi-3-small and phi-3-medium, both significantly more capable than phi-3-mini (e.g., respectively 75% and 78% on MMLU, and 8.7 and 8.9 on MT-bench).

Community

Nice! When will the weights be released under an open source license?

Also important: could we also get the 3.3T token dataset? 🤗 Pretty please

pocket-size gpt-3.5?
14B matching GPT-4-0314 MT-Bench?
<3

weights or it didn't happen. Also, please make it apache 2.0.

Weights?

Very cool! Would be awesome to increase visibility + experimentation by sharing the weights as well 🤗

This is incredible. When will we see the weights?

So great to see the successor of Phi-1.5/2 – Looking forward to being able to play with the model and embed it locally everywhere!

Weights Please 🙏

These SLMs keep getting better. It would be cool to get an APK to actually run them on mobile devices without Termux. The two existing apps that I know of have limited model support, and GPT-3.5-level model quality is a good occasion to wrap one up in an app.

I agree that SLMs probably need more focus and have the potential to make great strides on multiple fronts, be it accessibility, deployability, inference speed, or new use cases. Of course, it means putting more effort into dataset curation and maybe even the architecture. The Phi series is proof that focused data curation alone can improve performance quite a bit.

Given recent events, I don't think the weights will be available, and forget about the dataset. Even if the weights are released, they will be taken down the next day for some testing or alignment or some other stuff, never to return. Great job guys!!

I'm not sure what recent events you're referring to. I'll wait for the official statement before jumping to conclusions.

When will the model be released?

Llama was never competitive, Llama 2 got beaten by Mistral within a few weeks, Llama 3 got beaten by Phi-3 within a few days?
Damn, if this is true, Zuck might start to get seriously mad... (even if Phi is using Llama 2)

Here's a quick walkthrough of the paper: https://huggingface.co/posts/Jaward/284702584639894

I saw the weights are coming tomorrow (on Twitter, hopefully it's legit!). In any case, there's a plain-English rewrite of this paper available here if you want: https://www.aimodels.fyi/papers/arxiv/phi-3-technical-report-highly-capable-language

Surely the first reference needs fixing to say 2024 and to use capital letters in the right places?
Currently says: "References
[AI23] Meta AI. Introducing meta llama 3: The most capable openly available llm to date, 2023."

Surely it should be: "[AI23] Meta AI. Introducing Meta Llama 3: The most capable openly available LLM to date, 2024."?
@gugarosa
So looking forward to playing with this, well done all!

How did you "filter" data for Phase-1 and 2? Was it manual? How did you ensure if it was automated?

Also, what were the criteria for "inducing reasoning" on the dataset and web?

It is now available on HuggingChat 🔥 https://huggingface.co/chat/models/microsoft/Phi-3-mini-4k-instruct

For some reason the one on HuggingChat gives me crappy answers, e.g.:
Q: what files are needed for a chrome extension, what are their names?
A: To add unit tests for the URLAnalyzer class, you'll need to set up a testing framework like Jest. Here's an example of how you might write tests for the waitForClassPresence and analyzeUrl met [...] (and so on, completely unrelated junk)

I tried the GGUF Q4 version in GPT4All and got much better results; the only issue is with the stop token.

Would love to see all models in this family on LMSYS arena!

Arena is like double-blind peer review plus randomized controlled trials in science! The gold standard for judging something. I hope an API provider like Together API will offer inference services for this family of models, both for all of us and for Arena!

Does it have any tuning for function calling? What dataset was used or how to fine tune it for agent applications?

Wow, people are actually starting to use HF comments now, really cool 😎

Being able to fine-tune it for JSON mode and use it on mobile will have a very nice impact!
It opens up so many opportunities for agents on GPU-poor devices.

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any paper on Hugging Face, check out this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

phi-4 will run on a toaster

This very high performance on some benchmarks (the paper claims performance rivaling that of Mixtral 8x7B) seems suspicious, given that the model scores way lower on Chatbot Arena: it has an Elo of 1064 as of now, so it's good but below Mistral 7B-Instruct-0.2 (1073), and far below Mixtral (1114).
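
For a rough sense of what these rating gaps mean head to head, the standard Elo expected-score formula can be applied to the numbers quoted above. This is only a minimal sketch: `expected_score` is a hypothetical helper, and it assumes the Arena ratings follow the usual base-10, 400-point logistic scale.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score (roughly, win probability) of A against B.

    Assumes the conventional Elo scale: a 400-point gap means 10:1 odds.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / 400.0))


if __name__ == "__main__":
    # Ratings as quoted in the comment above.
    phi3_mini = 1064
    mistral_7b_instruct = 1073
    mixtral_8x7b = 1114

    for name, rating in [
        ("Mistral 7B-Instruct-0.2", mistral_7b_instruct),
        ("Mixtral 8x7B", mixtral_8x7b),
    ]:
        p = expected_score(phi3_mini, rating)
        print(f"Phi-3-mini vs {name}: expected score {p:.3f}")
```

Under that assumption, the 50-point deficit against Mixtral corresponds to winning roughly 43% of pairwise comparisons, while the 9-point deficit against Mistral 7B-Instruct-0.2 is close to a coin flip.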

I've never seen so many comments on an HF paper before.

Yeah, people don't want to understand that we don't want big open-source LLMs but decent-sized ones...

Unleashing Phi-3-mini: Powerful AI on Your Phone

Links 🔗:

👉 Subscribe: https://www.youtube.com/@Arxflix
👉 Twitter: https://x.com/arxflix
👉 LMNT (Partner): https://lmnt.com/

By Arxflix