Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
Paper page - PaperBanana: Automating Academic Illustration for AI Scientists
[go: Go Back, main page]

Librarian Bot. I found the following papers similar to this paper.

\n

The following papers were recommended by the Semantic Scholar API

\n\n

Please give a thumbs up to this comment if you found it helpful!

\n

If you want recommendations for any Paper on Hugging Face checkout this Space

\n

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: \n\n@librarian-bot\n\t recommend

\n","updatedAt":"2026-02-03T01:39:15.131Z","author":{"_id":"63d3e0e8ff1384ce6c5dd17d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg","fullname":"Librarian Bot (Bot)","name":"librarian-bot","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":318,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7220591902732849},"editors":["librarian-bot"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg"],"reactions":[{"reaction":"๐Ÿ‘","users":["mobrown"],"count":1}],"isReport":false}},{"id":"698217b82650edae2abc0b54","author":{"_id":"62d648291fa3e4e7ae3fa6e8","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/62d648291fa3e4e7ae3fa6e8/oatOwf8Xqe5eDbCSuYqCd.png","fullname":"ben burtenshaw","name":"burtenshaw","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":4318,"isUserFollowing":false},"createdAt":"2026-02-03T15:43:52.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Amazing work! ๐Ÿš€ \n\nIt would be very cool to get a space demo of paperbanana so we can understand and try out how the vlms and generators are orchestrated.","html":"

Amazing work! ๐Ÿš€

\n

It would be very cool to get a space demo of paperbanana so we can understand and try out how the vlms and generators are orchestrated.

\n","updatedAt":"2026-02-03T15:43:52.337Z","author":{"_id":"62d648291fa3e4e7ae3fa6e8","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/62d648291fa3e4e7ae3fa6e8/oatOwf8Xqe5eDbCSuYqCd.png","fullname":"ben burtenshaw","name":"burtenshaw","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":4318,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9498290419578552},"editors":["burtenshaw"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/62d648291fa3e4e7ae3fa6e8/oatOwf8Xqe5eDbCSuYqCd.png"],"reactions":[{"reaction":"๐Ÿš€","users":["sergiopaniego","Kamal0303","YellowjacketGames"],"count":3},{"reaction":"๐Ÿ”ฅ","users":["sergiopaniego"],"count":1},{"reaction":"๐Ÿ‘","users":["tyb343"],"count":1}],"isReport":false},"replies":[{"id":"698366723a4fa563d27e2f72","author":{"_id":"6947f69751d7ae7c3c7b6908","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/PuIDZB9XDShHohKhYmdmp.png","fullname":"Ben Kelly","name":"YellowjacketGames","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false},"createdAt":"2026-02-04T15:32:02.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"someone should take inspiration from the twitter account \"science diagrams that look like s***posts\" and make a meme generator using this.","html":"

someone should take inspiration from the twitter account \"science diagrams that look like s***posts\" and make a meme generator using this.

\n","updatedAt":"2026-02-04T15:32:02.202Z","author":{"_id":"6947f69751d7ae7c3c7b6908","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/PuIDZB9XDShHohKhYmdmp.png","fullname":"Ben Kelly","name":"YellowjacketGames","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8717743158340454},"editors":["YellowjacketGames"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/PuIDZB9XDShHohKhYmdmp.png"],"reactions":[{"reaction":"โค๏ธ","users":["dippatel1994"],"count":1}],"isReport":false,"parentCommentId":"698217b82650edae2abc0b54"}},{"id":"6984875c8db89f0704e81081","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false},"createdAt":"2026-02-05T12:04:44.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"We can create a simple Gradio app here in space where a user can pass their Gemini API key and things would work. Nice idea, but alternatively try adding paperbanana in Claude code/cursor. MCP server/skills are supported now.\n\nDisclaimer: This is not an official implementation, but I tried to implement it as close as possible. Just couldn't mimic those ~230 examples the research team added. However, with the help of the open-source community, we can add even more examples from diverse backgrounds (e.g., biology, AI research, math papers) and a nice retrieval mechanism to surpass the performance reported in the paper! \n\nMore on this here - https://github.com/llmsresearch/paperbanana/wiki","html":"

We can create a simple Gradio app here in space where a user can pass their Gemini API key and things would work. Nice idea, but alternatively try adding paperbanana in Claude code/cursor. MCP server/skills are supported now.

\n

Disclaimer: This is not an official implementation, but I tried to implement it as close as possible. Just couldn't mimic those ~230 examples the research team added. However, with the help of the open-source community, we can add even more examples from diverse backgrounds (e.g., biology, AI research, math papers) and a nice retrieval mechanism to surpass the performance reported in the paper!

\n

More on this here - https://github.com/llmsresearch/paperbanana/wiki

\n","updatedAt":"2026-02-05T12:04:44.013Z","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.922322690486908},"editors":["dippatel1994"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png"],"reactions":[],"isReport":false,"parentCommentId":"698217b82650edae2abc0b54"}},{"id":"6984882d9b6e6bbead24f460","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false},"createdAt":"2026-02-05T12:08:13.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Check on project page a sample output for the quality check. Will add some more examples for quick validation.\n","html":"

Check on project page a sample output for the quality check. Will add some more examples for quick validation.

\n","updatedAt":"2026-02-05T12:08:13.877Z","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7457436919212341},"editors":["dippatel1994"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png"],"reactions":[],"isReport":false,"parentCommentId":"698217b82650edae2abc0b54"}},{"id":"6984d652dfab2d63f834c124","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false},"createdAt":"2026-02-05T17:41:38.000Z","type":"comment","data":{"edited":true,"hidden":false,"latest":{"raw":"@burtenshaw here you go! Created a playground to try it. Just bring your own Gemini API key and test it directly in the space below. Right now it uses the Gemini 2.0 Flash model. Iโ€™ll add an option to switch models soon, but this is a great place to start experimenting.\n\nTry it here: - https://huggingface.co/spaces/dippatel1994/paperbanana","html":"

\n\n@burtenshaw\n\t here you go! Created a playground to try it. Just bring your own Gemini API key and test it directly in the space below. Right now it uses the Gemini 2.0 Flash model. Iโ€™ll add an option to switch models soon, but this is a great place to start experimenting.

\n

Try it here: - https://huggingface.co/spaces/dippatel1994/paperbanana

\n","updatedAt":"2026-02-05T17:42:44.014Z","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.8138831853866577},"editors":["dippatel1994"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png"],"reactions":[],"isReport":false,"parentCommentId":"698217b82650edae2abc0b54"}}]},{"id":"69830c91cbcea27a63c0a9f4","author":{"_id":"679240f4bf5cc40508f460bb","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/8HGZry14fzgAn5pVWNbi3.jpeg","fullname":"Krishn Jatav","name":"krishnjatav5","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2026-02-04T09:08:33.000Z","type":"comment","data":{"edited":true,"hidden":false,"latest":{"raw":"\n","html":"","updatedAt":"2026-02-04T09:08:49.416Z","author":{"_id":"679240f4bf5cc40508f460bb","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/8HGZry14fzgAn5pVWNbi3.jpeg","fullname":"Krishn Jatav","name":"krishnjatav5","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.36188793182373047},"editors":["krishnjatav5"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/8HGZry14fzgAn5pVWNbi3.jpeg"],"reactions":[],"isReport":false}},{"id":"698365aaa94181edfd5df306","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false},"createdAt":"2026-02-04T15:28:42.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Released unofficial implementation with MCP server support: https://github.com/llmsresearch/paperbanana\nWe can use it until we have an official version from the Google research team.","html":"

Released unofficial implementation with MCP server support: https://github.com/llmsresearch/paperbanana
We can use it until we have an official version from the Google research team.

\n","updatedAt":"2026-02-04T15:28:42.040Z","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8721780180931091},"editors":["dippatel1994"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png"],"reactions":[{"reaction":"๐Ÿ”ฅ","users":["dippatel1994"],"count":1}],"isReport":false}},{"id":"69838478036f5289e472f523","author":{"_id":"67b10a7bba726eda5c5300d9","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/67b10a7bba726eda5c5300d9/v4JU_FPuuxSuaj7xb-LET.jpeg","fullname":"Juan David","name":"Jdcloude","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":1,"isUserFollowing":false},"createdAt":"2026-02-04T17:40:08.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Stunned! This is a huge tool for scientists! I was need this myself hahaha, I hate doing academic ilustrations manually. ","html":"

Stunned! This is a huge tool for scientists! I was need this myself hahaha, I hate doing academic ilustrations manually.

\n","updatedAt":"2026-02-04T17:40:08.038Z","author":{"_id":"67b10a7bba726eda5c5300d9","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/67b10a7bba726eda5c5300d9/v4JU_FPuuxSuaj7xb-LET.jpeg","fullname":"Juan David","name":"Jdcloude","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":1,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9739241600036621},"editors":["Jdcloude"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/67b10a7bba726eda5c5300d9/v4JU_FPuuxSuaj7xb-LET.jpeg"],"reactions":[{"reaction":"โค๏ธ","users":["dippatel1994"],"count":1}],"isReport":false}},{"id":"6984869de90cf9cce2f2d9ec","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false},"createdAt":"2026-02-05T12:01:33.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"MCP server & skills support is available now. You just need to use \"uvx --from \"paperbanana[mcp]\" paperbanana-mcp\" to configure the paperbanana mcp server. or, \"claude mcp add paperbanana -e GOOGLE_API_KEY=your-key -- uvx --from \"paperbanana[mcp]\"\n paperbanana-mcp\" if using Claude. ","html":"

MCP server & skills support is available now. You just need to use \"uvx --from \"paperbanana[mcp]\" paperbanana-mcp\" to configure the paperbanana mcp server. or, \"claude mcp add paperbanana -e GOOGLE_API_KEY=your-key -- uvx --from \"paperbanana[mcp]\"
paperbanana-mcp\" if using Claude.

\n","updatedAt":"2026-02-05T12:01:33.269Z","author":{"_id":"64bacb06f346e6651476780c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png","fullname":"Dipkumar Patel","name":"dippatel1994","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":11,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6236711144447327},"editors":["dippatel1994"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/64bacb06f346e6651476780c/7g3HFlkKLISrxn2bqhEs5.png"],"reactions":[{"reaction":"๐Ÿ”ฅ","users":["dippatel1994"],"count":1}],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2601.23265","authors":[{"_id":"698024266676f933227065e5","name":"Dawei Zhu","hidden":false},{"_id":"698024266676f933227065e6","name":"Rui Meng","hidden":false},{"_id":"698024266676f933227065e7","name":"Yale Song","hidden":false},{"_id":"698024266676f933227065e8","user":{"_id":"66e01e9a0447b0c09906a84a","avatarUrl":"/avatars/5b30840a17d61943f714fc7d6be45ece.svg","isPro":false,"fullname":"Xiyu","user":"Lemon-prog","type":"user"},"name":"Xiyu Wei","status":"claimed_verified","statusLastChangedAt":"2026-02-03T10:08:48.856Z","hidden":false},{"_id":"698024266676f933227065e9","name":"Sujian Li","hidden":false},{"_id":"698024266676f933227065ea","name":"Tomas Pfister","hidden":false},{"_id":"698024266676f933227065eb","name":"Jinsung Yoon","hidden":false}],"publishedAt":"2026-01-30T18:33:37.000Z","submittedOnDailyAt":"2026-02-02T01:42:31.514Z","title":"PaperBanana: Automating Academic Illustration for AI Scientists","submittedOnDailyBy":{"_id":"6039478ab3ecf716b1a5fd4d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6039478ab3ecf716b1a5fd4d/_Thy4E7taiSYBLKxEKJbT.jpeg","isPro":true,"fullname":"taesiri","user":"taesiri","type":"user"},"summary":"Despite rapid advances in autonomous AI scientists powered by language models, generating publication-ready illustrations remains a labor-intensive bottleneck in the research workflow. To lift this burden, we introduce PaperBanana, an agentic framework for automated generation of publication-ready academic illustrations. Powered by state-of-the-art VLMs and image generation models, PaperBanana orchestrates specialized agents to retrieve references, plan content and style, render images, and iteratively refine via self-critique. To rigorously evaluate our framework, we introduce PaperBananaBench, comprising 292 test cases for methodology diagrams curated from NeurIPS 2025 publications, covering diverse research domains and illustration styles. Comprehensive experiments demonstrate that PaperBanana consistently outperforms leading baselines in faithfulness, conciseness, readability, and aesthetics. We further show that our method effectively extends to the generation of high-quality statistical plots. Collectively, PaperBanana paves the way for the automated generation of publication-ready illustrations.","upvotes":191,"discussionId":"698024276676f933227065ec","projectPage":"https://dwzhu-pku.github.io/PaperBanana/","githubRepo":"https://github.com/dwzhu-pku/PaperBanana","githubRepoAddedBy":"admin","ai_summary":"_paperbanana is an agentic framework that automates the creation of publication-ready academic illustrations using advanced vision-language models and image generation techniques.","ai_keywords":["VLMs","image generation models","agentic framework","publication-ready illustrations","methodology diagrams","PaperBananaBench","self-critique","statistical plots"],"githubStars":3751,"organization":{"_id":"5e6aca39878b8b2bf9806447","name":"google","fullname":"Google","avatar":"https://cdn-uploads.huggingface.co/production/uploads/5dd96eb166059660ed1ee413/WtA3YYitedOr9n02eHfJe.png"}},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"6039478ab3ecf716b1a5fd4d","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6039478ab3ecf716b1a5fd4d/_Thy4E7taiSYBLKxEKJbT.jpeg","isPro":true,"fullname":"taesiri","user":"taesiri","type":"user"},{"_id":"6434b6619bd5a84b5dcfa4de","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6434b6619bd5a84b5dcfa4de/h8Q6kPNjFNc03wmdboHzq.jpeg","isPro":true,"fullname":"Young-Jun Lee","user":"passing2961","type":"user"},{"_id":"6463554dd2044cd1d7c6e0bf","avatarUrl":"/avatars/d7653623117268c545a7063fec69664b.svg","isPro":false,"fullname":"Bingzheng Wei","user":"Bingzheng","type":"user"},{"_id":"652ce0d4c543a08aa92e010f","avatarUrl":"/avatars/7978304e3fe99b0d4d0712441c6a24f3.svg","isPro":false,"fullname":"Haoyu Guo","user":"ghy0324","type":"user"},{"_id":"633e570be7d5ce7bfe037a53","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/633e570be7d5ce7bfe037a53/zV8ULv4Mu7YIGZ8D3JtmK.jpeg","isPro":false,"fullname":"Zhaocheng Liu","user":"zhaocheng","type":"user"},{"_id":"61af81009f77f7b669578f95","avatarUrl":"/avatars/fb50773ac49948940eb231834ee6f2fd.svg","isPro":false,"fullname":"rotem israeli","user":"irotem98","type":"user"},{"_id":"646f17ff6b3df773a2c80697","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/646f17ff6b3df773a2c80697/W6hgBFkyDJaEpzNm_wGp6.png","isPro":false,"fullname":"Yingjie Lei","user":"ChaceLei2004","type":"user"},{"_id":"65377c30e48353201e6fdda0","avatarUrl":"/avatars/a8f803b6f2e598eaee9c52c0d2ddfc16.svg","isPro":false,"fullname":"Jiaheng Liu","user":"CheeryLJH","type":"user"},{"_id":"61f44bab7eba274ea80b74ce","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61f44bab7eba274ea80b74ce/BRbKX1jephdZ7D44FATl4.jpeg","isPro":false,"fullname":"Hyoung-Kyu Song","user":"deepkyu","type":"user"},{"_id":"6407e5294edf9f5c4fd32228","avatarUrl":"/avatars/8e2d55460e9fe9c426eb552baf4b2cb0.svg","isPro":false,"fullname":"Stoney Kang","user":"sikang99","type":"user"},{"_id":"6947f69751d7ae7c3c7b6908","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/PuIDZB9XDShHohKhYmdmp.png","isPro":true,"fullname":"Ben Kelly","user":"YellowjacketGames","type":"user"},{"_id":"653e7e0681f52ceb4d3c9a72","avatarUrl":"/avatars/6cb0d7c0ecf3d9890463f3935ccb32b9.svg","isPro":false,"fullname":"Alejandro Mozo","user":"amozo","type":"user"}],"acceptLanguages":["*"],"dailyPaperRank":1,"organization":{"_id":"5e6aca39878b8b2bf9806447","name":"google","fullname":"Google","avatar":"https://cdn-uploads.huggingface.co/production/uploads/5dd96eb166059660ed1ee413/WtA3YYitedOr9n02eHfJe.png"}}">
Papers
arxiv:2601.23265

PaperBanana: Automating Academic Illustration for AI Scientists

Published on Jan 30
ยท Submitted by
taesiri
on Feb 2
#1 Paper of the day
ยท google Google
Authors:
,
,
,
,
,

Abstract

_paperbanana is an agentic framework that automates the creation of publication-ready academic illustrations using advanced vision-language models and image generation techniques.

AI-generated summary

Despite rapid advances in autonomous AI scientists powered by language models, generating publication-ready illustrations remains a labor-intensive bottleneck in the research workflow. To lift this burden, we introduce PaperBanana, an agentic framework for automated generation of publication-ready academic illustrations. Powered by state-of-the-art VLMs and image generation models, PaperBanana orchestrates specialized agents to retrieve references, plan content and style, render images, and iteratively refine via self-critique. To rigorously evaluate our framework, we introduce PaperBananaBench, comprising 292 test cases for methodology diagrams curated from NeurIPS 2025 publications, covering diverse research domains and illustration styles. Comprehensive experiments demonstrate that PaperBanana consistently outperforms leading baselines in faithfulness, conciseness, readability, and aesthetics. We further show that our method effectively extends to the generation of high-quality statistical plots. Collectively, PaperBanana paves the way for the automated generation of publication-ready illustrations.

Community

Paper submitter

PaperBanana automates publication-ready AI research illustrations via an agentic framework using VLMs and image models, orchestrating reference retrieval, planning, rendering, and self-critique with a benchmarking suite.

This is excellent, I never considered science illustrations as a use-case for image gen models, but it makes total sense and I can see this applying to technical blogging as well.

Interestingly, I had to design a similar pipeline for illustrating games. We're a game studio trying to play "research lab" to push our frontiers, and the need to create structured illustrations at scale, with precision, seems to be a shared objective here.

We're just learning how to write up our results in a more "scientific" way besides "comments.md", and this is a helpful piece of the puzzle.

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Amazing work! ๐Ÿš€

It would be very cool to get a space demo of paperbanana so we can understand and try out how the vlms and generators are orchestrated.

ยท

someone should take inspiration from the twitter account "science diagrams that look like s***posts" and make a meme generator using this.

No description provided.

Released unofficial implementation with MCP server support: https://github.com/llmsresearch/paperbanana
We can use it until we have an official version from the Google research team.

Stunned! This is a huge tool for scientists! I was need this myself hahaha, I hate doing academic ilustrations manually.

MCP server & skills support is available now. You just need to use "uvx --from "paperbanana[mcp]" paperbanana-mcp" to configure the paperbanana mcp server. or, "claude mcp add paperbanana -e GOOGLE_API_KEY=your-key -- uvx --from "paperbanana[mcp]"
paperbanana-mcp" if using Claude.

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2601.23265 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2601.23265 in a dataset README.md to link it from this page.

Spaces citing this paper 1

Collections including this paper 29