Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456 Paper page - GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with
Point Cloud Priors
\n","updatedAt":"2024-06-08T18:49:53.721Z","author":{"_id":"6186ddf6a7717cb375090c01","avatarUrl":"/avatars/716b6a7d1094c8036b2a8a7b9063e8aa.svg","fullname":"Julien BLANCHON","name":"blanchon","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":176,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.5406104922294617},"editors":["blanchon"],"editorAvatarUrls":["/avatars/716b6a7d1094c8036b2a8a7b9063e8aa.svg"],"reactions":[],"isReport":false}}],"primaryEmailConfirmed":false,"paper":{"id":"2310.08529","authors":[{"_id":"6529558d4d0eef4a2bd41b3a","user":{"_id":"64e5ee4b93d04e3439f4e988","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64e5ee4b93d04e3439f4e988/g-THJvunoSPUyQ2v_fChU.jpeg","isPro":false,"fullname":"taoranyi","user":"thewhole","type":"user"},"name":"Taoran Yi","status":"claimed_verified","statusLastChangedAt":"2023-10-16T08:25:30.621Z","hidden":false},{"_id":"6529558d4d0eef4a2bd41b3b","user":{"_id":"652cf49cc6857682d3168f5a","avatarUrl":"/avatars/9ccffa04832429aad37320a301ea8e36.svg","isPro":false,"fullname":"Jiemin Fang","user":"JieminFang","type":"user"},"name":"Jiemin Fang","status":"claimed_verified","statusLastChangedAt":"2023-10-16T08:40:03.502Z","hidden":false},{"_id":"6529558d4d0eef4a2bd41b3c","user":{"_id":"65297efc802e3d1a4f7e23e8","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/65297efc802e3d1a4f7e23e8/w7UdXJ_A8H6G8TN3OenSS.png","isPro":true,"fullname":"UniLat3D","user":"UniLat3D","type":"user"},"name":"Guanjun Wu","status":"claimed_verified","statusLastChangedAt":"2023-10-16T07:22:14.838Z","hidden":true},{"_id":"6529558d4d0eef4a2bd41b3d","name":"Lingxi Xie","hidden":false},{"_id":"6529558d4d0eef4a2bd41b3e","user":{"_id":"640bea9cb4f73086cddbe0a6","avatarUrl":"/avatars/ae460dd971de1d04660b266b22f81f9d.svg","isPro":false,"fullname":"Xiaopeng Zhang","user":"xpzhang","type":"user"},"name":"Xiaopeng Zhang","status":"admin_assigned","statusLastChangedAt":"2023-10-13T15:57:52.890Z","hidden":false},{"_id":"6529558d4d0eef4a2bd41b3f","name":"Wenyu Liu","hidden":false},{"_id":"6529558d4d0eef4a2bd41b40","name":"Qi Tian","hidden":false},{"_id":"6529558d4d0eef4a2bd41b41","user":{"_id":"62600de6d47e3dbae32ce1ce","avatarUrl":"/avatars/a536417cfec6e10ac415091bd1829426.svg","isPro":false,"fullname":"Xinggang Wang","user":"xinggangw","type":"user"},"name":"Xinggang Wang","status":"admin_assigned","statusLastChangedAt":"2023-10-13T16:03:02.428Z","hidden":false}],"publishedAt":"2023-10-12T17:22:24.000Z","submittedOnDailyAt":"2023-10-13T13:04:55.078Z","title":"GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with\n Point Cloud Priors","submittedOnDailyBy":{"_id":"60f1abe7544c2adfd699860c","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1674929746905-60f1abe7544c2adfd699860c.jpeg","isPro":false,"fullname":"AK","user":"akhaliq","type":"user"},"summary":"In recent times, the generation of 3D assets from text prompts has shown\nimpressive results. Both 2D and 3D diffusion models can generate decent 3D\nobjects based on prompts. 3D diffusion models have good 3D consistency, but\ntheir quality and generalization are limited as trainable 3D data is expensive\nand hard to obtain. 2D diffusion models enjoy strong abilities of\ngeneralization and fine generation, but the 3D consistency is hard to\nguarantee. This paper attempts to bridge the power from the two types of\ndiffusion models via the recent explicit and efficient 3D Gaussian splatting\nrepresentation. A fast 3D generation framework, named as \\name, is proposed,\nwhere the 3D diffusion model provides point cloud priors for initialization and\nthe 2D diffusion model enriches the geometry and appearance. Operations of\nnoisy point growing and color perturbation are introduced to enhance the\ninitialized Gaussians. Our \\name can generate a high-quality 3D instance within\n25 minutes on one GPU, much faster than previous methods, while the generated\ninstances can be directly rendered in real time. Demos and code are available\nat https://taoranyi.com/gaussiandreamer/.","upvotes":18,"discussionId":"6529558f4d0eef4a2bd41ba7","githubRepo":"https://github.com/hustvl/GaussianDreamer","githubRepoAddedBy":"auto","ai_summary":"A framework combining 2D and 3D diffusion models using 3D Gaussian splatting efficiently generates high-quality 3D assets from text prompts with real-time rendering.","ai_keywords":["3D diffusion models","2D diffusion models","3D Gaussian splatting","noisy point growing","color perturbation","real-time rendering"],"githubStars":814},"canReadDatabase":false,"canManagePapers":false,"canSubmit":false,"hasHfLevelAccess":false,"upvoted":false,"upvoters":[{"_id":"5dd96eb166059660ed1ee413","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5dd96eb166059660ed1ee413/NQtzmrDdbG0H8qkZvRyGk.jpeg","isPro":true,"fullname":"Julien Chaumond","user":"julien-c","type":"user"},{"_id":"64b22e6b0a54158d66f18688","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64b22e6b0a54158d66f18688/Cbl3oMMMANbnCMoSUYenI.png","isPro":true,"fullname":"Benhao Huang","user":"HuskyDoge","type":"user"},{"_id":"6090856b1e62cfa4f5c23ccb","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6090856b1e62cfa4f5c23ccb/XtetET8dL65viJDOKIWzl.jpeg","isPro":true,"fullname":"Thomas Wood","user":"odellus","type":"user"},{"_id":"633b71b47af633cbcd0671d8","avatarUrl":"/avatars/6671941ced18ae516db6ebfbf73e239f.svg","isPro":false,"fullname":"juand4bot","user":"juandavidgf","type":"user"},{"_id":"65297efc802e3d1a4f7e23e8","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/65297efc802e3d1a4f7e23e8/w7UdXJ_A8H6G8TN3OenSS.png","isPro":true,"fullname":"UniLat3D","user":"UniLat3D","type":"user"},{"_id":"648eb1eb59c4e5c87dc116e0","avatarUrl":"/avatars/c636cea39c2c0937f01398c94ead5dad.svg","isPro":false,"fullname":"fdsqefsgergd","user":"T-representer","type":"user"},{"_id":"63d4c8ce13ae45b780792f32","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63d4c8ce13ae45b780792f32/QasegimoxBqfZwDzorukz.png","isPro":false,"fullname":"Ohenenoo","user":"PeepDaSlan9","type":"user"},{"_id":"6032802e1f993496bc14d9e3","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6032802e1f993496bc14d9e3/w6hr-DEQot4VVkoyRIBiy.png","isPro":false,"fullname":"Omar Sanseviero","user":"osanseviero","type":"user"},{"_id":"5f17f0a0925b9863e28ad517","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5f17f0a0925b9863e28ad517/fXIY5i9RLsIa1v3CCuVtt.jpeg","isPro":true,"fullname":"Victor Mustar","user":"victor","type":"user"},{"_id":"636ac507e3ad78bc68b31cfe","avatarUrl":"/avatars/e6dd4027945909c7cf13c61807c78f23.svg","isPro":false,"fullname":"Anas Saeed","user":"SaeedAnas","type":"user"},{"_id":"61e7c06064d3c6c929057bee","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/61e7c06064d3c6c929057bee/QxULx1EA1bgmjXxupQX4B.jpeg","isPro":false,"fullname":"่็็","user":"gary109","type":"user"},{"_id":"65139ed913aa0728132b9da1","avatarUrl":"/avatars/7c6c0c5b53e1f742e5539e238b6989d0.svg","isPro":false,"fullname":"winjvlee","user":"winjvlee","type":"user"}],"acceptLanguages":["*"],"dailyPaperRank":0}">
A framework combining 2D and 3D diffusion models using 3D Gaussian splatting efficiently generates high-quality 3D assets from text prompts with real-time rendering.
AI-generated summary
In recent times, the generation of 3D assets from text prompts has shown
impressive results. Both 2D and 3D diffusion models can generate decent 3D
objects based on prompts. 3D diffusion models have good 3D consistency, but
their quality and generalization are limited as trainable 3D data is expensive
and hard to obtain. 2D diffusion models enjoy strong abilities of
generalization and fine generation, but the 3D consistency is hard to
guarantee. This paper attempts to bridge the power from the two types of
diffusion models via the recent explicit and efficient 3D Gaussian splatting
representation. A fast 3D generation framework, named as \name, is proposed,
where the 3D diffusion model provides point cloud priors for initialization and
the 2D diffusion model enriches the geometry and appearance. Operations of
noisy point growing and color perturbation are introduced to enhance the
initialized Gaussians. Our \name can generate a high-quality 3D instance within
25 minutes on one GPU, much faster than previous methods, while the generated
instances can be directly rendered in real time. Demos and code are available
at https://taoranyi.com/gaussiandreamer/.