
DreamFusion is a research method for generating 3D content from text prompts by leveraging pre-train
DreamFusion is a research method for generating 3D content from text prompts by leveraging pre-trained 2D diffusion models. Developed by Google Research, it optimizes a neural radiance field (NeRF) so that rendered views of a 3D scene align with images implicitly defined by a powerful text-to-image diffusion model, such as Imagen. Instead of requiring 3D training data, DreamFusion uses score distillation sampling to convert the diffusion modelβs knowledge into a 3D representation.
The approach produces textured 3D objects that can be viewed from arbitrary camera angles and relit, making it particularly useful for synthetic asset creation in graphics, games, AR/VR, and prototyping. The website presents qualitative results, technical explanations, and supplementary material, including comparisons with prior work like Dream Fields.
Please sign in to comment
π¬ No comments yet
Be the first to share your thoughts!
Explore 183+ top alternatives to DreamFusion

RoomGPT is a web-based AI tool that generates alternative interior design concepts for user-provided room photos in various styles and layouts.

Volograms is a tool that converts standard 2D video recordings of people into 3D volumetric holograms for use in mobile, web, and XR applications.

ArchiVinci is a web-based AI tool that converts natural language descriptions into detailed vector illustrations, icons, and diagrams for design and documentation workflows.
MagicVideo-V2 is a text-to-video generation system that integrates image synthesis, motion generation, reference image embedding, and frame interpolation into an end-to-end pipeline.

Tailornova is an online 3D fashion design platform that generates custom sewing patterns and virtual garment visualizations based on user-defined measurements and style choices.

Cross is a 3D capture tool that uses LiDAR and photogrammetry to create, process, and export realistic 3D models from mobile devices and the web.

Gliastar is an AI-powered video creation tool that animates brand mascots from text input, generating character-driven animations for marketing, social media, and digital content.
Depth Anything 3 is a state-of-the-art monocular depth estimation model designed to recover detailed