Adobe & ANU’s LRM Reconstructs Models For Single Image to 3D in 5s

The concept of instantly generating a 3D representation from a single image of any object is undeniably captivating. This breakthrough promises to significantly advance applications in industrial design, animation, gaming, and the realms of Augmented Reality (AR) and Virtual Reality (VR). Besides, the remarkable achievements in natural language processing and image processing have inspired researchers to delve into the realms of learning a universal 3D foundation for reconstructing objects from single images.

In a new paper LRM: Large Reconstruction Model for Single Image to 3D, a research team from Adobe Research and Australian National University introduces an innovative Large Reconstruction Model (LRM). This groundbreaking model has the remarkable ability to predict a 3D model of an object from a single input image in a mere 5 seconds.

The LRM approach adopts a robust transformer-based encoder-decoder architecture for acquiring 3D object representations from a single image in a data-driven fashion. The model takes an image as input and regresses a Neural Radiance Field (NeRF) in the form of a triplane representation. To achieve this, LRM employs the pre-trained visual transformer DINO (Caron et al., 2021) as the image encoder to generate image features. Subsequently, it learns an image-to-triplane transformer decoder to project the 2D image features onto the 3D triplane through cross-attention, effectively modeling relationships among the spatially-structured triplane tokens via self-attention.

The output tokens from the decoder are then reshaped and upsampled to create the final triplane feature maps. This enables LRM to render images from any viewpoint by decoding the triplane feature of each point. It does so with the aid of an additional shared multi-layer perceptron (MLP) to determine color and density, facilitating volume rendering.

What sets LRM apart is its design, which boasts high scalability and efficiency. In addition to employing a fully transformer-based pipeline, the triplane NeRF it employs stands out as a concise and scalable 3D representation. Compared to other alternatives like volumes and point clouds, it is computationally efficient. Furthermore, it offers superior locality with respect to the input image.

One of the remarkable aspects of LRM is its training process, which involves minimizing the difference between rendered images and ground truth images at novel perspectives. This is done without the need for excessive 3D-aware regularization or intricate hyper-parameter tuning, making the model exceedingly efficient during training and adaptable to a wide range of multi-view image datasets.

Empirical results underscore the remarkable fidelity of LRM when handling various inputs, spanning real-world images, synthetic creations, and rendered images featuring diverse subjects with distinct textures. It stands out as a state-of-the-art solution for single-image-to-3D reconstruction when compared to One-2-3-45.

In summary, this groundbreaking work demonstrates the potential of LRM to swiftly predict a 3D model of any object from a single, arbitrary image found in the wild. This development opens up a broad array of real-world applications that can benefit from this rapid and accurate 3D reconstruction capability.

Video demos and interactable 3D meshes can be found on this website: https://yiconghong.me/LRM/. The paper LRM: Large Reconstruction Model for Single Image to 3D on arXiv.

Author: Hecate He | Editor: Chain Zhang

We know you don’t want to miss any news or research breakthroughs. Subscribe to our popular newsletter Synced Global AI Weekly to get weekly AI updates.

6 comments on “Adobe & ANU’s LRM Reconstructs Models For Single Image to 3D in 5s”

Whoget

2023-12-20

Hope A Small World Cup game is your way! ⏱️ You choose the tempo now! Set match duration to 45 or 90 seconds and adjust gameplay to suit your perfect match experience.

Loading...

Jammy

2024-01-12

Community and Connection:
suika game extends beyond the individual gaming experience. Join a thriving community of enthusiasts, share strategies, and witness the diverse approaches players take to master the art of fruit stacking. The game’s appeal lies not only in its mechanics but in the shared joy of exploring its nuances with fellow players.

Loading...

Paul Campbell

2024-01-30

I recently explored ProEssayWriting through https://topwriting.services/reviews/proessaywriting-review. Impressed with their professionalism and timely delivery. Legit service with top-notch writers. Highly recommend for anyone seeking reliable assistance with academic writing tasks. Great experience overall!

Loading...

meeloun education

2024-03-06

Understanding the qualifications of assignment ghostwriting https://www.lxws.net/particular.php?id=177 is crucial. An excellent assignment agency should have a professional and experienced team of writers with relevant academic background, rich writing experience, and good writing skills.

Loading...

Tiny Fishing

2024-04-21

Great! Get ready for great games on Tiny Fishing

Loading...

seadweer

2025-12-18

Atlas Copco Air Compressor & Spare Parts Supplier – Seadweer is a distributor and supplier of Atlas Copco air compressor, only sell genuine guarantee air compressors and original genuine spare parts on https://www.aircompressoragent.com/

Loading...

Adobe & ANU’s LRM Reconstructs Models For Single Image to 3D in 5s

Like this:

6 comments on “Adobe & ANU’s LRM Reconstructs Models For Single Image to 3D in 5s”

Leave a Reply Cancel reply

Related

Share this:

Like this:

6 comments on “Adobe & ANU’s LRM Reconstructs Models For Single Image to 3D in 5s”

Leave a Reply Cancel reply

Related