Holy mother of god. Wow!
Either Matterport takes this and runs with it, or this is a startup waiting to disrupt real estate.
I can’t believe how smooth this ran on my smartphone.
Feedback: if there were a mode that used the phone's compass and gyro for navigation, it'd feel more natural. It felt weird to navigate with my fingers and figure out how to move in the xyz dimensions.
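For what it's worth, browsers already expose the gyro via DeviceOrientationEvent, so this could be an opt-in mode. A minimal sketch, assuming a hypothetical `setCameraYawPitch` hook into the viewer's camera (not the demo's actual API):

```typescript
// Sketch only: drive the viewer camera from the phone's orientation sensor.
// `setCameraYawPitch` is a hypothetical hook; the real viewer API may differ.
declare function setCameraYawPitch(yawRad: number, pitchRad: number): void;

const DEG2RAD = Math.PI / 180;

window.addEventListener('deviceorientation', (e: DeviceOrientationEvent) => {
  if (e.alpha === null || e.beta === null) return;
  // alpha: rotation about the vertical axis (compass-ish); beta: front/back tilt.
  // Holding the phone upright gives beta ~= 90, so subtract that for pitch.
  // (iOS also requires DeviceOrientationEvent.requestPermission() first.)
  setCameraYawPitch(e.alpha * DEG2RAD, (e.beta - 90) * DEG2RAD);
});
```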
As others have said, VR mode would be epic.
It runs impressively well on my two-year-old S21 FE. The way it streamed in more images as I explored the space was remarkable, and the TV reflections in the Berlin demo were especially impressive.
My one note is that it took a really long time to load all the images - the scene wouldn't render until all ~40 initial images had loaded. Would it be possible to start partially rendering as the images arrive, or do you need to wait for all of them before you can do the first big render?
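I'd guess the first draw needs every texture slice the shader samples, but on the loading side something like this would at least let a partial scene appear as images arrive - `uploadTexture` and `renderFrame` here are placeholders, not the demo's actual functions:

```typescript
// Sketch only: upload and redraw per image instead of awaiting the whole batch.
async function loadProgressively(
  textureUrls: string[],
  uploadTexture: (slot: number, img: ImageBitmap) => void,
  renderFrame: () => void,
): Promise<void> {
  await Promise.all(textureUrls.map(async (url, slot) => {
    const blob = await (await fetch(url)).blob();
    uploadTexture(slot, await createImageBitmap(blob));
    renderFrame(); // draw with whatever slices are resident so far
  }));
}
```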
Wow. Some questions:
Take, for instance, the fulllivingroom demo. (I prefer FPS mode.)
1) How many images are input?
2) How long does it take to compute these models?
3) How long does it take to prepare these models for this browser, with all levels, etc?
4) Have you tried this in VR yet?
"Researchers create open-source platform for Neural Radiance Field development" (2023) https://news.ycombinator.com/item?id=36966076
NeRF Studio > Included Methods, Third-party Methods: https://docs.nerf.studio/#supported-methods
Neural Radiance Field: https://en.wikipedia.org/wiki/Neural_radiance_field
I'm following this through Two Minute Papers and I'm looking forward to using it.
My grandpa died two years ago and, in hindsight, I'm glad I took pictures to use as in your demo.
Awesome, thanks :)
This is __really__ stunning work; it's a huge, huge deal that I'm seeing this in a web browser on my phone. Congratulations!
When I look at the NYC scene at the highest quality on desktop, I'm surprised by how low-quality e.g. the stuff on the counter and shelves is. So then I load the Lego model and see that it's _very_ detailed, so it doesn't seem inherent to the method.
Is it a consequence of input photo quality, or something else?
There is a market here for Realtors to upload pictures and produce walk-throughs of homes for sale.
Does an open-source toolchain exist for capturing, processing, and hosting navigable 3D walkthroughs like this (e.g. something like an open-source Matterport)?
What I'm seeing from all of these things is very accurate, navigable 3D images of single scenes.
What I haven't seen anything of is feature and object detection, blocking, and extraction.
Hopefully a more efficient and streamable codec necessitates the sort of structure that lends itself more easily to analysis.
Why is there a 300 m^2 footprint limit if the sub-models are dynamically loaded? Is this constrained by training, rasterizing, or both?
When might we see this in consumer VR? I'm surprised we don't already, but I suspected it was a computation constraint.
Does this relieve the computation constraint enough to run on Quest 2/3?
Is there something else that would prevent binocular use?
Wow! What am I even looking at here? Polygons, voxels, or something else entirely? How were the benchmarks recorded?
Is there any relation between this class of rendering techniques and the way the BD scenes in Cyberpunk 2077 were created? The behavior of the volume and the "voxels" seems eerily similar.
Very impressive! Any information on how this compares to 3D Gaussian splatting in terms of performance, quality or data size?
This looks really amazing. I have a relatively old smartphone (2019) and it's surprisingly smooth and high-fidelity. Amazing job!
Great work!!
Question for the authors: are there opportunities, where they exist, to reconstruct a model of a scene without using optimization or tuning methods?
We are refining efficient ways of rendering a view of a scene from these models, but the scenes remain static, and they also take a while to reconstruct.
Can we still achieve the great look and details of RF and GS without paying for an expensive reconstruction per instance of the scene?
Are there ways of greedily converting a scene built with traditional CG methods into these new representations, now that they are fast to render?
Please forgive any misconceptions that I may have in advance! We really appreciate the work y'all are advancing!
Can you recommend a good entry point into the theory/math behind these? This is one of those true "wtf, we can do this now?" moments; I'm super curious about how these are generated/created.
Is there a relatively easy way to apply these kinds of techniques (either NeRFs or Gaussian splats) to larger environments, even at lower precision? Say, small towns or a few blocks' worth of environment.
Any plans to do this in VR? I would love to try this.
Are radiance fields related to Gaussian splatting?
I wonder, since this runs at real-time framerates, if it would be possible for someone to composite a regular rasterized frame on top of something like this (with correct depth testing) to make a game.
For example, a third-person game where the character you control and the NPCs/enemies are rasterized but the environment is all radiance fields.
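In principle the composite is just two passes sharing one depth buffer: render the radiance field first (writing its estimated ray-termination depth to gl_FragDepth), then rasterize the characters with depth testing on. A hedged WebGL2 sketch, where `renderRadianceField` and `drawCharacters` stand in for whatever the two renderers actually provide:

```typescript
// Sketch only: composite rasterized characters over a radiance-field background.
function renderComposite(
  gl: WebGL2RenderingContext,
  renderRadianceField: () => void, // must write per-pixel depth via gl_FragDepth
  drawCharacters: () => void,      // ordinary rasterized meshes
): void {
  gl.enable(gl.DEPTH_TEST);
  gl.clear(gl.COLOR_BUFFER_BIT | gl.DEPTH_BUFFER_BIT);

  renderRadianceField(); // pass 1: environment color + depth
  drawCharacters();      // pass 2: depth-tests against the environment's depth
}
```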
I'm not sure why this demo runs so horribly in Firefox but not in other browsers... anyone else seeing this?
Been following Jon Barron et al’s work for a little while now. The speed of improvement given the complexity of these types of systems is mind-boggling.
I wonder how long it will take before Google Street View gets replaced by some NeRF variant.
Get this on a VR headset and you literally have a game changer.
How long until you can stitch Street View into a seamless streaming NeRF of every street in the world? I hope that's the goal you're working towards!
I'm curious how the creators would compare this to the capabilities of Unreal Engine 5 (as far as the display technology goes.)
Impressive is not a big enough word! This is incredibly smooth on my phone and crazy good on a desktop PC. Keep it up!
Since you're here, @author :) do you mind giving a quick rundown on how this compares with the quality of Zip-NeRF?
Amazing, impressive, almost unbelievable :O
How long until I can make my own?
I had read about a competing technology whose proponents suggested NeRFs were a dead end, but perhaps that was biased?
Any plans to release the models?
This is very impressive, but given it's by Google, will some code ever be released?
> Google DeepMind, Google Research, Google Inc.
What a variety of groups! How did this come about?
Will there be any notebooks or other code released to train our own models?
What kind of modes does the viewer cycle through when I press the space key?
Just ran this on my phone through a browser, this is very impressive
Very impressive demo.
Memory efficient? It downloaded 500 MB!
Hope this doesn't come across as snarky, but does Google pressure researchers to do PR in their papers? This really is cool, but there is a lot of self-promotion in this paper and very little discussion of limitations (and what discussion there is gets bookended by qualifications about why they aren't really limitations).
It makes it harder for me to trust the paper if I feel like it's trying to persuade me of something rather than describe the complete findings.
The mirror on the wall of the bathroom in the Berlin location looks through to the kitchen in the next room. I guess the depth-gauging algorithm uses parallax, and mirrors confuse it by seeming like windows. The kitchen has a blob of blurriness where the rear of the mirror intrudes into it, but you can see through the blurriness into either room.
The effect is a bit spooky. I felt like a ghost going through walls.