Holy sh*t, can you imagine a year from now if they start using something like this for concerts or basketball games? Like imagine rewatching a basketball game but being able to move the camera on the court???? Might not be possible yet, but this shows the tech's possible. Let alone being able to scale it to real time someday, maybe, lol
A thought experiment I like to employ when imagining the impact of a new piece of tech: reverse the timeline. If the tech were the status quo, how would the current status quo be marketed?
If moving the camera were the norm, the current status quo would probably be marketed along the lines of "No more micro-managing camera views, arguing over playback speed or fiddling with the timeline – this next leap in technology introduces pre-edited video, where each game is preprocessed by a team of highly-skilled, professional producers, selecting the best viewing angle and playback speed so you and your friends can just sit back, relax and enjoy the game."
If the fictional press release sounds good enough, the tech probably won't hit.
I remember, 30 years ago during the switch to digital TV broadcasting, that proponents of the tech tried to sell a future where viewers of sports events would be able to select which camera to watch. Then again, imagine watching a game with 10 friends and trying to agree over cameras...
Great point. Are you familiar with McLuhan's media tetrad?
The hypothetical status quo is a wonderful example of retrieval. Every new technology should be expected to have the effect of reemphasizing something previously obsolete.
I think for routine game-watching you're 100% right. But for replaying amazing/interesting/controversial plays, this tech would be an enormous improvement over being captive to the broadcast team. Everyone would love to be able to grab control and fly around and zoom in on that one power dunk, critical fumble, bad foul call, etc. on demand.
You want a killer app for VR/AR goggle-style things? You're right, this would be amazing.
Apple demoed some kind of volumetric video to the press with the Vision Pro. There was a short clip of a concert and an NBA game (Nuggets?) among other things. I heard a number of people say it was like being there.
This is a step past that. Apple’s was recorded from some kind of special camera rig (I assume), but I seriously doubt it was full volumetric video from a large number of angles. It sounded more like volumetric video if you were stuck in a (very good) seat in the venue.
I’d be curious to know just how much horsepower it takes to play these back.
I thought they recorded their special videos using the Vision Pro itself, which has enough sensors to build depth maps of the scene and provide novel views within a small range from the original position.
But I am half speculating and I don't really remember. That's just the impression I remember having.
That’s a feature of the headset, but I think some of the demo videos were recorded in some other manner. I seem to remember hearing a discussion on a podcast (The Talk Show? Dithering?) where it was mentioned you could see a camera rig somehow.
Perhaps that was something you could see on TV when they accidentally showed the rig that was at the same event?
Of course, it’s possible they were only using a camera rig so that no one would see the device before they were ready to unveil it. Which would be very Apple-like.
So I’m speculating a bit. But even if they were going to professionally record events I would think they would do better than just have someone sit there with a headset on.
I saw the image you’re referencing yesterday. The unit appeared to be flat and briefcase-sized. It had two fisheye lenses, about 10 cm in diameter each, separated by what I expect is an average human IPD.
So I think that unit was simply capturing 180 degree stereo video. Not enough to compute volume without most of it being some ML inference.
From the paper, it seems a 3060 is enough for 60 FPS on the DNA-Rendering dataset. On the full-screen datasets it manages 25 FPS. A 4090 might be needed to stay above 60 FPS.
Still pretty heavy I’d say but it certainly came a long way and shows us real volumetric video is doable.
That's real-time rendering though, right? Is there anything preventing it from being pre-rendered in non-real-time first? Or does it have to be rendered in real-time?
I'm not familiar with any of this at all, so I'm genuinely curious.
If the context is use in a VR/AR headset, it has to be real-time. And I guess that use case, and the related one where you interactively want to walk around a scene, are two of the main use cases.
I think the best way to consider it... is as a 3D cloud of individually addressable pixels. The size of the cloud sets the dimensions of the real-time rendering problem.
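Concretely, something like this (a minimal Python sketch of that mental model; the names and fields are mine, not the paper's actual data layout):

    import numpy as np

    # A minimal sketch: every "pixel" is a point in space with a color,
    # and a volumetric video is one such cloud per time step.
    # Illustrative only -- not the paper's actual representation.
    class PointCloudFrame:
        def __init__(self, num_points: int):
            self.positions = np.zeros((num_points, 3), dtype=np.float32)  # x, y, z
            self.colors = np.zeros((num_points, 3), dtype=np.uint8)       # r, g, b

    # The point count per frame is what drives memory use and render cost.
    video = [PointCloudFrame(num_points=100_000) for _ in range(30)]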
I started working on something like that a couple decades ago. I figured, with all of the camera feeds at a football game, there would be plenty of views to generate 3D models, even with relatively naive approaches. Then the NFL did it shortly after (2008) [1], and it didn't catch on.
This is what they use in the latest FIFA / FC games, they call it Volumetric Data Capture, basically using video footage from real sports events to capture model and animation data for the players, allowing their unique mannerisms and movements to translate to the game. In previous iterations they would have football players in motion capture suits, but they wouldn't get all the players, plus if they did it would be in stifled, studio conditions, not their natural environment.
Anyway, not quite the same as turning a match into 3D, but definitely related.
We did this (as a side effect) for the Premier League ~2009-2012 (liquidated JUST before VR appeared, where the content worked fantastically, and then ~2014 with the Moverio glasses, even better in AR).
We did live player tracking (~33 cameras) on-site at every game, and for fun rendered players FIFA-style with a free camera. We even did some renders (captures of the real-time engine) for Canal+ highlights as an experiment.
edit: my own GPGPU-only (frag shaders :) ), sub-100 ms, uncalibrated-cameras R&D a few years later (footage directly from Sky / Match of the Day) also works really well on a LookingGlass https://twitter.com/HoloSports/status/1327375694884646913
(I took this to Sky Sports but they said it was a bit too in-the-future)
Actually, a company has been working on this for a few years now, and I believe they are currently in production. Their focus is football/soccer, I believe. I was going to do a research internship there before I dropped it for a different one. Here it is:
https://www.beyondsports.nl/
Looking at it, they now heavily focus on tracking the movements of players to replay in AR.
This is what we were building at my previous start-up, though we had a focus on outdoor sports. We had built a 3D virtual world and used GPS tracks to follow athletes (ultra-marathons, paragliders), etc.
We theorized that 2 go-pro cameras on the athlete would let us completely re-create the entire scene from all angles, and inform an AI of how to re-paint our virtual world with real-world weather environments etc.
Unfortunately, 5 years ago, everyone said I was crazy to think any of this was possible.
There is a video capture of our 3D scenes from 2017 on our old website (we were a full 3d world, not video) https://ayvri.com - the tech was acquired just over a year ago.
They don't even stream the NBA games in 4k because TV networks only support 1080p. I doubt they'd buy into such an expensive technology for such a niche audience.
It will be very interesting to watch how tech like this affects mainstream society.
I imagine pornography will use it at some point soon. Maybe something like chaturbate where your interactions with the cam performer are more customized?
Could it be used with CCTV to reconstruct crime scenes or accidents?
Wedding videos might be a popular use, being able to watch from new angles could be a killer app.
Or a reworking of the first Avengers movie, view all the action from multiple viewpoints.
And all this will probably be built in to the pixel 18 pro or something.
There's an ST:TNG episode I remember too where they have an image and they get the computer to back-trace all the reflections in the image to produce what isn't easily seen.
"Light Field" photography has existed for a few years now, yet there is still no porn using it that I am aware of.
I tried a demo a while back that was very impressive, despite being relatively low-res stock footage. Simply being able to move your head a few inches in any direction without taking the world with you is a much better experience than contemporary VR video.
This seems unprecedented. Imagine if you have this but you can update the scene programmatically. Ask your AI to change the location or actors. Now you have a very convincing artificial scene with anything you can imagine in it.
Re-reading the paper, I totally missed the hardware that they did this on, which was consumer-level GPUs, so I think you may be right - 3 years is probably a good time frame for seeing this kind of tech in commercial games.
My reasoning for initially saying 10ish years:
GPU architecture release cadence is frequent, but not THAT frequent. NVidia released the Ampere (RTX 30X0) in 2020, 3 years ago. Ada Lovelace (RTX 40X0) was released a year ago, in Oct 2022. It's _possible_ to use 4090s to do medium neural-network things right now, but to be on the leading edge, you need multiple datacenter-level cards, each of which is > $5k. Even though it's possible to do crazy things with the most recent generation of GPUs, there don't yet exist games that really take advantage of it. The closest that I'm aware of is the Cyberpunk-level games that make native use of raytracing capabilities.
I think it'll probably be a year or two before we see games come out that really require the level of 40-series cards ( or 7900-series if you're of the AMD persuasion), which is a lag-time of ~3 years after the cards were released. I think the software development time and market saturation are the driving factors in the gap here.
I was under the mistaken impression that the video output was produced on the datacenter cards. They got reasonable performance on a 30-series NVidia card. In 3 years, it's totally reasonable to expect that AAA game players will have that level of GPU performance in their gaming machines, so yeah, I think you're right.
I imagine this would be helpful when making movies if you could basically play around with the scenes without having to refilm them several times to get the best one.
When it comes to perspective and the like, they already do this; multiple camera angles, CGI, and the odd reshoot. Like having Henry Cavill come back for a reshoot, then CGIing out the mustache he had for his next role.
Between this and LLMs, we're half-way to building a holodeck. What's missing at this point is just hard light - i.e. being able to feel the physical substance of simulated objects, and being able to experience it all without a wearable/personal device.
> Even though we have no idea how to even approach that.
We'll have to cheat somehow.
Sure, this is still effectively magic, but a few years ago I thought we were nowhere near having the software layer solved - specifically, the Star Trek style computer interaction and holodeck "storyteller" - the thing that would let you create a high-fidelity interactive world with believable characters and a story generated as you play, out of a command like "computer, give me a cafe in Paris, circa 1890". Now we suddenly have all the pieces for that - we can literally just do it, as long as we constrain the medium to just text, with perhaps some generated imagery for extra mood[0]. And I'm not even talking about GPT-4 - I had a convincing holodeck-style textual roleplay on AI Dungeon ~2-3 years ago, back when GPT-3 had just come out.
Note: I don't want to hijack this thread and make it into yet another LLM discussion - but I want to point out we have a bunch of parts converging into an entirely new kind of experience. And while we may not crack hard light any time soon, a wearable VR/AR holodeck experience now seems in reach in under a decade. Or perhaps closer to SAO[1] than a holodeck, but still - something that felt way beyond our capabilities just a couple years ago.
--
[0] - Though GPT-4V should be able to play a board game where you send an image of an updated board each turn, shouldn't it?
There is sort of a hardware arm to that industry. They’ve made it so you can send money to a model who is on stream and your “donation” will trigger a mechanical actuator that …does things…
But as for sensation for the viewer, no, there’s just what you’d find in a toy shop.
I suppose you could build up what would essentially be a 4D sprite sheet or animation set of a character and use that to support natural looking arbitrary movement. I'm not sure that isn't just a mo-cap character with extra steps, though.
Even the most skilled animators with years of budget still can’t escape the uncanny valley which is why CG animation has converged on a style of blob-humans as the current standard.
I have very little hope of AI driven animation looking ok in the next many decades. Don’t underestimate how hardwired your senses are at finding artifacts in movement. Static images are much easier to “fake”.
> we precompute the physical properties on the point clouds for real-time rendering. Although large in size (30 GiB for 0013_01), these precomputed caches only reside in the main memory and are not explicitly stored on disk
Does the cache size scale linearly with the length of the video? 0013_01 is only 150 frames. And how long does the cache take to generate?
Looks like it; I suspect the authors precomputed everything they could to reach the highest frame rate. Like predecoding all the frames of a movie into raw pixels?
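Back-of-the-envelope, assuming the cache really does scale linearly with frame count (the 30 GiB for the 150-frame 0013_01 sequence is from the paper; the rest is extrapolation):

    # Rough scaling estimate under a linear-growth assumption.
    cache_gib = 30.0   # reported cache size for the 150-frame 0013_01 sequence
    frames = 150
    per_frame_gib = cache_gib / frames            # ~0.2 GiB per frame
    one_minute_30fps = per_frame_gib * 30 * 60    # ~360 GiB for one minute at 30 FPS
    print(f"{per_frame_gib:.2f} GiB/frame, ~{one_minute_30fps:.0f} GiB per minute")

So if it really is linear, anything longer than a short clip gets expensive fast.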
I think volumetric video should be thought of as a regular video, where the decoding and playback happen at the same time. A few papers down the line this could be easily implemented.
How many cameras does this method require? As far as I can tell from the paper it still generates from multi-view source data. I can't say for sure but it seems like a large number from what I can parse as a layman.
So, something we could easily see done at, say... an NBA event or football field... hell, I imagine I can think of some... adult use cases that would probably make a bundle off of tech like this if it can be optimized down... as my favorite youtuber would say... WHAT A TIME TO BE ALIVE!
Very cool renderings, but ironically my browser is having a heck of a time rendering their website. The short videos keep jumping around, starting and stopping randomly... which i guess is very VR.
Add volumetric sound, integrate VR and you almost have recreated braindance from the Cyberpunk 2077 game. Doesn't seem that far off in the distance.
The missing component from complete braindance would be integrating physical senses. AFAIK we are pretty far away from having anything revolutionary in that domain. Would love to be proven wrong, however.
If I’m understanding the paper correctly then the four dimensions are the position, density, radius, and color of the spheres in their volumetric model. So for any given viewing position and point in time, their model produces a 4D scene that is then rasterized to 2D.
don't forget to add the 3 color dimensions. (this may seem pedantic, but when doing feature extraction, these extra dimensions really are significant)
So's a video game, and we call that "real-time 3D". Time is mentioned, but it isn't counted again as a dimension, perhaps because any given momentary view is a time slice, not a time range like it is an XYZ range.
I think the difference is that in a video game you are in one location only at any given moment and things travel only forward in time. We can view from any location at any time in volumetric video.
In a lot of racing simulators you can change the position of the "virtual camera". It can be in the cockpit, on the hood, behind the car, and in some games in an arbitrary position. Usually replays allow you to see from other competitors' cars and from where TV cameras would sit in the real world.
CS:GO, TF2, GTA5 and Trackmania (and likely many more) have replay systems where you can pause, play and rewind with a freefly camera. Lots of games have a rewind mechanic: Grid, Baba is You & Viewfinder come to mind. Others have a "Photo Mode" where you can pause with a freefly camera: Starfield & Witcher 3 come to mind.
Valid, yeah. It occurs to me though that the difference is we are making a representation of the real world that can be manipulated like such, as opposed to a simulation of a fabricated world.
It's not a 3D model that is animated using a skeleton and keyframes like traditional 3D. It's many consecutive 3D models that create the illusion of continuous motion (aka video).
4D is the name that has come to describe the jump from static 3D models (photogrammetry) to 3D "video" models.
Time is the fourth dimension. The input data is a video, so the model learns the colors and the positions of the elements (basically points). You can render the scene from any angle at any time once the model is trained.
Downvoted at the time I see it, but actually correct. It's based on K-planes https://arxiv.org/pdf/2301.10241.pdf which effectively splits each space-time relationship off from the spatial relationship. It's just mathematics, guys. The original NeRF paper talked about a 5D coordinate. You know like a k-dimensional vector?
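For anyone counting dimensions, here's roughly how the inputs break down in this family of methods (NeRF-style conventions; a hedged sketch, not necessarily this paper's exact parameterization):

    import numpy as np

    def query_dynamic_field(x, y, z, t, theta, phi):
        """Illustrative stub: a dynamic ("4D") scene is indexed by a spatial
        point (x, y, z) plus time t; the view direction (theta, phi) is what
        brings the original NeRF's static-scene input up to a 5D coordinate
        (x, y, z, theta, phi). A real model would evaluate a trained
        network / feature grid here; this just returns dummy values."""
        rgb = np.zeros(3, dtype=np.float32)
        density = 0.0
        return rgb, density

    rgb, density = query_dynamic_field(0.0, 0.0, 0.0, t=0.5, theta=0.0, phi=0.0)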
Yea it's probably to have a catchy name and get some attention. Although it's technically accurate to call it 4D since it includes time, I think 3D video recording would probably get the point across to more people in a less sensationalist way.
Is it technically accurate? Seems like it's actually 6DoF view angles + time. The paper mentions a 4D view, 4D point cloud, dynamic 3D scene and 4D feature grid.
Related: there was a small project that did similar stuff with the Kinect v2 ~7 years ago that was really impressive for the time. https://github.com/MarekKowalski/LiveScan3D
Now that the Kinect v2 can be found for next to nothing and is very easy to mod to use without an expensive adaptor, it's a bit of a shame the project was abandoned; from what I've seen, the bigger limitations of the project can be overcome (only one Kinect per PC, mainly).
Watched the video for where the idea of IMGUI came from and it was frankly terrifying. I mean, just the assumption that the frame rate is fast enough that mouse up and mouse down occur in two different passes.
The code page leads to a repository that just has a README.md saying the source code is "coming soon"
If it actually works, this is huge. I'd be using it tomorrow.
But that first demo gif strikes me as something being off.
The algorithm isn't picking up on the legs in the background painted on the wall... In the paper, I don't understand how what they've built could differentiate between a picture of someone painted on a wall and the part of the scene that should be rendered in 3D.
This is my question as well. What's the input required to generate these 3D scenes? Is RGB video enough or does it also require spatial data? Is panning around the same scene enough or does it require multiple cameras?
I think there has been some serious misinterpretation of what 'real time' means in the context of this paper; and, possibly, that the researchers have avoided overt clickbait claims because they knew the term 'real time' would do the work for them.
This is not some neural codec that can convert any novel or unseen object live, like a kind of 3D YOLO - the paper mentions that it requires up to 24 hours of training on a per-case basis.
Nothing can be edited, no textures or movements - all you can do is remove people or speed them up or slow them down, and that's been possible in NeRF for a few years now.
What's funny here is the use of the word "uncrop"! I had never heard that word used before DALL-E 2 was released. And I've been working in computer graphics for 30 years lol. Also I watched a lot of Red Dwarf.
The effect is cool, but I must be the only person on this website who doesn't see a future for it.
Seems very niche, with massive data size restrictions, making it difficult to broadcast or stream on existing infrastructure.
But even if you solved the infrastructure problem, it feels like a gimmick that would be uninteresting pretty quickly.
Sporting events maybe benefit a bit from being able to find the right angle for any shot, but honestly they will probably just find the best angle and post that video as a clip.
Movement is per-object, meaning camera movement can be encoded as a vector while the scene it's moving within remains largely static; that leaves a lot of redundant data. Streams could be frustum-culled based on the user/camera's perspective. There's potential for high compressibility.
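A minimal sketch of the frustum-culling part (illustrative only; a real streaming pipeline would operate on compressed chunks rather than raw points):

    import numpy as np

    def cull_points(points: np.ndarray, planes: np.ndarray) -> np.ndarray:
        """Keep only points inside a view frustum.

        points: (N, 3) xyz positions.
        planes: (6, 4) frustum planes (a, b, c, d) with inward-facing normals,
                so a point is inside when a*x + b*y + c*z + d >= 0 for all six.
        """
        homo = np.concatenate([points, np.ones((len(points), 1))], axis=1)  # (N, 4)
        inside = (homo @ planes.T >= 0).all(axis=1)
        return points[inside]

    # Only the surviving subset would need to be streamed for the current viewpoint.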
One of my favorite things in VR is Google Maps; I like "walking" around in cities without leaving my house. I am longing for the day that we can also do this.
I'm keen that it gets the credit it deserves - I'm terrified it will stop working one day soon and the world will have lost a true wonder.
(They did just open up the underlying APIs so it would be possible to build a replacement now - although it's free in preview and pricing hasn't been announced - so no idea if it's economically viable)
How would you stream the output of something like this, if you wanted to? So that people could continue to change the viewpoints.
You couldn't possibly stream the full list of voxels generated by capturing the entire image with all of the cameras, right? That would probably exceed PCI bandwidth capabilities.
You'd need the server-side to generate models, send those models, and then stream the vectors?
Does anybody else get the impression that holograms are inevitable? This type of tech seems like the medium; now all we need is a good way of displaying them.
This is where my mind goes with all of these advancements. Always and immediately to how they will contribute to making a photorealistic Holodeck via mixed reality a closer reality.
Is a lenticular lens in front of a display really considered a hologram? I thought you needed to actually capture the light wave pattern to be a hologram, whereas a display is just colour and intensity.
Well, I didn't know that it was only 1D lenticular (they keep that under their hat!) But let's pretend for a second that it is 2D lenticular.
In that case, yes it is absolutely the same as a hologram. Consider this thought experiment. Take a real hologram. Cover everywhere up except a tiny opening - a pixel effectively. Now what information does this pixel encode? It's just colour as a function of view angle.
Now do the same thing for a 2D lenticular display. You can reproduce exactly the same thing - a colour that varies as a function of angle. Therefore it is the same.
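The thought experiment in code form, if that helps (purely illustrative): under this argument, both a hologram pixel and a 2D-lenticular pixel reduce to the same thing, a color as a function of view angle.

    import numpy as np

    # Purely illustrative: one "pixel" that returns a color depending on
    # the (quantized) direction you look at it from. Under the thought
    # experiment above, a masked-down hologram and a 2D lenticular pixel
    # both carry exactly this information and nothing more.
    class AngularPixel:
        def __init__(self, n_theta: int = 16, n_phi: int = 16):
            self.table = np.zeros((n_theta, n_phi, 3), dtype=np.uint8)

        def color(self, theta_idx: int, phi_idx: int) -> np.ndarray:
            return self.table[theta_idx, phi_idx]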
I guess you could consider LookingGlass to be a hologram in one dimension rather than two. Or alternatively it is a hologram if you promise never to move up or down!
LookingGlass' small portrait display is more like (2D) lenticular. Their larger displays use a microlens system that directs pixels out in beams in different directions. So you also get parallax in the up & down direction.