Hacker News new | past | comments | ask | show | jobs | submit login
Sony Pictures to Use AI to Produce Movies and Shows in "More Efficient Ways" (hollywoodreporter.com)
33 points by adrian_mrd 6 months ago | hide | past | favorite | 52 comments



"AI" ia very efficient at creating the regurgitated slop the movie industry has become known for these days. Hopefully such an abundance will focus the audiences eye on creative regional works that could never be made with ML.


If something like SORA gets so good that anyone can make content, I’d imagine that we will get an explosion in the amount of content created. Right now, the niche you like might not get a lot of shows or movies created due to cost. But if you really like the Dune movies, maybe they’ll make one every 6 months instead of every 3 years.


I want art and cinema, not "content". Bottom-of-the-barrel made-for-streaming "shows" are somehow worse than mid-'00s vehicles for commercials every few minutes.


Yes, you get an explosion of content, but of the mediocre kind and focused on the average viewer.


The assumption is that while it’s easier to make mediocre content, it also makes it far faster and cheaper to make quality content.


You also get more creators who didn't even know that they are. How many channels you could get before youtube and twitch? 300?


Impregnate 9 woman to obtain a child in a month instead of 9 months.


Faster iteration times will reduce total time to produce things, this isn't adding more parallel processes this is making the process itself faster.

If we gene edited humans to grow in the womb over 1 month then yes we can get a baby in 1 month.


No clue why you’re making that statement.

We have ample evidence of technology increasing productivity.


He's saying that you get something under-developed in a month versus something fully-formed and viable in nine.


Why can’t AI help us make something fully formed in a month instead?

There is a quality bar for Sony Pictures I assume.

My point is that we constantly invent technology to increase productivity. In this case, all I’m saying is that the cost to make a movie of the exact same quality should decrease with better tools.


Most of the blockbusters of the past decade, in particular "superhero movies", could have been made by AI and we wouldn't know the difference. Arguably the blockbuster industry was always low in creativity, but right now big studios don't even have courage to create truly new IP - making movies a perfect target for recycling content through AI.

Unfortunately, I don't see creative regional works growing as a result... Unless there is a way to channel these straight to the streaming services in some profitable way.


Yea, but there is room for both. Sometimes you just wanna rest your brain with a super predictable movie where you can tell from the first minutes exactly what will happen.


AI can create creative regional works. Anything a human being can create an AI can too.


In principle yes; in practice you can tell they're not yet "anything you can do I can do [better, cheaper, faster]" by observing that most of us still have jobs.


> Anything a human being can create an AI can too.

Not yet. And the more niche the longer it will take for AI to be able to do it since you have less data for it.


I think this is true in principal but in practice the current crop of AI really, really struggles to produce anything novel. And given the way the AI is trained, that kind of makes sense.


For the current crop of LLM it's more that anything a human has created before, an AI can mimic.


But it is all derrivative. No way to produce something actually new.


Good luck finding anything that is actually 100% new with everything that has been produced already, AI or not.


Lets say a human work is 1% new, 99% derrivative. AI work is 100% derrivative.


I don't see much of a difference between both here and that's why the result is nearly indistinguishable.

Everything is derivative nowadays, AI or not. That's even how the schools themselves are structured.


That is the entire point of school? To make sure that no original work was created, as it threatens the status quo.


I think that's more a case of school valuing what is easy to measure.


“2h movie about batman, with a double twist in the end.”

I am a director now.


Super easy, barely an inconvenience.


Actually, Github is director now.


I envision a near future where we strap on a VR headset and experience fully immersive content generated by prompts.

So instead of buying a movie in a video format, we buy prompts.

A prompt can be something like “a horse enters the room”. But because it’s generated on the fly, the horse entering will walk around your kitchen island so it doesn’t bump into it.

So everyone’s experience will be slightly different but the story remains the same.


Can we do this without messing up our hair and makeup? What is the benefit over a traditionally produced movie? Why do I want this experience, what makes it superior? Has the general public been demanding that their content work this way, that the biggest pain point with traditional content is the inability to generate it on the fly?

The status quo is that I can pay something like $10 a month to get access to hundreds of critically-acclaimed shows crafted be extremely skilled specialists that I can watch on my $300 55” 4K TV screen. The average American spends about 5 hours watching content (TV and digital) per day.

So let’s take the Vision Pro at $3500. We’re going to make the battery life 5x longer, we will also remove any straps or pieces of the device that touch your face since that messes up hair and makeup, so we can say now it’s a pair of glasses. And we’re going to reduce the cost from $3500 to $300 to match the price of a 55” TV from Costco. We will also add on-device AI processing so that this product can deliver on your vision. Which technology advancements in the next 10-20 years are going to make something like that remotely possible?

In the 15 years that we’ve had modern smartphones on the market when have they ever dropped in price by 10x or gained battery life by 5x? I can’t seem to buy a new iPhone for $60 that gets a week of battery life.

These sci-fi future visions are hilarious in how shortsighted they are, like when The Jetsons “predicted” living in the sky with flying cars. Of course the problem is that doing something like that has no benefits and a laundry list of drawbacks and impracticalities.


Let me ask you, is watching content on a 2D display the best way to experience a crafted story?

If your answer is yes, nothing I say will convince you.

If your answer is no, then something will replace content on 2D screens for story telling.


The answer is neither, because 2D stories are crafted under a specific paradigm which everyone "gets" with time, so it's a default but not a natural one. E.g. people don't act irl like in movies/animations and events don't get viewed from efficient angles with repetitions and timely attention switches. So no, 2D isn't the best way cause we had to adapt 3D-to-3D to it.

On your initial comment, there's already a {f|c}ringe group of LLM users who use [E]RP to participate in... scenarios (text, can generate pics using SD). It's hardly googlable, you may or may not want to dig into this, I warned.


I can fully accept that what VR gives you is far more immersive, but if you’ve got no answer to the laundry list of downsides. It doesn’t really matter that it’s better at some point. We as the mainstream consumer don’t all run out and buy racing simulator rigs just because it’s better than playing on an Xbox controller.

Plenty of higher technology products fail when simple and cheap solutions compete with them.

If you want to tell me that 3D, depth, and space is main the draw, I could point to the various failures of 3D movies and television that were supposedly the technologies of the future.

I used to own a VR headset, a Valve Index. I loved it for what it was. Games on that system were immersive like nothing I had experienced before.

But there were numerous downsides.

Cost, not really solved by today’s products.

Eye fatigue, not solved even by the Vision Pro.

Group experience. Not really solved unless you’re a total hermit. Am I going to go to enjoy a SuperBowl party where all the attendees are wearing a VR headset? How does VR benefit the experience of watching shows with family/friends on the couch? Is it an upgrade to put on individual headsets and using virtual avatars? How do I cuddle and kiss my spouse while we watch a movie with a headsets on? How do I watch VR content at a bar or restaurant?

Multitasking: At best not an upgrade, at worst a downgrade. If I use a TV I can watch the show talk to my friend and order toilet paper on my smartphone that I already own at the same time. What upgrade am I getting there with VR?

Battery life. Not solved. Can I even get through a SuperBowl game on a headset? Is my SuperBowl party going to be a group of 30 people standing around the power outlet?

Nausea and VR acclimation. We could call this partially solved. A pretty high percentage of the population just can’t do VR physically.

Sweaty face, messed up hair, not solved. Period.

Compatibility with corrective vision: not really solved, every headset needs custom prescription lenses or needs space to fit glasses (this was uncomfortable in my Index, I was lucky to be able to wear contacts).

Visual fidelity: not solved, a standard TVs still beat the Vision Pro at pixels per FOV. 4K per eye includes all the periphery vision that doesn’t represent the main focus of your content.

Profit, not solved. The only companies making a profit on VR are companies like Valve that aren’t burning cash on R&D. How long will public companies like Meta and Apple tolerate wildly unprofitable product lines? which Apple product that sold under 500,000 units at launch is still with us today? Maybe the Mac Pro?

Let’s also not forget that high technology in the medium isn’t typically the driver of what is popular in content. The 5th highest grossing film in 2023 was a film shot on analog film technology that primarily used practical effects. The most watched show in 2023 was a cop procedural that has been on antenna television for over 20 years. The top selling video game of all time is Minecraft. Nobody really cares about being blown away by immersive visuals when they just want to watch deckhands on a chartered yacht reality show sleep around and gossip.


It's fine if you're a VR skeptic. But I never gave a timeline on when my vision might come true.

Do you think we'll have solved the problems you outlined in 5 years? 8 years?

I'd rather focus on the actual vision I laid out rather than the limitations we have today.


Maybe the timeline isn't important to you, but the timeline is the most important thing because it determines whether public companies continue to invest in VR/XR as a money-losing business. If it takes too long to realize this vision it will not be a viable business. Every dollar invested in a technology is a dollar that can be invested in an alternative that returns its investment faster.

Microsoft and Google backed out of AR/VR and they were early adopters. Why would mega-tech companies with "unlimited money" like that back out of the market if it's so promising? Microsoft even seemed to have the B2B/industrial AR market cornered and they just dropped the whole thing. But then again, why should Microsoft spend $10 to make $10 on AR when they can spend $10 on Azure and make $20 selling basic compute services?

Meta is spending 5x more on R&D than it is making back in revenue from its Reality Labs division. This is 5 years after they launched their first consumer device, an eternity in technology. Compare that to the smartphone business where Apple was making wild profits from day one.

It's very important to note just how capital intensive VR/XR is to develop and deliver in a way that's truly reality-bending. The development time and effort is closer to a commercial airplane than most traditional technology products, like PCs that are designed with generally off-the-shelf parts and software. Seriously: Boeing spent $32 billion to develop the 787, Meta burns that on VR R&D in just 2 or 3 years and they aren't selling planes to make up for it.

So, the answer to your question is, absolutely not. These problems won't be resolved in 5-8 years. Too few have been resolved in the past 5. It's just all moving too slowly.

Let's just single out the battery life issue as an example of why these problems can't be resolved fast enough. Apple's silicon is the best product out there for VR/XR but there hasn't been a generational leap in their computing power since the M1 chip. Apple is constrained by the lithography that TSMC can produce so each new generation of processor is in that 10-20% range of improvement. Therefore we can expect their processing to be no better than about twice as fast as their current chips in around 5-8 years. Battery technology won't progress in density significantly between now and then, so what you're left with is the Vision Pro of 5 years from now possibly doubling the battery life or shrinking the battery so it doesn't need to be external anymore while delivering pretty comparable battery life. It's still going to be a situation where you need a big bulky ski goggle thing on your head messing up your hair and makeup.

Finally, we have to talk about sheer market size numbers and just how wildly far off VR/XR is from the kind of numbers you see with smartphones, TVs, etc. You need an almost impossible growth rate that just hasn't showed up even though we're on the third iteration of the market leader's product line, which is sold at a relatively affordable game console-like price.

According to Statista the total size of the XR market installed base is 50 million units as of 2024. They expect the total size to be 100 million by 2026.

Let's compare that to 5 years after the iPhone launched, 2012. Smartphone sales were 600 million in 2012 alone. The global television market is sized at about $250 billion (which is not that bad for devices that have an average selling price well under $1000 and last much longer than a smartphone or computer).

We've seen what it looks like when Apple launches a successful product. The Apple Watch and AirPods were flawed products in their first iteration but we all IMMEDIATELY saw them EVERYWHERE within a year or two of their launch. The Vision Pro is approaching 6 months past the launch date, and it's supposed to be the first immersive spatial computer where you can be productive anywhere anytime, the ultimate movie system for the airplane, the ultimate mobile workstation, and in all my time living in an affluent city with lots of people and going to the airport almost every month I've NEVER seen one in a single coffee shop, coworking space, on the street, on the plane, nothing. Which makes a ton of sense when we find out that Apple was barely able to sell 500,000 of them, and we hear rumors that they are delaying/canceling next year's model.


I'm failing to see how that's anywhere near equivalent to watching a movie.


You’re still watching/experiencing the story. Just that the prompts will take into account your current environment.

Right now, we are limited to a 2D screen for experiencing story telling. But I think technology is converging where we might completely change the way movies are presented.


Entertainment that tells a tale or creates an emotion.

Like you, I'm not sure I'd call it "like a movie"… but also not "like a video game" — it might be a new thing, part way between, that takes up time that would otherwise be spend on the aforementioned entertainment.


If you translate this to movies, then what you're buying is a script. Or if you want to keep it generic over experiences, a trailer prompt.

This feels like a future where it will be very easy to get it wrong and you end up with terrible content. An AI needs a personality to make interesting content or else it just grabs the bell-curve of what "personality" it has derived from its training. Then again, what is the right personality to make a great movie in Y genre? How can an AI possibly get this right when by definition it's an "average of learnt inputs"?


Yep, you’re buying a well crafted script and the script generates the content on the fly.

That’s what I envision.

I’m not sure about the “easy to get it wrong” part. I assume we will figure out how to control the experience such that it will always be a consistently good experience.

I’m not referring to an AI generating the script. The script is still crafted by humans - maybe with specific tools.


Interactive storytelling, while recently mainstream due to netflix, didn’t get much traction in the video context. 3d TVs famously flopped. I doubt interacting with immersive content would replace 2d films. Perhaps it would coexist as another medium.


"Interactive storytelling" has been mainstream for the last few decades already. We call it video games.


We call it visual novels and it’s not that mainstream. Other video game genres can have interactive storytelling too, but it’s bundled with other aspects, most importantly gameplay.


I can’t predict anything.

But I recently got a Vision Pro demo and watched some 3D scenes. I thought it was amazing and clearly more entertaining than a 2D.

Then I saw SORA and thought this is amazing too.

So combine them together and we might have a new way of experiencing stories. We don’t have to watch stories on a 2D screen. We can “experience” them in our own environment. Maybe share the same experience with friends and family who also have a VR headset on.


> strap on a VR headset and experience fully immersive content generated by prompts

this is pretty much the only way we'll be able to keep entire colonies of people entertained during multi-generational journeys to distant stars


At any given required quality level, I'd give 50-50 odds that "creating scenes from prompts" is easier/harder than "creating list of prompts from vague script idea".

So we might not buy anything.


I ain't strappin' shit to my face, chief.


So I'm just hearing "fuck Sony and any other money-grubbing megacorp or persons with zero appreciation of the arts".


Another reason not to watch Sony Pictures movies then. Good.

What studies like them don't understand is: once you go AI one day we'll go all-in on AI. Why should I bother watching some Sony Pictures movie a writer and direction came up with if I can just AI to create one for me in the first place with the parameters I like.

If it's all fake anyway...


There exists a transition period where GenAI isn’t good enough to produce quality content people want to pay for. But it might help studios create content that people are willing to pay for in a more efficient way.

So yes, in the future, creating 2D movies/shows will be trivial.


It will still cost crazy amount of computing power. The economics dont allow it.


Instead of making a logically sound movie, perhaps it would be more efficient to generate animated visual experiences that are meant to invoke certain experience. I wonder if it would be possible to generate randomly looking noise that invokes awe and joy directly.


See Jackson Pollock for a huge divide over whether people think it is art, evocative, or just drips of paint that mean nothing.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: