This post is more for the techies here, but check out this paper and associated model on Hugging Face. It's not quite packaged up as a user-facing product yet, but the core technical requirements of generating minute-plus videos while maintaining context, consistency and quality appear to have been solved! The implications should be kind of obvious!
As I always say, a few years from now we will think it was "cute" to make one-minute vids like this as A.I. takes us into the age of 30-minute (and longer) vids.
The impetus for this is surely commercial film and media. You can imagine how much advertising agencies, for example, would love to be able to generate a commercial entirely with AI.
Gloopsuit said: The impetus for this is surely commercial film and media. You can imagine how much advertising agencies, for example, would love to be able to generate a commercial entirely with AI.
They already are. Coca-Cola released an advert early in the year, and most advertising shops now use AI to mock up and storyboard. You also have to consider that current public models aren't necessarily the same models being shown and used by professional studios; public models have been quantised and shrunk down. All that said, the biggest limitation is censorship on public models. It doesn't matter how good they are when they're overly censored. Open-source models are great and all, but they're severely limited by how much memory is available on GPUs. That is going to take a lot longer to catch up and become cheaper.
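To make the memory point concrete, here's a minimal sketch of the usual trick for squeezing an open-weights video model onto a consumer card: load the weights at reduced precision and offload idle modules to system RAM. The model ID and the call arguments are placeholders (the exact kwargs depend on the pipeline class), so treat it as an illustration rather than a recipe.

```python
# Sketch: fitting an open-weights video model into consumer VRAM.
# Model ID and call arguments are illustrative placeholders.
import torch
from diffusers import DiffusionPipeline

MODEL_ID = "some-org/open-video-model"  # hypothetical checkpoint name

pipe = DiffusionPipeline.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # half the memory of fp32 weights
)

# Keep only the sub-module currently in use on the GPU; everything else
# waits in system RAM. Slower per step, but often the difference between
# "runs on 24GB" and "doesn't run at all".
pipe.enable_model_cpu_offload()

# Video pipelines typically return generated frames on the output object;
# the exact arguments (frame count, steps, resolution) vary by model.
result = pipe(prompt="waves rolling onto a beach at sunset")
```

The heavier options (4-bit/8-bit quantised checkpoints, sequential offload, tiled decoding) trade even more speed for memory, which is exactly the squeeze being described.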
Yeah, this is the main point: the big foundation models, served by public APIs, are fine for most advertising tasks.
I disagree with your timeline a bit though. Distillation has been persistently underestimated, and right now the best foundation video models are only marginally better than the best open-weights models. We're also not limited by people's home GPUs when cloud services like Runpod are considered.
dionysus said: Yeah, this is the main point: the big foundation models, served by public APIs, are fine for most advertising tasks.
I disagree with your timeline a bit though. Distillation has been persistently underestimated, and right now the best foundation video models are only marginally better than the best open-weights models. We're also not limited by people's home GPUs when cloud services like Runpod are considered.
Google is flying ahead due to their investment in TPUs; it's the reason they can serve Veo so cheaply and start adding speech and audio. Wan2.2 is phenomenal for what it is, but I'd argue it's at least a generation behind Veo 3 and severely bottlenecked by resources. You're right though, you could throw H100s at it, but who's going to do that? I'm lucky enough to have an RTX Pro 6000 with 96GB, and it's still time-consuming to train Qwen and Wan2.2 at any decent precision. Nvidia could easily have released consumer cards with 48GB+ but chose not to rather than cannibalize their pro cards. Without competition, that's really going to slow open-weight models down. The number of LoRAs and finetunes of the larger Qwen, HiDream and WAN models is significantly lower than for SDXL and even Flux, because people can't afford to train them.
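Some rough arithmetic on why the VRAM ceiling bites so hard for training. The parameter count, optimizer cost and LoRA trainable fraction below are all assumptions for illustration, and activations, latents and the text encoder come on top of these numbers.

```python
# Back-of-envelope VRAM estimate: full fine-tune vs LoRA on a ~14B model.
# All figures are rough assumptions, not measurements.

params_billion = 14        # assumed parameter count for a Wan2.2-class transformer
bytes_weight_bf16 = 2      # bf16 weights
bytes_grad_bf16 = 2        # bf16 gradients
bytes_adam_state = 8       # two fp32 moment estimates per trained parameter

# Full fine-tune: every parameter needs a gradient and optimizer state.
full_ft_gb = params_billion * (bytes_weight_bf16 + bytes_grad_bf16 + bytes_adam_state)
print(f"Full fine-tune (weights + grads + optimizer): ~{full_ft_gb} GB")   # ~168 GB

# LoRA: base weights are frozen and only small low-rank adapters are trained,
# so gradient/optimizer memory shrinks to a fraction of the above.
lora_fraction = 0.01       # assume ~1% of parameters are trainable adapters
lora_gb = params_billion * bytes_weight_bf16 \
        + params_billion * lora_fraction * (bytes_grad_bf16 + bytes_adam_state)
print(f"LoRA fine-tune (same terms): ~{lora_gb:.0f} GB")                   # ~29 GB
```

Even the LoRA case leaves little headroom on a 24GB card once activations are counted, which is why a 96GB card, or the 48GB consumer cards that never shipped, makes such a difference to who can train these at all.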