As far as I'm aware, individual AI videos are currently limited to 8 seconds' duration (unless you're an extremely clever person who can seamlessly splice multiple videos together!), which gives the video creator very little time to convey the story they're trying to tell.
Bearing that in mind, please consider which of the following options most closely matches your view regarding generated characters speaking during AI videos. It's possible either to make characters say specific sentences or to provide the AI engine with a general idea of what's being said and it comes up with the actual words.
It's assumed that non-spoken vocalisations like laughing etc are accepted.
"talking or not" Most of the time, real or AI, I have the sound off. I prefer static images, or progressions, because I can make up my own stories, often muiltiple ones for the same images