Video generation models as world simulators

This technical report focuses on (1) our method for turning visual data of all types into a unified representation that enables large-scale training of generative models, and (2) qualitative evaluation of Sora's capabilities and limitations. Model and implementation details are not included in this report.

Much prior work has studied generative modeling of video data using a variety of methods, including ...
