Beijing Academy of AI unveils next-gen multimodal model Emu3, achieving unified understanding and generation of video, images and text. Emu3 focuses on predicting the next part of a sequence, removing …