Chinese developer launches multimodal model unifying video, image, text

Xinhua | Updated: 2024-10-22 11:03

BEIJING -- The Beijing Academy of Artificial Intelligence (BAAI) on Monday released Emu3, a multimodal world model that unifies the understanding and generation of text, image, and video modalities with next-token prediction.

Emu3 successfully validates that next-token prediction can serve as a powerful paradigm for multimodal models, scaling beyond language models and delivering state-of-the-art performance across multimodal tasks, said Wang Zhongyuan, director of BAAI, in a press release.

"By tokenizing images, text, and videos into a discrete space, we train a single transformer from scratch on a mixture of multimodal sequences," Wang said, adding that Emu3 eliminates the need for diffusion or compositional approaches entirely.
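The idea in Wang's description can be illustrated with a toy sketch. The Python below is not BAAI's code: the vocabulary sizes, the special tokens and the helper names are all hypothetical, and the uniform "model" merely stands in for a trained transformer. It shows only how text tokens and discrete visual codes can share one ID space so that a single next-token objective covers the whole sequence.

```python
import math

# Hypothetical sizes, chosen for illustration only.
TEXT_VOCAB_SIZE = 1000        # text token ids occupy [0, 1000)
VISUAL_CODEBOOK_SIZE = 512    # codes from some discrete image/video tokenizer

# Hypothetical special tokens, placed after the text and visual ranges.
BOS = TEXT_VOCAB_SIZE + VISUAL_CODEBOOK_SIZE  # begin of sequence
EOS = BOS + 1                                 # end of sequence
BOI = BOS + 2                                 # begin of image
EOI = BOS + 3                                 # end of image
VOCAB_SIZE = BOS + 4


def visual_to_shared(code):
    """Shift a visual codebook index into the shared id space."""
    if not 0 <= code < VISUAL_CODEBOOK_SIZE:
        raise ValueError(f"visual code out of range: {code}")
    return TEXT_VOCAB_SIZE + code


def build_sequence(text_ids, visual_codes):
    """Lay out text and visual tokens as one flat sequence of shared ids."""
    visual_ids = [visual_to_shared(c) for c in visual_codes]
    return [BOS, *text_ids, BOI, *visual_ids, EOI, EOS]


def average_nll(seq, prob_fn):
    """Mean negative log-likelihood of each token given its prefix.

    prob_fn(prefix, target) must return the probability the model
    assigns to `target` after seeing `prefix`.
    """
    total = 0.0
    for i in range(1, len(seq)):
        total += -math.log(prob_fn(seq[:i], seq[i]))
    return total / (len(seq) - 1)


def uniform_prob(prefix, target):
    """Placeholder model: every token in the shared vocabulary is equally likely."""
    return 1.0 / VOCAB_SIZE


seq = build_sequence(text_ids=[5, 42], visual_codes=[0, 511])
loss = average_nll(seq, uniform_prob)
```

Because text and visual tokens live in the same vocabulary, the loss makes no distinction between predicting a word and predicting an image code. That is the sense in which one model trained on one objective can handle both understanding and generation.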

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, according to BAAI, which has open-sourced the key technologies and models of Emu3 to the international technology community.

Technology practitioners have said that a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models (LLMs).

"In the future, the multimodal world model will promote scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference," Wang said.

Copyright 1994 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form.
