一级大片免费_成人免费观看在线_国产一区二区三区精品久久久无广告_久久99精品久久久久久青青91_com.黄_久久久久久久国产免费看

position: EnglishChannel  > AI ripples> Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Source: Science and Technology Daily | 2024-12-17 15:44:35 | Author: Gong Qian

On October 21, the Beijing Academy of Artificial Intelligence (BAAI), a Chinese non-profit organization engaged in AI R&D, released Emu3, a multimodal AI model that seamlessly integrates text, image, and video modalities into a single, unified framework.

The BAAI research team said Emu3 is expected to be used in scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference.

Emu3, based solely on next-token prediction, proves that next-token prediction can be a powerful paradigm for multimodal models.

The existing multimodal AI models are mostly designed for specific tasks. Each has its corresponding architecture and methods. For instance, in the field of video generation, many developers use the diffusion in time (DiT) architecture, as referenced by Sora. Other models such as Stable Diffusion are used for text-to-image synthesis, Sora for text-to-video conversion, and GPT-4V for image-to-text generation.

In contrast to these models, which have a combination of isolated skills rather than an inherently unified ability, Emu3, eliminates the need for diffusion or compositional approaches. By tokenizing images, text, and videos into a discrete space, BAAI has developed a single transformer from scratch.

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, surpassing flagship models such as SDXL and LLaVA.

In September, BAAI open-sourced the key technologies and models of Emu3 including the chat model and generation model after supervised fine-tuning.

Emu3 has been receiving rave reviews from overseas developers. "For researchers, a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models. This approach is akin to the transformative impact of transformers in vision-related tasks," AI consultant Muhammad Umair said on social media platform Meta.

While next-token prediction is considered a promising path towards artificial general intelligence, it struggled to excel in multimodal tasks, which were dominated by diffusion models such as Stable Diffusion and compositional approaches like CLIP combined with large language models.

Raphael Mansuy, co-founder of QuantaLogic, an AI agent platform, thinks that Em3 has significant implications for Al development. Mansuy wrote on X that Em3's success suggests several key insights: Next-token prediction as a viable path to general multimodal Al; potential for simplified and more scalable model architectures; challenge to the dominance of diffusion and compositional approaches.

Editor:GONG Qian

Top News

Large Unmanned Cargo Aircraft Makes its Debut

China's domestically developed tonne-class large unmanned transport aircraft recently completed its maiden flight in Shandong province, marking a significant advancement in the field of high-end unmanned aviation equipment.

Open Scientific Infrastructure: Catalyst for Intl. Sci-tech Cooperation

It is necessary to promote the opening up and sharing of scientific research infrastructure, make good use of multilateral mechanisms, and establish and improve international open sharing platforms, Chen Jiachang, China’s vice minister of science and technology, said at the Open Science International Forum, part of the 2025 Zhongguancun Forum Annual Conference, on March 28.

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網(wǎng)頁

您可以進(jìn)行以下操作:

1.將瀏覽器切換回極速模式

2.點(diǎn)擊下面圖標(biāo)升級或更換您的瀏覽器

3.暫不升級,繼續(xù)瀏覽

繼續(xù)瀏覽
主站蜘蛛池模板: 亚洲成人蜜桃 | 亚洲精品wwww| gay图片 | 亚洲国产精品综合久久20 | 国产黄色大片网站 | 探花论坛 | 日韩免费观看av | 国产小视频在线免费观看 | 黄色国产在线看 | 国产精品系列视频 | 国产成人久久久精品一区 | 欧美日本一 | 黄色麻豆 | 香蕉久久夜色精品国产 | 日本免费黄色 | 在线成人精品国产区免费 | 天天操人人看 | 成长av影片免费观看网站 | 国产美女极度色诱视频 | 国产网站在线 | 新加坡毛片 | 91大神视频在线免费观看 | 久久社区| 狠狠操很很爱 | 日韩欧美精品在线播放 | 色多多入口 | 日韩av在线免费看 | 久操视频免费观看 | 久久精品色播 | 91丨精品丨蝌蚪丨白丝jk | 四虎精品一区二区永久在线观看 | 国产美女久久久久 | 久久9视频 | 日韩一区二区三区免费观看 | 高清二区 | 99中文字幕 | 亚洲不卡1区 | 国产熟睡乱子伦午夜 | www.69国产| 少妇毛片一区二区三区 | 欧美日韩国产一区二区三区在线 |