Alibaba Cloud Open-Sources AI Video Generation Model 'Wan2.1'

Alibaba Cloud announced on the 27th that it is releasing its artificial intelligence (AI)-based video generation model 'Wan2.1' as open source, free of charge.



The Wan2.1 series recorded a composite score of 86.22% on the VBench leaderboard. [Photo = Alibaba Cloud]



Alibaba Cloud is releasing four models from the Wan2.1 series, the latest version of its video foundation model 'Tongyi Wanxiang', as open source. The models come in 14 billion (14B) and 1.3 billion (1.3B) parameter sizes, and the company says the release further enhances the openness and scalability of AI technology.

The four models released this time are T2V-14B, T2V-1.3B, I2V-14B-720P, and I2V-14B-480P, which are designed to generate high-quality images and videos from text and image input. They can be downloaded from ModelScope, Alibaba Cloud's AI model community, and from Hugging Face, a collaborative AI platform, and are free to use for academic researchers and enterprises around the world.



The T2V-14B model is optimized for high-quality video generation with complex motions, while the T2V-1.3B model balances generation quality against computational efficiency, making it well suited to developers conducting research and secondary development. The I2V-14B-720P and I2V-14B-480P models support image-based video generation in addition to text-based video generation.



The Wan2.1 series, released earlier this year, is the first AI video generation model to support text effects in both Chinese and English. It excels at realistic video generation, including precise handling of complex motions, improved pixel quality, adherence to physical principles, and more accurate execution of instructions.



Wan2.1 ranked first on the VBench leaderboard, a comprehensive benchmark for video generation models. It is the only open source video generation model among the top five on the VBench leaderboard hosted on Hugging Face.



According to VBench, the Wan2.1 series achieved a composite score of 86.22%, demonstrating top-level performance in key evaluation items such as naturalness of movement, spatial relationships, color expression, and multi-object interaction.



Alibaba Cloud explained that "training video generation AI models requires massive computing resources and a large amount of high-quality training data," adding that "open sourcing these models can lower the barriers to AI utilization, and enterprises can produce high-quality video content optimized for their businesses in a more efficient and economical way."





https://www.inews24.com/view/blogger/1818307
