Wan2.1-T2V-1-3B_T2V-1_3B

0
0
0
0
PhotographyBoyStyle Boost
Cập nhật gần đây: Đăng tải lần đầu:
Photography,Boy,Style Boost,Checkpoint,Wan VideoImage info
Photography,Boy,Style Boost,Checkpoint,Wan VideoImage info


Wan2.1 offers these key features:

👍 SOTA Performance: Wan2.1 consistently outperforms existing open-source models and state-of-the-art commercial solutions across multiple benchmarks.

👍 Supports Consumer-grade GPUs: The T2V-1.3B model requires only 8.19 GB VRAM, making it compatible with almost all consumer-grade GPUs. It can generate a 5-second 480P video on an RTX 4090 in about 4 minutes (without optimization techniques like quantization). Its performance is even comparable to some closed-source models.

👍 Multiple Tasks: Wan2.1 excels in Text-to-Video, Image-to-Video, Video Editing, Text-to-Image, and Video-to-Audio, advancing the field of video generation.

👍 Visual Text Generation: Wan2.1 is the first video model capable of generating both Chinese and English text, featuring robust text generation that enhances its practical applications.

👍 Powerful Video VAE: Wan-VAE delivers exceptional efficiency and performance, encoding and decoding 1080P videos of any length while preserving temporal information, making it an ideal foundation for video and image generation.

This repository hosts our T2V-1.3B model, a versatile solution for video generation that is compatible with nearly all consumer-grade GPUs. In this way, we hope that Wan2.1 can serve as an easy-to-use tool for more creative teams in video creation, providing a high-quality foundational model for academic teams with limited computing resources. This will facilitate both the rapid development of the video creation community and the swift advancement of video technology.


Thảo luận

Phổ biến nhất
|
Mới nhất
Gửi
Đến sớm
Tải xuống
(0.00KB)
Chi tiết
Loại
Số lần tạo hình ảnh trực tuyến
0
Tải xuống
0
Tham Số Đề Xuất
Sampler method
CFG
6.8
VAE
Không

Bộ sưu tập hình ảnh

Phổ biến nhất
|
Mới nhất