Feb 25, 2025 · Wan: Open and Advanced Large-Scale Video Generative Models In this repository, we present Wan2.1, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation. Wan2.1 offers these key features: Jul 28, 2025 · Wan: Open and Advanced Large-Scale Video Generative Models We are excited to introduce Wan2.2, a major upgrade to our foundational video models. With Wan2.2, we have focused on incorporating the following innovations: π Effective MoE Architecture: Wan2.2 introduces a Mixture-of-Experts (MoE) architecture into video diffusion models. VisoMaster is a powerful yet easy-to-use tool for face swapping and editing in images and videos. It utilizes AI to produce natural-looking results with minimal effort, making it ideal for both casual users and professionals. Video RepublicLabs.AI - multi-model simultaneous generation from a single prompt, fully unrestricted and packed with the latest greatest AI models. Based AI - AI Intuitive Interface for Video creating Promptchan.ai - Explore over 10 million NSFW AI Porn creations generated by our amazing community. LTX-Video is the first DiT-based video generation model that can generate high-quality videos in real-time. It can generate 30 FPS videos at 1216×704 resolution, faster than it takes to watch them. The model is trained on a large-scale dataset of diverse videos and can generate high-resolution videos with realistic and diverse content. The model supports image-to-video, keyframe-based Contribute to kijai/ComfyUI-WanVideoWrapper development by creating an account on GitHub. Feb 23, 2025 · Video-R1 significantly outperforms previous models across most benchmarks. Notably, on VSI-Bench, which focuses on spatial reasoning in videos, Video-R1-7B achieves a new state-of-the-art accuracy of 35.8%, surpassing GPT-4o, a proprietary model, while using only 32 frames and 7B parameters. This highlights the necessity of explicit reasoning capability in solving
video tasks, and confirms the About π¬ ε‘ε‘εεΉε©ζ | VideoCaptioner - εΊδΊ LLM ηζΊθ½εεΉε©ζ - θ§ι’εεΉηζγζε₯γζ ‘ζ£γεεΉηΏ»θ―ε ¨ζ΅η¨ε€ηοΌ - A powered tool for easy and efficient video subtitling. Introduced a novel taxonomy for Vid-LLMs based on video representation and LLM functionality. Added a Preliminary chapter, reclassifying video understanding tasks from the perspectives of granularity and language involvement, and enhanced the LLM Background section. A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018. - k4yt3x/video2x