LongCat-VideoA Unified Foundational Video Generation Model
LongCat-Video turns prompts, images, or partial footage into coherent, minutes-long videos with rich motion and consistent subjects—fast enough for production workflows.
LongCat-Video Features
Discover the powerful capabilities of LongCat-Video, the unified video generation model designed for professional workflows.
Unified architecture for multiple tasks
LongCat-Video unifies Text-to-Video, Image-to-Video, and Video-Continuation tasks within a single video generation framework. The LongCat-Video model natively supports all these tasks with a single architecture and consistently delivers strong performance across each individual task. With LongCat-Video, you get one powerful model for all your video generation needs.

Long video generation
LongCat-Video is natively pretrained on Video-Continuation tasks, enabling the LongCat-Video model to produce minutes-long videos without color drifting or quality degradation. LongCat-Video excels at generating extended narratives while maintaining visual consistency throughout.

Efficient inference
LongCat-Video generates 720p, 30fps videos within minutes by employing a coarse-to-fine generation strategy along both the temporal and spatial axes. The LongCat-Video architecture leverages Block Sparse Attention to further enhance efficiency, particularly at high resolutions, making LongCat-Video production-ready for demanding workflows.

Strong performance with multi-reward RLHF
Powered by multi-reward Group Relative Policy Optimization (GRPO), comprehensive evaluations on both internal and public benchmarks demonstrate that LongCat-Video achieves performance comparable to leading open-source video generation models as well as the latest commercial solutions. LongCat-Video's advanced RLHF training ensures optimal quality across all generation tasks.

LongCat-Video Show Cases
Discover how LongCat-Video empowers creators across various industries with unified video generation capabilities. Explore real-world examples showcasing LongCat-Video's powerful text-to-video, image-to-video, and video continuation features.
Text-to-Video Gallery






Image-to-Video Gallery






Long Video Gallery






Interactive Video Gallery
0:00 The bathroom is elegantly designed with marble countertops and a large, well-lit mirror above the sink. A variety of makeup products are neatly arranged on the countertop, and a plush towel hangs on a rack nearby. The woman is wearing a stylish black dress and has her hair styled in loose waves. The woman stands in front of the large mirror, gently adjusting its angle to get a better view. She appears focused, ensuring the mirror reflects her image perfectly.
0:06 The woman turns on the faucet and starts washing her hands.
0:11 The woman picks up the towel hanging on the wall.
0:16 The woman dries her hands while looking herself in the mirror.
0:00 A young man wearing a gray t-shirt and blue jeans is seated at a modern wooden desk in a well-lit room. The desk holds a sleek laptop, a pair of black headphones resting beside it, and a small potted plant in the corner. The walls are painted a soft white, creating a bright and airy atmosphere. The young man is focused as he types on the laptop keyboard with both hands, his fingers moving swiftly over the keys. The screen displays a document he is working on.
0:06 The man reaches out to the headphones and put them on.
0:11 The man closes the laptop.
0:16 The man stands up from the chair and moves away from the desk.
How to Use LongCat-Video
Get started with LongCat-Video in three simple steps. The LongCat-Video workflow is designed for both beginners and professionals.
Describe or reference
Write a detailed prompt, or start from a still image or a partial video. LongCat-Video understands subjects, style, motion, and camera intent. Input your creative vision into LongCat-Video's intuitive interface.
Generate & extend
Produce a first pass with LongCat-Video, then extend the sequence via LongCat-Video's video continuation feature to reach minutes-long narratives with consistent identity and look. LongCat-Video makes it easy to expand and refine your content.
Refine & export
Iterate on length, framing, and motion; export 720p/30fps edit-ready clips for your timeline. LongCat-Video keeps temporal coherence intact as you refine. The LongCat-Video export process ensures professional-quality output ready for your production pipeline.
Why Choose LongCat-Video
One Model for All Tasks
LongCat-Video's unified architecture handles Text-to-Video, Image-to-Video, and Video-Continuation in a single framework. With LongCat-Video, there's no need for multiple specialized models—LongCat-Video does it all.
Minutes-Long Videos
LongCat-Video generates high-quality, minutes-long videos without color drifting or quality degradation. The LongCat-Video model is built specifically for long-form video generation, making it ideal for extended narratives and story-driven content.
Fast & Efficient
LongCat-Video generates 720p, 30fps videos within minutes using coarse-to-fine generation and Block Sparse Attention for optimal performance at high resolutions. The efficient LongCat-Video architecture delivers professional-quality results faster than traditional methods.
State-of-the-Art Performance
Multi-reward RLHF with GRPO ensures LongCat-Video matches leading open-source models and commercial solutions in comprehensive benchmark evaluations. LongCat-Video's performance metrics demonstrate its position as a leading unified video generation model.
Loved by Creators Worldwide
Real feedback from creators using LongCat-Video for content creation, marketing, entertainment, and research
"LongCat-Video let our social team turn a single product shot into a minute-long hero sequence. Identity stayed locked, motion felt natural, and export was fast enough for our weekly drops."
"As a solo YouTuber, I used LongCat-Video to continue scenes from one still frame. I got smooth 720p/30fps clips without switching tools. It changed my production cadence."
"We prototyped ad concepts in hours, not days. LongCat-Video kept characters consistent across multiple beats, so our clients could approve story logic before we shot anything."
"LongCat-Video nailed long-form continuity for our explainer series. Scene extensions matched lighting and camera intent—no weird color drift, no identity collapse."
"I started with a single image and LongCat-Video handled image-to-video and continuation in one flow. The unified pipeline cut revision loops in half for our launch campaign."
"For game trailers, LongCat-Video is perfect for stitching beats together. It extends action with coherent motion, then lets us selectively regenerate moments to refine pacing."
"Client asked for "a longer story, same look." LongCat-Video kept wardrobe, lighting, and style consistent while we pushed duration past a minute. That's rare."
"LongCat-Video let our social team turn a single product shot into a minute-long hero sequence. Identity stayed locked, motion felt natural, and export was fast enough for our weekly drops."
"As a solo YouTuber, I used LongCat-Video to continue scenes from one still frame. I got smooth 720p/30fps clips without switching tools. It changed my production cadence."
"We prototyped ad concepts in hours, not days. LongCat-Video kept characters consistent across multiple beats, so our clients could approve story logic before we shot anything."
"LongCat-Video nailed long-form continuity for our explainer series. Scene extensions matched lighting and camera intent—no weird color drift, no identity collapse."
"I started with a single image and LongCat-Video handled image-to-video and continuation in one flow. The unified pipeline cut revision loops in half for our launch campaign."
"For game trailers, LongCat-Video is perfect for stitching beats together. It extends action with coherent motion, then lets us selectively regenerate moments to refine pacing."
"Client asked for "a longer story, same look." LongCat-Video kept wardrobe, lighting, and style consistent while we pushed duration past a minute. That's rare."
"LongCat-Video let our social team turn a single product shot into a minute-long hero sequence. Identity stayed locked, motion felt natural, and export was fast enough for our weekly drops."
"As a solo YouTuber, I used LongCat-Video to continue scenes from one still frame. I got smooth 720p/30fps clips without switching tools. It changed my production cadence."
"We prototyped ad concepts in hours, not days. LongCat-Video kept characters consistent across multiple beats, so our clients could approve story logic before we shot anything."
"LongCat-Video nailed long-form continuity for our explainer series. Scene extensions matched lighting and camera intent—no weird color drift, no identity collapse."
"I started with a single image and LongCat-Video handled image-to-video and continuation in one flow. The unified pipeline cut revision loops in half for our launch campaign."
"For game trailers, LongCat-Video is perfect for stitching beats together. It extends action with coherent motion, then lets us selectively regenerate moments to refine pacing."
"Client asked for "a longer story, same look." LongCat-Video kept wardrobe, lighting, and style consistent while we pushed duration past a minute. That's rare."
"LongCat-Video let our social team turn a single product shot into a minute-long hero sequence. Identity stayed locked, motion felt natural, and export was fast enough for our weekly drops."
"As a solo YouTuber, I used LongCat-Video to continue scenes from one still frame. I got smooth 720p/30fps clips without switching tools. It changed my production cadence."
"We prototyped ad concepts in hours, not days. LongCat-Video kept characters consistent across multiple beats, so our clients could approve story logic before we shot anything."
"LongCat-Video nailed long-form continuity for our explainer series. Scene extensions matched lighting and camera intent—no weird color drift, no identity collapse."
"I started with a single image and LongCat-Video handled image-to-video and continuation in one flow. The unified pipeline cut revision loops in half for our launch campaign."
"For game trailers, LongCat-Video is perfect for stitching beats together. It extends action with coherent motion, then lets us selectively regenerate moments to refine pacing."
"Client asked for "a longer story, same look." LongCat-Video kept wardrobe, lighting, and style consistent while we pushed duration past a minute. That's rare."
LongCat-Video FAQs
Frequently asked questions about LongCat-Video, the unified video generation model. Learn more about LongCat-Video's capabilities, features, and usage.
LongCat-Video is Meituan's unified video generation model for text-to-video, image-to-video, and video continuation, designed to produce minutes-long 720p/30fps outputs with efficient inference.