Generate HD video up to 15 seconds with native synced audio and lip-sync, from text or a single image.
Resultados de muestra. Abre Studio para generar los tuyos.
Generate short video with native dialogue and lip-sync in a single pass, no separate audio pipeline.
Bring a single still image to life as a moving clip using image-to-video mode.
Produce 720p or 1080p clips up to 15 seconds straight from a text prompt.
Wan 2.6 is Alibaba's latest Wan video model, built for cost-effective HD video that arrives with sound already attached. It runs in two modes: text-to-video, where a written prompt is all you need, and image-to-video, where a single reference image guides the motion. You pick the output you want and Wan 2.6 handles the rest in one pass.
What sets it apart is native synchronized audio. Dialogue, sound effects, and lip-sync are generated together with the picture, so there is no separate audio step to stitch on afterward. Output is available at 720p and 1080p, in clip lengths of 5, 10, or 15 seconds.
On HexGen you choose your resolution and duration, add an optional reference image for image-to-video, and generate. Pricing is per second of output and tiered by resolution, so a short 720p clip costs less than a longer 1080p one. You always know what you are paying for before you run.
Copia, ajusta y ejecuta. Un buen prompt te lleva casi todo el camino.
A barista in a cozy cafe looks up at the camera and says good morning, steam rising from the espresso machine, warm light, soft ambient chatter in the background.
Waves crash against dark rocks at sunset, seagulls calling overhead, slow cinematic push-in over the shoreline.
A golden retriever bounds across a sunny park chasing a red ball, leaves crunching underfoot, bright cheerful daytime scene.
Paga solo por lo que generes. 1 USD = 1,000 HGcoins. Los HGcoins nunca caducan y las ejecuciones fallidas se reembolsan automáticamente.
How Wan 2.6 stacks up against other video models in the HexGen catalog. Ranks are relative across these siblings.
| Modelo | Calidad | Velocidad | Coste | Elígelo cuando |
|---|---|---|---|---|
Wan 2.6 Este Alibaba | El mejor | Rápido | Coste medio | Pick Wan 2.6 when you want HD video with native synced audio and lip-sync in a single pass, from text or one image. |
Kuaishou | Muy bueno | Rápido | Coste medio | A capable Kuaishou video model for general text-to-video and image-to-video work. |
ByteDance | Muy bueno | El más rápido | Coste bajo | ByteDance's video model for faster, lower-cost clip generation. |
Wan 2.6 is Alibaba's video model that generates HD video with native synchronized audio and lip-sync. It runs in text-to-video mode and image-to-video mode.