Video
Alibaba

Wan 2.6

Generate HD video up to 15 seconds with native synced audio and lip-sync, from text or a single image.

From 750 HGcoins / generation · pay per generation, no subscription
Examples

Created with Wan 2.6

Sample results. Open Studio to generate your own.

What it's for

Where Wan 2.6 shines

Talking Clips

Generate short video with native dialogue and lip-sync in a single pass, no separate audio pipeline.

Image to Motion

Bring a single still image to life as a moving clip using image-to-video mode.

HD Shorts

Produce 720p or 1080p clips up to 15 seconds straight from a text prompt.

Strengths

  • Runs both text-to-video and image-to-video, the latter guided by a single optional reference image
  • Generates native synchronized audio with lip-sync in the same pass, with no separate audio step
  • Outputs HD video at 720p and 1080p
  • Supports clip lengths of 5, 10, or 15 seconds

Trade-offs

  • Priced per second of output, so longer clips cost proportionally more
  • 1080p costs more than 720p
  • Image-to-video accepts at most one reference image
  • No aspect-ratio control is exposed in the HexGen form
Specifications

At a glance

Type: Text-to-video and image-to-video
Vendor: Alibaba (Wan)
Resolution: 720p or 1080p
Duration: 5, 10, or 15 seconds
Reference images: Up to 1 (optional, for image-to-video)
Audio: Native synchronized audio with lip-sync
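
The limits listed above can be checked before a job is submitted. The following is a minimal sketch, assuming a hypothetical `WanRequest` container and `validate` helper; neither is part of any HexGen or Alibaba API, and only the allowed values come from the specifications.

```python
from dataclasses import dataclass

# Limits taken from the specifications above.
ALLOWED_RESOLUTIONS = {"720p", "1080p"}
ALLOWED_DURATIONS_S = {5, 10, 15}
MAX_REFERENCE_IMAGES = 1


@dataclass
class WanRequest:
    """Hypothetical request container, for illustration only."""
    prompt: str
    resolution: str = "720p"
    duration_s: int = 5
    reference_images: tuple = ()


def validate(req: WanRequest) -> list:
    """Return a list of human-readable problems; an empty list means the request fits the specs."""
    errors = []
    if req.resolution not in ALLOWED_RESOLUTIONS:
        errors.append(f"resolution must be one of {sorted(ALLOWED_RESOLUTIONS)}")
    if req.duration_s not in ALLOWED_DURATIONS_S:
        errors.append(f"duration must be one of {sorted(ALLOWED_DURATIONS_S)} seconds")
    if len(req.reference_images) > MAX_REFERENCE_IMAGES:
        errors.append(f"at most {MAX_REFERENCE_IMAGES} reference image is accepted")
    return errors


def mode(req: WanRequest) -> str:
    """A reference image selects image-to-video; otherwise the request is text-to-video."""
    return "image-to-video" if req.reference_images else "text-to-video"
```

For example, `validate(WanRequest(prompt="Waves at sunset", resolution="1080p", duration_s=10))` returns an empty list, while a request with two reference images reports one problem.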

About Wan 2.6

Wan 2.6 is Alibaba's latest Wan video model, built for cost-effective HD video that arrives with sound already attached. It runs in two modes: text-to-video, where a written prompt is all you need, and image-to-video, where a single reference image guides the motion. You pick the output you want and Wan 2.6 handles the rest in one pass.

What sets it apart is native synchronized audio. Dialogue, sound effects, and lip-sync are generated together with the picture, so there is no separate audio step to stitch on afterward. Output is available at 720p and 1080p, in clip lengths of 5, 10, or 15 seconds.

On HexGen you choose your resolution and duration, add an optional reference image for image-to-video, and generate. Pricing is per second of output and tiered by resolution, so a short 720p clip costs less than a longer 1080p one. You always know what you are paying for before you run.
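
The per-second, resolution-tiered pricing described above can be sketched as a small calculation. The function name `estimate_cost` and the rate table are assumptions made for illustration; the rates in particular are placeholders, not HexGen's actual prices, which this page does not list per second.

```python
def estimate_cost(duration_s: int, resolution: str, rate_per_second: dict) -> int:
    """Estimate a clip's cost in HGcoins as duration times the per-second rate for its resolution."""
    if resolution not in rate_per_second:
        raise ValueError(f"no rate configured for resolution {resolution!r}")
    return duration_s * rate_per_second[resolution]


# PLACEHOLDER rates (HGcoins per second), chosen only to show the shape of the calculation.
example_rates = {"720p": 10, "1080p": 20}

short_720p = estimate_cost(5, "720p", example_rates)
long_1080p = estimate_cost(15, "1080p", example_rates)

# Whatever the real rates are, a short 720p clip costs less than a longer 1080p one
# as long as the 1080p rate is at least the 720p rate.
assert short_720p < long_1080p
```

Passing the rate table in as an argument keeps the sketch independent of any particular price list.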

Prompt ideas

Starting points

Copy, tweak, and run. A good prompt gets you most of the way there.

A barista in a cozy cafe looks up at the camera and says good morning, steam rising from the espresso machine, warm light, soft ambient chatter in the background.

Waves crash against dark rocks at sunset, seagulls calling overhead, slow cinematic push-in over the shoreline.

A golden retriever bounds across a sunny park chasing a red ball, leaves crunching underfoot, bright cheerful daytime scene.

Pricing
750 HGcoins / generation · ≈ $0.75

Pay only for what you generate. 1 USD = 1,000 HGcoins. HGcoins never expire, and failed runs are refunded automatically.
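
The exchange rate above makes the conversion a single division. A minimal sketch, using hypothetical helper names and `Decimal` to avoid floating-point rounding on currency values:

```python
from decimal import Decimal

# Exchange rate stated on this page: 1 USD = 1,000 HGcoins.
HGCOINS_PER_USD = 1000


def hgcoins_to_usd(coins: int) -> Decimal:
    """Convert an HGcoin amount to US dollars."""
    return Decimal(coins) / HGCOINS_PER_USD


def usd_to_hgcoins(usd: str) -> int:
    """Convert a dollar amount, given as a string such as "0.75", to whole HGcoins."""
    return int(Decimal(usd) * HGCOINS_PER_USD)


print(hgcoins_to_usd(750))     # 0.75
print(usd_to_hgcoins("0.75"))  # 750
```

This matches the listed price: 750 HGcoins is about $0.75.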

Compare

Wan 2.6 vs. other models

How Wan 2.6 stacks up against other video models in the HexGen catalog. Ranks are relative across these siblings.

Model | Quality | Speed | Cost | Pick it when
Wan 2.6 (this model), Alibaba | Best | Fast | Medium cost | Pick Wan 2.6 when you want HD video with native synced audio and lip-sync in a single pass, from text or one image.
Kuaishou | Very good | Fast | Medium cost | A capable Kuaishou video model for general text-to-video and image-to-video work.
ByteDance | Very good | Fastest | Low cost | ByteDance's video model for faster, lower-cost clip generation.
In short: pick Wan 2.6 when you want HD video with native synced audio and lip-sync in a single pass, from text or one image. Otherwise, one of the other models above will be a better fit.

Frequently asked questions

Wan 2.6 is Alibaba's video model that generates HD video with native synchronized audio and lip-sync. It runs in text-to-video mode and image-to-video mode.