Vidéo
Alibaba

Wan 2.6

Generate HD video up to 15 seconds with native synced audio and lip-sync, from text or a single image.

À partir de 750 HGcoins / génération·paiement à la génération, sans abonnement
Exemples

Créé avec Wan 2.6

Exemples de rendus. Ouvrez Studio pour générer les vôtres.

À quoi ça sert

Là où Wan 2.6 excelle

Talking Clips

Generate short video with native dialogue and lip-sync in a single pass, no separate audio pipeline.

Image to Motion

Bring a single still image to life as a moving clip using image-to-video mode.

HD Shorts

Produce 720p or 1080p clips up to 15 seconds straight from a text prompt.

Points forts

  • Runs both text-to-video and image-to-video from a single optional reference image
  • Generates native synchronized audio with lip-sync in the same pass, with no separate audio step
  • Outputs HD video at 720p and 1080p
  • Supports clip lengths up to 15 seconds, in 5, 10, or 15 second options

Compromis

  • Priced per second of output, so longer clips cost proportionally more
  • 1080p costs more than 720p
  • Image-to-video accepts at most one reference image
  • No aspect-ratio control is exposed in the HexGen form
Specs

En un coup d'œil

Type
Text-to-video and image-to-video
Vendor
Alibaba (Wan)
Resolution
720p or 1080p
Duration
5, 10, or 15 seconds
Reference images
Up to 1 (optional, for image-to-video)
Audio
Native synchronized audio with lip-sync

À propos de Wan 2.6

Wan 2.6 is Alibaba's latest Wan video model, built for cost-effective HD video that arrives with sound already attached. It runs in two modes: text-to-video, where a written prompt is all you need, and image-to-video, where a single reference image guides the motion. You pick the output you want and Wan 2.6 handles the rest in one pass.

What sets it apart is native synchronized audio. Dialogue, sound effects, and lip-sync are generated together with the picture, so there is no separate audio step to stitch on afterward. Output is available at 720p and 1080p, in clip lengths of 5, 10, or 15 seconds.

On HexGen you choose your resolution and duration, add an optional reference image for image-to-video, and generate. Pricing is per second of output and tiered by resolution, so a short 720p clip costs less than a longer 1080p one. You always know what you are paying for before you run.

Idées de prompts

Points de départ

Copiez, ajustez et lancez. Un bon prompt fait l'essentiel du travail.

A barista in a cozy cafe looks up at the camera and says good morning, steam rising from the espresso machine, warm light, soft ambient chatter in the background.

Waves crash against dark rocks at sunset, seagulls calling overhead, slow cinematic push-in over the shoreline.

A golden retriever bounds across a sunny park chasing a red ball, leaves crunching underfoot, bright cheerful daytime scene.

Tarifs
750
HGcoins / génération · ≈ $0.75

Payez seulement ce que vous générez. 1 USD = 1,000 HGcoins. Les HGcoins n'expirent jamais et les échecs sont remboursés automatiquement.

Comparer

Wan 2.6 face aux autres modèles

How Wan 2.6 stacks up against other video models in the HexGen catalog. Ranks are relative across these siblings.

Wan 2.6 face aux autres modèles
ModèleQualitéVitesseCoûtÀ choisir quand
Wan 2.6
Celui-ci
Alibaba
Excellent
Rapide
Coût moyen
Pick Wan 2.6 when you want HD video with native synced audio and lip-sync in a single pass, from text or one image.
Kuaishou
Très bon
Rapide
Coût moyen
A capable Kuaishou video model for general text-to-video and image-to-video work.
ByteDance
Très bon
Ultra-rapide
Coût réduit
ByteDance's video model for faster, lower-cost clip generation.
En résumé : choisissez Wan 2.6 quand pick wan 2.6 when you want hd video with native synced audio and lip-sync in a single pass, from text or one image.. Sinon, l'un des modèles ci-dessus conviendra mieux. Touchez une ligne pour comparer.

Questions fréquentes

Wan 2.6 is Alibaba's video model that generates HD video with native synchronized audio and lip-sync. It runs in text-to-video mode and image-to-video mode.