LTX-2.3 Prompt Generator
Prose prompts built for LTX-2.3, Lightricks' open-source, audio-and-video, run-it-yourself engine.
- ⚡ Best for
- Open-source, run-it-yourself video: dual-character dialogue, camera-controlled shots, and local/ComfyUI workflows.
- 🆕 Latest update
- LTX-2.3 rebuilt the latent space (sharper detail, stronger motion, less smear), bumped the text connector to ~20B params for closer prompt adherence, and made a big audio jump, far less of the metallic voice. Portrait video is now native, not a workaround.
- 💡 Top tip
- Write the prompt as scene + style + shot-by-shot beats. For two-character dialogue, name who speaks first and second; if you're supplying your own audio (e.g. ElevenLabs), describe motion and shot type and leave the words to the audio track.
- 💰 Cost
- Prompt is free here. LTX-2.3 itself is open-source and free: run it in LTX Studio, locally in ComfyUI with open weights, or on major AI video platforms.
- ✅ Verdict
- The best pick when you want control and ownership: it held its own against Veo and Kling in the 4K test.
Free · no signup · ⌘/Ctrl + Enter to generate
Ltx video prompt: turn a one-line idea into a LTX-2.3-ready prompt with this free tool, complete with a negative prompt, then paste it straight into LTX.
LTX-2.3 is Lightricks' open-source multimodal video model, the rare top-tier engine you can download, run on your own hardware, and fine-tune. It generates picture and audio together, leans on a rebuilt latent space for sharper detail and stronger motion, and (uniquely among the models here) lives inside an open ComfyUI ecosystem of community LoRAs for things like two-character dialogue and reference-driven camera moves. Because it's prompt-adherent prose rather than JSON, the brief you write here reads like a tight shot list.
LTX-2.3 is open-source with open weights on Hugging Face. You can run it three ways: in LTX Studio (Lightricks' own web app), locally in ComfyUI (where the community IC-LoRAs for dialogue and camera control live), or on the major AI video platforms that bundle LTX 2.3. This tool writes the prompt; you paste it into whichever surface you use.
Verdict
| Is LTX-2.3 powerful? | Yes. The rebuilt latent space gives sharper detail and stronger motion, and it held up against Veo 3.1 and Kling 3.0 in the 2026 4K test. |
|---|---|
| Is it easy to use? | Depends on the route. LTX Studio is a no-install web app; the advanced dialogue and camera tricks live in a more hands-on ComfyUI setup. |
| Best for everyone? | No. Closed models can edge it on raw dynamism in heavy action. It is the pick when you want open weights, local execution, and control. |
| Worth using in 2026? | Yes. It is the top open-source choice: free, ownable, fine-tunable, with a community LoRA ecosystem the closed models cannot match. |
Use LTX if you…
- You want open weights you can run locally in ComfyUI with no per-clip API fees
- You make two-character dialogue or podcast-style talking scenes
- You want to copy a real camera move from reference footage onto your shot
- You need native portrait or 4K output without crop-and-pray workarounds
- You want to fine-tune the model on your own footage and own the pipeline
Pick another model if you…
- You need the single most dynamic action take and will not chain shots
- You want a one-click box and will not touch ComfyUI for the advanced tricks
- You expect a full music and voice studio rather than automatic foley
- You have no capable GPU and will not use the hosted surfaces
Feature snapshot
| Capability | Rating | Take |
|---|---|---|
| Open-source and ownership | Excellent | Open weights, free local runs, and fine-tuning. |
| Detail and sharpness | Strong | New latent space cuts smear at the same resolution. |
| Dual-character dialogue | Strong | IC-LoRA Dual Character V2 handles two-person scenes. |
| Camera control | Strong | Camera Man V2 transfers a move from reference footage. |
| Audio quality | Good | Big 2.3 jump; solid foley, less metallic voice. |
| Heavy action dynamism | Moderate | A touch less dynamic than Veo on hard stunts. |
| Ease of setup | Moderate | Hosted is easy; ComfyUI plus GPU is hands-on. |
Pros
- Open-source and free with open weights, download it, run it locally in ComfyUI, fine-tune it, and own the whole pipeline instead of renting a closed API
- Rebuilt latent space delivers sharper detail, stronger motion and noticeably less smear at the same resolution (it nailed defined muscle and solid impact in the 4K boxer test)
- Big audio-quality jump in 2.3, far less of the metallic voice and fewer weird noises in the quiet parts than earlier versions
- Genuine control levers: motion guidance with first + last frame, native portrait/4K output, and an open LoRA ecosystem for dialogue and camera motion
- Larger ~20B text connector means it tracks closer to what you actually describe, rewarding detailed, structured prose
Cons
- On heavy action it can be less dynamic than the closed models, in the 4K test it gave one punch plus a long slow-motion beat where Veo packed in more
- The headline tricks (dual-character dialogue, camera-control transfer) live in community ComfyUI IC-LoRAs, not in a one-click box, LTX Studio is simpler but exposes fewer of them
- Audio is solid for dialogue and ambience but isn't a full music/voice studio, treat it as automatic foley, and supply your own track (e.g. ElevenLabs) when you need precise lines
- Running the full model locally is GPU-heavy; a 10s clip is minutes, not seconds, so the hosted surfaces (LTX Studio and other AI video platforms) are the easy on-ramp
What's new in LTX-2.3
Lightricks frames 2.3 as an engine upgrade, not a cosmetic one. CEO Zeev Farbman calls out three changes that came straight from community feedback: a new latent space, a bigger connector, and a better treatment of audio. The latent space matters most for the eye, a very compressive latent is efficient but loses detail, so the rebuilt one means that at the same resolution you get more detail and less smear, with stronger motion at the same speed even on consumer hardware.
The connector, the piece that links the language model to the diffusion model, grew to around 20 billion parameters, which translates into closer prompt adherence: the video tracks nearer to what you actually wrote. On top of that, 2.3 makes portrait video native rather than a crop-and-pray workaround, and supports native 4K at industry-standard frame rates. The practical upshot for prompting: detail pays off. A precise, structured brief gets rendered more faithfully than it would have in earlier versions.
How LTX compares to other AI video models
Where LTX-2.3 sits against the rest of the field on value and output quality, and how it scores capability by capability. Hover or tap any model for the detail.
| Model | Realism | Motion & physics | Audio & lip-sync | Camera control | Value |
|---|---|---|---|---|---|
| Seedance+ image | |||||
| LTX | |||||
| Veo 3.1 | |||||
| Kling 3.0 | |||||
| Sora 2+ image | |||||
| Runway | |||||
| Luma | |||||
| Grok+ image | |||||
| PixVerse | |||||
| Happy Horse | |||||
| Pika |
Scores are our editorial read of 2026 head-to-head tests, on a 1-5 scale, not vendor benchmarks. Every model shown is a video generator; a few (marked + image) also create stills. Use it to pick which model to write a prompt for, then generate on whichever platform hosts it.
The open-source advantage
LTX-2.3 is the only model in this lineup you can actually download and own. Lightricks ships open weights on Hugging Face (millions of downloads and counting) under an open license, and the whole pitch is that 'closed APIs are not enough', they want people building on top of an open multimodal engine. That's not just ideology; it's a different workflow. You can run it locally with no per-clip API bill, fine-tune it on your own footage, and tap a community LoRA ecosystem that adds capabilities the base model doesn't ship with.
If you don't want to touch ComfyUI, you don't have to. LTX Studio is Lightricks' own web app running 2.3, and major AI video platforms bundle LTX 2.3 alongside other models, so the same prompt you generate here works whether you're a tinkerer running open weights or a creator who just wants to paste and go. The deeper control (dialogue, camera transfer) rewards the ComfyUI route; the hosted surfaces are the fast, no-setup path.
Dual-character dialogue & custom audio
One of the most useful community workflows is the IC-LoRA Dual Character V2 setup in ComfyUI, built for the 'two people talking, podcast-style' scene that most models fumble. It runs on the distilled 2.3 checkpoint with the dual-character IC-LoRA doing the heavy lifting, and you can stack an enhanced-motion LoRA and a detailer LoRA to lift overall quality. The workflow flips between text-to-video and image-to-video, and lets you set duration directly in seconds.
The key prompting decision is audio. You can let LTX generate dialogue from your prompt, or feed your own custom track, for example lines you made in ElevenLabs or another TTS model. When you supply custom audio, the advice from the workflow author is to stop writing dialogue in the prompt and instead describe motion and shot type per beat (shot 1, shot 2, shot 3) and, crucially, who speaks first and who answers. When you don't supply audio, you write the dialogue lines in the prompt and LTX voices them. Either way, the prompt this tool produces is laid out as a clear shot list so it drops cleanly into that workflow.
Camera control with IC-LoRA
The other standout open-source trick is camera-motion transfer via the IC-LoRA Camera Man V2 (also from Hugging Face). The idea: build your scene as a keyframe image first, then hand LTX a reference video purely for its camera movement, a pan, a tracking shot, a rise-and-fall, and it transfers that exact camera path onto your new footage. It's not really image-to-video; it's image-plus-borrowed-camera-move. In the demos a dancing clip's tracking shot got mapped onto a motorbike on a stormy highway, and a sports-car clip's left-right cuts got mapped onto a spaceship over a city.
Two settings make or break it, both from the workflow author: keep resolution at 960×512 or above for clean output, and set the conditioning image strength between 0.5 and 0.7 to balance keyframe consistency against motion. The big win is that you stop writing paragraphs trying to describe a camera move in words, you just show LTX the move you want. You can even record your own phone footage as the camera reference.
LTX vs the closed models
In the 2026 4K head-to-head, LTX-2.3 was tested side by side with Veo 3.1 and Kling 3.0 on the same starting frame and prompt, at 4K quality. It impressed on detail, the reviewer singled out how defined the boxer's muscles looked and how solidly the hits landed, and it could open at high intensity before dropping into slow motion. The honest caveat: on that action shot it delivered one punch plus a long slow-motion beat, which felt 'a bit less dynamic' than the alternatives, and like Kling it lacked Veo's multi-image character-reference feature.
So the trade is clear. The closed models can edge it on raw dynamism and a few convenience features. LTX-2.3 gives you comparable 4K quality plus the things they can't: open weights, local execution with no metered API, fine-tuning, and an open LoRA ecosystem for dialogue and camera control. If you value ownership and control, write your prompt for LTX; if you only care about the single most dynamic action take, the gap closes by chaining shots and using first/last-frame guidance.
How to write a great LTX prompt
- Structure the prose as scene → style/tone → shot-by-shot beats (shot 1, shot 2, shot 3). The bigger text connector follows an explicit shot list far better than one vague sentence.
- For two people talking, state who speaks first and who answers, and the shot for each beat (medium, close-up). If you're using your own audio track, describe only motion and framing and leave the actual words to the audio.
- Use first + last frame as motion guidance: describe where the shot starts and where it ends so LTX has a clear path to move along, rather than guessing the arc.
- If you're in ComfyUI, lean on the IC-LoRAs, Camera Man V2 to copy a real camera move from reference footage, and keep image strength around 0.5–0.7 at 960×512 or higher for clean motion.
LTX-2.3 prompt examples
Idea: “Two coworkers bickering at a bus stop in the rain.”, here's the kind of prompt this tool writes for LTX-2.3:
Scene: a city bus stop on a rainy evening, neon storefront glow reflecting in puddles, two coworkers under a single small umbrella. Style: photoreal, cinematic, moody teal-and-amber color grade, gentle film grain, humorous tone. Characters: a tired man in a wet grey coat, and a sharp-witted woman holding the umbrella. Shot 1, medium two-shot, slow push-in: the man checks his phone and sighs as rain streaks the frame; he says, "This bus is basically a myth, everyone talks about it, nobody's seen it." Shot 2, close-up on the woman, slight handheld: she smirks and answers, "At this point I'm expecting a unicorn to show up first." Shot 3, back to the wider two-shot as headlights wash across them, the camera rising slightly: he laughs, "Fine, but if it does, you're paying the fare." Audio: steady rain on the umbrella, distant traffic hiss, clear unprocessed dialogue with natural room tone, no music. Negative prompt: no metallic or robotic voices, no warped faces or extra fingers, no flicker, no on-screen text or logos, no smeared motion, no sudden frame jumps.
LTX-2.3 prompt FAQs
Is LTX-2.3 free and open-source?
Yes. LTX-2.3 is genuinely open-source with open weights published by Lightricks on Hugging Face, and it's free to use. You can run it locally in ComfyUI with no per-clip API fees, fine-tune it on your own data, use Lightricks' LTX Studio web app, or use it on major AI video platforms. Writing the prompt here is also free with no signup.
How do I prompt two characters talking in LTX?
Use a shot-by-shot prose layout: set the scene and tone, then write shot 1, shot 2, shot 3 with the framing for each beat and, importantly, who speaks first and who answers. If you let LTX generate the voices, include the dialogue lines in the prompt. If you supply your own audio track (for example from ElevenLabs), the workflow advice is to drop the words from the prompt and describe only motion and shot type, since the audio already carries the lines. In ComfyUI this runs through the IC-LoRA Dual Character V2 workflow.
Can LTX-2.3 copy a camera movement from another video?
Yes, via the community IC-LoRA Camera Man V2 in ComfyUI. You build your scene as a keyframe image, then feed LTX a reference video purely for its camera motion, a pan, tracking shot or rise, and it transfers that camera path onto your footage. Keep resolution at 960×512 or higher and set the conditioning image strength to 0.5–0.7 to balance keyframe consistency with motion. It means you can show LTX the camera move instead of trying to describe it in words.
ComfyUI or LTX Studio, which should I use?
ComfyUI gives you the full open-source toolkit, including the IC-LoRAs for dual-character dialogue and camera control, but it's more setup and runs on your own GPU. LTX Studio and other hosted AI video platforms are no-install web apps running LTX 2.3, simpler and faster to start, with fewer of the advanced community levers exposed. The prompt this tool writes works in all of them; pick ComfyUI for control, the hosted surfaces for speed.
How does LTX-2.3 compare to Veo and Kling?
In the 2026 4K test LTX-2.3 held up well against Veo 3.1 and Kling 3.0, strong detail and solid motion, but on one heavy slow-motion action shot it felt a bit less dynamic, and like Kling it lacked Veo's multi-image character reference feature. The closed models can edge it on raw dynamism and a couple of conveniences; LTX wins on everything ownership-related: open weights, free local execution, fine-tuning, and an open LoRA ecosystem.
New to AI video? Read the image-to-video guide for the one rule that beats everything, or browse all the free prompt tools.
Other model prompt generators
Seedance 2.0 Prompt Generator
Cinematic, audio-paired prompts engineered for ByteDance's Seedance 2.0, the current value king.
OpenVeo 3.1 Prompt Generator
Structured JSON prompts engineered for Google's Veo 3.1, including its native audio.
OpenKling 3.0 Prompt Generator
Structured prompts built for Kling 3.0, the value-and-consistency pick for people, motion, and multi-shot cuts.
OpenRunway Gen-4.5 Prompt Generator
Control-first prompts for Runway Gen-4.5, the filmmaker's pick for precise camera moves.
OpenGrok Imagine Prompt Generator
Fast, cheap prose prompts built for xAI's Grok Imagine, the value pick for testing ideas.
OpenPika 2.1 Prompt Generator
Prose prompts tuned for Pika 2.1, realism, consistency, and the Ingredients feature.
OpenLuma Ray 3.2 Prompt Generator
Cinematic, HDR-ready prose prompts engineered for Luma Ray 3.2, with keyframes for controlled transitions.
OpenPixVerse V6 Prompt Generator
Prompts built for PixVerse V6 (punchy social clips) and C1 (character consistency across shots).
OpenHappy Horse 1.1 Prompt Generator
Prose prompts tuned for Alibaba's Happy Horse 1.1, native audio, one clean beat, and a likely open-source future.
OpenSora 2 Prompt Generator
Cinematic, true-to-life prose prompts engineered for OpenAI's Sora 2.
Open