1st movie clip!
Hmm, in addition to the general points I generated below, there are a few important assumptions missing:
- ComfyUI is the most advanced tool available, aside from those designed for programmers, and unless you started using it with the specific intention of doing everything manually from the very beginning, it’s extremely difficult beyond the basics. While end-to-end tutorials and instructional videos generally exist for the basic workflow, making it possible even for beginners, the actual work involved is highly advanced
- On top of that, video generation is one of the most difficult tasks in generative AI and one of the tasks that requires the most VRAM.
- For the reasons above, “there are almost never cases where the answer or workflow has already been explained.” This is a tool for people who want to create through trial and error. You’ll need to search GitHub for hints from fragmented information, ask generative AI for help, keep trying until it works, and in the worst case, build custom nodes or code yourself… It sounds simple, but it’s difficult without the necessary knowledge and experience
- In generative AI, tasks like fixing output tend to consume more VRAM than the initial generation. This is because more data is loaded during processing → Therefore, if you can regenerate the output instead of fixing it, that’s the more cost-effective approach.
- Achieving the same results with more limited resources (like VRAM) requires more advanced knowledge and skills. This isn’t limited to PCs, by the way…
- But 8GB of VRAM… even 24GB isn’t particularly powerful for video generation. Data center GPUs are in the 80GB range, after all…
For these reasons, I think you’ll probably need to become an advanced user first if you want to do everything perfectly… Or, look for an easier path, or don’t try to accomplish everything at once…
One reason you can’t find a satisfactory answer even after searching is that, under current conditions, it’s actually quite difficult…
You are not missing something obvious — you are trying to do a different job than most inpainting tutorials show
What most beginner tutorials teach is:
- load one image
- mask an area
- use a prompt to replace that area
- get a new image
That is generic inpainting.
What you want is different:
- you already have a broken frame from a video
- you know who the face is supposed to be
- you want to repair only the face
- then use that repaired frame as the starting point for the next shot
That is a real and common workflow. It is closer to identity repair or reference-guided face repair than to ordinary “mask + prompt” inpainting.
The short answer
Yes — there are real workflows for this.
For your case, the practical choices are:
Repair the bad frame Best when the shot is already good and only the face drifted.
Recreate the shot Best when pose, staging, camera, or multiple frames are wrong.
Use the repaired frame as the next shot’s input This is the key continuity trick.
For your hardware and current stage, I would not jump to training a custom LoRA yet. I would first make this simpler loop work:
- generate a short shot
- inspect the last frame
- repair that frame if only the face is wrong
- use that repaired frame to start the next short shot
That is the most realistic and least frustrating route on an 8 GB machine.
The core distinction: repair vs recreate
This is the biggest practical decision.
Repair
Repair when the frame is already mostly correct and only a local area is wrong.
Typical examples:
- the face became the wrong person
- the face is melted or soft, but the pose is still right
- the lighting, clothes, bench, and composition are all correct
- the frame would be usable if the face were fixed
In other words:
the shot is good, the face is bad
That is a repair case.
Recreate
Recreate when the problem is global , not local.
Typical examples:
- both people drifted too far
- the body pose is wrong
- the bench relationship is wrong
- the camera angle drifted
- the motion between two states is wrong
- several frames in a row are unusable
- fixing the face would still leave the shot bad
In other words:
the shot itself is bad
That is a recreate case.
The practical rule
A good production rule is:
- if the frame is about 80–90% right , repair
- if the shot is structurally wrong, recreate
That one rule will save you a lot of time.
What “repair that frame” actually means
It does not mean “run a magical video repair model over the whole movie.”
It usually means this:
- export or grab the bad last frame
- load it as a still image
- mask only the face
- repair the face using one of the methods below
- save the result as a new repaired frame
- feed that repaired frame into the next shot
So yes — it is often a still-image repair inside a video workflow.
That is normal.
The three repair methods that matter
1) Plain masked inpainting
Use this when the face is only slightly wrong.
Good for:
- soft face drift
- mild face deformation
- one eye or mouth going wrong
- keeping everything else in the frame unchanged
You can learn the base mechanic from the official ComfyUI inpaint tutorial:
- ComfyUI Inpainting Workflow
That tutorial covers:
- the Mask Editor
- masking only the area you want to change
- the
VAE Encoder (for Inpainting) - rebuilding just the masked region
This is the simplest route and the best one to learn first.
2) Face swap / identity replacement
Use this when the face became the wrong person.
Good for:
- “the shot is fine, but the character is now someone else”
- bridge-frame correction
- repeating the same identity across many frames
The most direct tool for this in ComfyUI is:
- ComfyUI-ReActor
ReActor is specifically a face-swap extension for ComfyUI and supports reusable face models, which makes it very attractive for recurring characters.
3) Reference-guided face repair
Use this when you want another image to actively guide the face repair.
Good for:
- “make this broken face look like this person”
- stronger identity control than a prompt alone
- cases where plain inpaint is too vague
The most relevant OSS tool here is:
- ComfyUI IPAdapter Plus
IPAdapter Plus is important because it supports FaceID-style identity guidance. The repo notes that most FaceID models require the matching LoRA, and that regional use is most effective inside an inpaint-style workflow.
The actual OSS options I would care about for your case
A. Official ComfyUI inpaint workflow
Best first step because it teaches the basic mask-and-repair loop.
- ComfyUI Inpainting Workflow
Why I care:
- it is official
- it is simple
- it teaches the exact mechanics you need for single-frame repair
B. Official Wan 2.2 5B workflow
This is the video-generation backbone I would use on 8 GB.
- Wan2.2 Video Generation ComfyUI Official Native Workflow Example
Why I care:
- official template path
- documented model placement
- explicitly aimed at the 5B low-VRAM workflow
C. Official Wan FLF workflow
This is the best tool for your sit-down transition.
- ComfyUI Wan FLF workflow
Why I care:
- first/last-frame control is exactly the right abstraction for:
- standing near bench
- seated on bench
- controlled transition between the two
D. ReActor
Most direct route for “wrong face in one frame.”
- ComfyUI-ReActor repo
Why I care:
- fast face swap
- reusable face models
- very good for bridge-frame identity repair
E. Impact Pack
Best fallback if ReActor keeps wasting time.
- ComfyUI Impact Pack
Why I care:
MaskPainterFaceDetailerMaskDetailer- designed for local detection/detail/inpaint workflows
The repo explicitly describes:
MaskPainteras a mask-drawing featureFaceDetaileras a face detector/improverMaskDetaileras a simple inpaint node that applies the detailer to the masked area
That is extremely close to your actual need.
F. IPAdapter Plus
Best when you want reference-guided identity repair.
- ComfyUI IPAdapter Plus
Why I care:
- strongest “use this face as the guide” option
- but more setup-heavy than plain inpaint
G. VACE
Important later, but not my first recommendation for your machine.
- Wan VACE To Video node docs
Why it matters:
- this is the heavier path for real video-region editing / masked V2V
- it is useful to know it exists
- but I would not start here on 8 GB
H. Musubi-tuner
Later, not now.
- musubi-tuner
Why it matters:
- useful reality check for training
- the repo recommends roughly:
- 12 GB+ for image training
- 24 GB+ for video training
That is why I would not make “train my own LoRA” your next move.
My actual recommendation for your case, as a whole
If this were my setup, I would do it in this order:
Phase 1 — Stabilize ComfyUI
Goal: get one repair path working, not ten.
1. Stop expanding the setup
For now, stop downloading more:
- checkpoints
- LoRAs
- random custom nodes
- giant workflow packs
Keep only a small active set.
2. Fix custom node installation properly
The official docs say every custom node install has two steps :
- put the node repo into
ComfyUI/custom_nodes - install that node’s Python dependencies in the same ComfyUI environment
Official docs:
- Install custom nodes in ComfyUI
- ComfyUI-Manager install
- Custom-node troubleshooting
If a node repo is sitting in something like custom_model, that is the wrong place. Node repos go in custom_nodes.
3. Read the startup logs every single time
Do not trust “installed successfully” messages alone.
Trust:
- startup console
import failed- missing dependency messages
- missing node report
The troubleshooting docs explicitly say to inspect logs and error reports.
4. Try ReActor once, cleanly
If you want ReActor, follow the repo literally:
- put it in
ComfyUI/custom_nodes/ComfyUI-ReActor - run its install step
- download required models
- restart ComfyUI
- check the logs
- confirm the nodes actually appear
If it still fails after one clean pass, stop fighting it and move to Impact Pack.
That is not “giving up.” That is good production discipline.
Phase 2 — Prove a minimal workflow works
Goal: one small workflow that is reliable.
5. Use the official Wan 2.2 5B template
Do not start from a huge community graph.
Start from:
- Wan2.2 Video Generation ComfyUI Official Native Workflow Example
Why:
- official
- documented
- suitable for your VRAM tier
- much easier to debug
6. Stop thinking “2-minute video”
Think in shots :
- walk
- approach bench
- sit
- seated
That is the correct structure for your project.
7. Create three still anchors first
Before any more video, create:
- Anchor A — both characters walking
- Anchor B — both characters near the bench
- Anchor C — both characters seated
If those three are not right, the video will not stay right.
These anchors are your cast lock.
Phase 3 — Use repair and recreate correctly
Goal: keep continuity without overcomplicating the workflow.
Shot 1 — Walk
Generate a short walking clip from Anchor A.
After rendering:
- if only the face drifted near the end → repair the last frame
- if body motion / spacing / camera drifted → recreate the shot
Shot 2 — Approach bench
Use the repaired last frame from Shot 1 as the next shot’s starting image.
Again:
- local face problem → repair
- global shot problem → recreate
Shot 3 — Sit down
This is where I would usually recreate , not repair.
Use:
- first frame = correct standing-near-bench frame
- last frame = Anchor C
- workflow = FLF
That is what Wan FLF is for.
Shot 4 — Seated
Generate from the seated anchor and keep motion subtle.
This is the safest place to get extra screen time.
How I would decide: repair vs recreate
Use this table mentally:
| Situation | What I would do |
|---|---|
| Face is wrong, everything else is good | Repair |
| Face is slightly deformed, everything else is good | Repair |
| Face is wrong and you have a reference image | Repair with identity guidance |
| Body pose is wrong | Recreate |
| Bench/camera/staging drifted | Recreate |
| Multiple frames in a row are bad | Recreate |
| Sit-down motion is wrong | Recreate with FLF |
That is the real production logic.
The actual repair routes I would use
Route 1 — Plain face-region inpaint
Use when the face is only moderately wrong.
Workflow
- load broken frame
- draw a tight mask over the face
- run inpaint
- save repaired frame
Learn from
- ComfyUI Inpainting Workflow
Best when
- the identity is close
- the face just needs correction
- you want the simplest path
Route 2 — ReActor face swap
Use when the face became the wrong character.
Workflow
- load broken frame
- load face reference / face model
- run ReActor
- save repaired frame
Source
- ComfyUI-ReActor
Best when
- the shot is fine
- the identity is wrong
- you want the fastest bridge-frame fix
Route 3 — Impact Pack + IPAdapter FaceID
Use when you want stronger reference guidance and local repair.
Workflow
- load broken frame
- mask the face with
MaskPainter - use
FaceDetailer/MaskDetailer - add IPAdapter FaceID guidance if needed
- save repaired frame
Sources
- ComfyUI Impact Pack
- ComfyUI IPAdapter Plus
Best when
- ReActor is unavailable or unstable
- you want stronger reference control
- you want local repair, not a global rewrite
What I would not do right now
1. I would not train a LoRA
Not yet.
That may become useful later, but the current official/community guidance makes it a poor next step for your hardware and stage.
Use:
- repair
- anchors
- short shots
- FLF
first.
2. I would not jump to full video inpainting
True masked video editing exists, and VACE is the right place to look for that.
But for your current setup, it is the later tier, not the next tier.
3. I would not keep downloading assets “just in case”
That is how disk fills up and clarity disappears.
For now, your active set should be small:
- Wan 2.2 5B
- one repair route
- three anchor stills
- your active project files
The exact “if this were my setup” plan
Day 1 — setup sanity
- clean
custom_nodes - fix folder locations
- try one clean ReActor install
- read startup logs
- if it still fails, switch to Impact Pack
Day 2 — learn one-frame repair
- use the official inpaint tutorial
- take one bad frame
- repair one face
- save a corrected frame
If you cannot do one bad frame → one repaired frame , do not touch the full video workflow yet.
Day 3 — generate one walk shot
- use official Wan 2.2 5B template
- keep it short
- inspect the last frame
Day 4 — use the repaired frame as the handoff
- repair the last frame if needed
- feed it into the approach shot
Day 5 — build the sit-down shot with FLF
- good standing frame
- good seated frame
- FLF transition
Day 6 — generate the seated shot
- short and subtle
- repair only if needed
Day 7 — assemble in an editor
Use editing for pacing. Do not ask the model to generate every second of your screen time if editing can solve it more reliably.
The most important takeaway
The winning pattern for your project is:
short shot → inspect last frame → repair if the problem is local → recreate if the problem is global → use the repaired frame to start the next shot
That is the cleanest, safest, least frustrating version of the workflow.
The actual links I would open first
If you only open a few tabs, I would make them these:
- Install custom nodes in ComfyUI
- ComfyUI-Manager install
- Custom-node troubleshooting
- ComfyUI Inpainting Workflow
- Wan2.2 Video Generation ComfyUI Official Native Workflow Example
- ComfyUI Wan FLF workflow
- ComfyUI-ReActor
- ComfyUI Impact Pack
- ComfyUI IPAdapter Plus
- Wan VACE To Video node docs
- musubi-tuner
Final recommendation
If I had to turn everything above into one practical recommendation, it would be this:
Right now
- keep the stack small
- fix installs properly
- use Wan 2.2 5B
- repair only bridge frames
- recreate bad transitions
- use FLF for the sit-down shot
- treat ReActor as useful but optional
- use Impact Pack if ReActor wastes time
Later
- stronger reference-guided repair
- true video-region editing
- training, if you still need it after the workflow is stable
That is the path I would trust most for your setup and your specific project.
Discussion in the ATmosphere