Troubleshooting (get cleaner isolation)

If your isolated track still contains other sounds (“leakage”), or the target disappears, use this checklist. Most improvements come from tightening the span and simplifying the prompt.

Best practice: start with a 10–20s clip where the target sound is most obvious. Iterate fast.

Problem: leakage (other sounds bleed through)

  • Use a shorter span where the target is clearest.
  • Make the prompt more specific (sound name + context).
  • Separate in passes: extract the loudest occurrence first.

Problem: target disappears or gets “thin”

  • Simplify the prompt (remove extra adjectives).
  • Try a different noun for the same sound (“dialogue” vs “speech”).
  • Use a span where the target isn’t masked by louder sources.

Problem: artifacts (watery/robotic)

  • Shorten the span and export multiple takes; pick the cleanest.
  • Prefer prompts that describe the sound event, not the scene.
  • If you need stems (vocals vs instrumental), consider classic stem tools as a baseline.

Problem: results vary between clips

  • Normalize input loudness; avoid clipping where possible.
  • Keep clip length consistent when comparing prompts.
  • Save your “best prompts” as reusable templates.

Next: segment audio guide or isolate sounds guide.