Skip to main content
k2k audio logo k2k audio

Back to Lexique

Neural

AI-powered source separation and resynthesis

AI-powered processing. These nodes use trained neural networks for tasks that traditional DSP can’t handle — like separating a mixed song into individual instruments.


Neural Stem Separator

What it does — Splits a full mix into four separate stems (Drums, Bass, Other, Vocals) using the Demucs v4 neural network.

When you’d reach for it — You have a finished mix or a bounced track and you need to isolate one element — pull out the vocal for a remix, grab just the drums for layering, or remove the bass to replace it with your own.

Quick example

  1. Feed your mixed audio into Neural Stem Separator.
  2. Wait for inference to complete (roughly 10-30 seconds depending on length).
  3. Choose which stem to preview in the viewer using the Preview selector.
  4. Route each of the four outputs (Drums, Bass, Other, Vocals) into separate processing chains.
  5. Blend the processed stems back together downstream.

Parameters

ParameterWhat it controlsRangeSweet spot hint
Preview in ViewerWhich stem displays in the main viewerDrums / Bass / Other / VocalsSwitch as needed to inspect each stem
MixBlend between silence and full separation0.00 - 1.00Keep at 1.00 for clean stems
Normalize OutputPrevents clipping by normalizing each stemOn / OffLeave on unless you need raw levels
Use GPU (CUDA)Runs inference on your GPU instead of CPUOn / OffTurn on if you have an NVIDIA GPU — dramatically faster

DDSP Resynth

What it does — Deconstructs a monophonic sound into pitch, loudness, and timbre, then rebuilds it from scratch using additive synthesis and shaped noise.

When you’d reach for it — You want to radically reshape the character of a solo instrument or voice — turn a flute into something synth-like, add breathiness to a clean vocal, or create evolving textures from a simple melodic line. Works best on single-note material like voice, violin, flute, or synth leads.

Quick example

  1. Feed a monophonic recording into DDSP Resynth.
  2. Set Quality to Standard for a good speed/accuracy balance.
  3. Open the Synthesis section and try the Breathy preset for an airy vocal texture.
  4. Adjust Harmonic Level and Noise Level to taste.
  5. Set Phase Mode to RTPGHI for cleaner output.

Parameters

Global

ParameterWhat it controlsRangeSweet spot hint
Frame RateHow many analysis snapshots per second50 - 250 Hz100 Hz is the standard; raise for fast passages
MixBlend between original and resynthesized audio0.00 - 1.001.00 for full resynth, dial back to layer with the original

Pitch Extraction

ParameterWhat it controlsRangeSweet spot hint
QualityCREPE model size — bigger is more accurate but slowerDraft / Standard / Precise / ExtremeStandard covers most material well
Voicing ThresholdConfidence level below which a frame is treated as unvoiced0.00 - 1.000.50 default; lower if pitched segments are dropping out
Fine Tune (cents)Pitch offset to compensate for detection bias-100 - +100-54 is the calibrated default; adjust if pitch drifts

Loudness Extraction

ParameterWhat it controlsRangeSweet spot hint
WeightingLoudness curve applied during analysisA-weighted / C-weighted / FlatA-weighted matches human perception best

Harmonic Analysis

ParameterWhat it controlsRangeSweet spot hint
HarmonicsNumber of overtones extracted from the spectrum1 - 10060 for rich timbres, lower for purer tones
InterpolationHow harmonic peaks are read from the spectrumNearest / Linear / ParabolicLinear is the safe default; Parabolic for precision
SmoothingTemporal smoothing across analysis frames0.00 - 0.900.10 keeps detail; raise to tame jitter
Power NormalizeScales harmonic amplitudes so total energy stays consistentOn / OffLeave on for predictable levels

Noise Analysis

ParameterWhat it controlsRangeSweet spot hint
BandsNumber of frequency bands in the noise filter16 - 12865 gives good resolution without excess cost
SmoothingTemporal smoothing on the noise envelope0.00 - 0.900.20 balances detail and stability
Floor (dB)Quietest level the noise model will represent-80 - -20 dB-60 dB catches most detail without amplifying silence

Synthesis

ParameterWhat it controlsRangeSweet spot hint
Harmonic Level (dB)Volume of the harmonic (tonal) component-24 - +12 dB0 dB keeps the original balance
Noise Level (dB)Volume of the noise (breath/texture) component-24 - +12 dB-12 dB for subtle texture; raise for breathier sounds
Harmonic Rolloff (dB/oct)Spectral tilt of the harmonic series-12 - +6 dB/oct0 preserves the analyzed timbre; negative darkens, positive brightens
Noise ColorSpectral shape of the noise component-1.00 - +1.00-1 is warm/pink, 0 is white, +1 is bright/blue
Output Gain (dB)Master level after synthesis-24 - +12 dB0 dB; adjust to match surrounding levels
Phase ModeHow phase is reconstructed in the outputNone / RTPGHI / AnchoredRTPGHI for clean results; Anchored to preserve original phase character

Presets

PresetWhat it sets up
SawtoothRich harmonic series, minimal noise — classic synth tone
SquareOdd-harmonic emphasis, minimal noise — hollow, reedy character
BreathyFewer harmonics, prominent noise — airy, whispered quality
Warm PadGentle rolloff, pink-tinted noise — soft, enveloping texture
Bright LeadFull harmonics boosted, blue-tinted noise — cutting, present tone