Skip to main content
k2k audio logo k2k audio

Back to Lexique

Extractors

Separate audio into layers — transients, harmonics, noise, formants

X-ray vision for audio. Separate a sound into its ingredients — transients, harmonics, noise, formants — so you can work on each layer independently.

Every extractor splits your audio into two streams: Extracted (the layer you targeted) and Rejected (everything else). Toggle Invert to swap them.


Transient Extractor

What it does Detects and isolates the attack portion of each hit, pluck, or onset in your audio.

When you’d reach for it You want to soften harsh attacks without dulling the sustain, or grab just the snap of a snare to layer elsewhere.

Quick example

  1. Feed a drum loop into Transient Extractor.
  2. Set Duration to 20 ms and Sensitivity around 60%.
  3. Route the Extracted output (attacks only) through a Smooth node to tame them.
  4. Blend the softened attacks back with the Rejected output (sustain).

Parameters

ParameterWhat it controlsRangeSweet spot hint
DurationLength of the attack window captured around each onset10 — 100 ms20—40 ms for drums, 50—80 ms for plucked strings
SensitivityHow easily onsets are detected (higher = more hits caught)0 — 100%40—60% for clean tracks, 70%+ for busy mixes
IntervalMinimum gap between detected transients10 — 500 ms50 ms prevents double-triggers on flams
Fade OutGradually taper each transient window instead of a hard cutOn / OffOn for natural results
InvertSwap Extracted and Rejected outputsOn / Off

HPSS Extractor

What it does Splits audio into three layers: Harmonics (pitched, sustained tones), Percussives (attacks and broadband hits), and Residual (noise, breath, ambience).

When you’d reach for it You need a clean separation of melody from rhythm, or you want to isolate the room tone hiding underneath a performance.

Quick example

  1. Load a full mix into HPSS Extractor.
  2. Select Harmonics to grab vocals and sustained instruments.
  3. Process the harmonic layer with pitch correction or reverb.
  4. Recombine with the Rejected output (percussive + residual).

Parameters

ParameterWhat it controlsRangeSweet spot hint
ComponentWhich layer to extract: Harmonics, Percussives, or Residual3 choices
H KernelTemporal smoothing width for harmonic detection (must be odd)11 — 5131 for general use; lower for fast-changing pitched content
P KernelSpectral smoothing width for percussive detection (must be odd)11 — 5131 for general use; lower for narrow-band percussion
PowerMask sharpness — higher values give harder separation1.0 — 4.02.0 for soft Wiener masks, 3.0+ for near-binary splits
InvertSwap Extracted and Rejected outputsOn / Off

Noise Floor Extractor

What it does Isolates continuous, unpitched noise — hiss, room tone, tape noise, breath — from pitched and transient content.

When you’d reach for it You recorded a field recording with steady background hiss and want to pull it out, either to remove it or to use it as a texture.

Quick example

  1. Feed a vocal recording into Noise Floor Extractor.
  2. Set Threshold around -55 dB and Smoothing to 50%.
  3. The Extracted output contains the breath and room hiss.
  4. Flip Invert on to get a cleaner vocal on Out1 instead.

Parameters

ParameterWhat it controlsRangeSweet spot hint
ThresholdMagnitude level below which content is considered noise-80 — -20 dB-60 dB for light hiss, -45 dB for heavy noise
SmoothingTemporal smoothing of the noise mask (reduces flicker)0 — 100%40—60% for steady noise; lower for pulsing artifacts
InvertSwap Extracted and Rejected outputsOn / Off

Formant Extractor

What it does Detects and isolates vocal formants (F1 through F4) — the resonant peaks that give vowels their character.

When you’d reach for it You want to reshape a vocal’s vowel quality without touching its pitch, or extract just the “body” frequencies of a voice for layering.

Quick example

  1. Feed a vocal take into Formant Extractor.
  2. Select All to capture every formant region, or pick F1 for vowel height only.
  3. Set Width to 200 Hz for broad capture, or narrow to 80 Hz for surgical work.
  4. Route the Extracted output through a Gain or EQ node to reshape the vocal character.

Parameters

ParameterWhat it controlsRangeSweet spot hint
FormantWhich formant(s) to extract: F1, F2, F3, F4, or All5 choicesAll for general reshaping; F1+F2 individually for vowel design
WidthBandwidth around each formant center frequency50 — 500 Hz100—200 Hz for vocals; wider for instruments with broad resonances
StrengthIntensity of the extraction mask0 — 100%100% for clean separation; lower for partial blending
OrderCepstral analysis resolution (higher = more detail in envelope)8 — 6016 for speech; 30—40 for singing with vibrato
InvertSwap Extracted and Rejected outputsOn / Off

Spectral Envelope Extractor

What it does Separates an audio signal into its timbre (spectral envelope) and its pitch content (excitation), based on source-filter theory.

When you’d reach for it You want to transplant the timbre of one sound onto the pitch of another, or isolate pure pitch content for analysis.

Quick example

  1. Feed a cello recording into Spectral Envelope Extractor.
  2. Select Envelope to get the cello’s tonal color without its melodic content.
  3. Use a Blend node to apply that timbral shape onto a synthesizer sound from another branch.

Parameters

ParameterWhat it controlsRangeSweet spot hint
OutputWhich component to extract: Envelope (timbre) or Excitation (pitch)2 choices
OrderCepstral resolution — lower values give a smoother envelope20 — 8040 for general use; 20—30 for broad timbral shape; 60+ for detailed formant tracking
IterationsRefinement passes for the envelope estimate5 — 2010 for most material; 15—20 for sounds with sharp spectral peaks
InvertSwap Extracted and Rejected outputsOn / Off

Spectral Peak Extractor

What it does Finds and isolates the N strongest resonant peaks in the frequency spectrum.

When you’d reach for it You have a resonant instrument and want to grab specific overtones for processing, or you need to notch out ringing frequencies.

Quick example

  1. Feed a piano recording into Spectral Peak Extractor.
  2. Set Peaks to 8 and Width Hz to 30.
  3. The Extracted output contains only the strongest overtones.
  4. Flip Invert to remove those resonances from the original instead.

Parameters

ParameterWhat it controlsRangeSweet spot hint
AveragingHow peaks are measured across time: Mean, Median, or Max3 choicesMean for sustained tones; Max to catch brief resonances
PeaksNumber of peaks to extract1 — 203—5 for dominant partials; 10+ for a full harmonic portrait
Width HzFrequency width of the extraction window around each peak10 — 200 Hz30—50 Hz for precise overtone targeting; 100+ for broad resonance bands
Thresh dBMinimum magnitude for a peak to qualify-60 — 0 dB-40 dB ignores low-level noise peaks
InvertSwap Extracted and Rejected outputsOn / Off

Temporal Envelope Extractor

What it does Extracts the slow-moving amplitude shape (dynamics contour) from each frequency band, separating it from fast fluctuations like tremolo or flutter.

When you’d reach for it You want to capture the ADSR-like dynamic shape of a sound to apply it elsewhere, or strip away micro-variations to get a steadier tone.

Quick example

  1. Feed a bowed violin phrase into Temporal Envelope Extractor.
  2. Set Smoothing to 80 ms and mode to RMS.
  3. The Extracted output is the smooth, sustained bow movement.
  4. The Rejected output contains the vibrato and bow noise detail.

Parameters

ParameterWhat it controlsRangeSweet spot hint
TypeEnvelope calculation method: Mean (smooth), Max (preserves attacks), or RMS (perceptually balanced)3 choicesRMS for most material; Max when you need to preserve transient peaks
SmoothingWindow size for the moving average1 — 500 ms30—80 ms for musical dynamics; 200+ ms for very broad contours
InvertSwap Extracted (envelope) and Rejected (detail) outputsOn / Off

Pitch Class Extractor

What it does Isolates every occurrence of a chosen musical note — or an entire chord — across all octaves at once.

When you’d reach for it You want to pull all the C notes out of a piano recording regardless of octave, or extract just the notes belonging to an Am7 chord from a complex mix.

Quick example

  1. Feed a guitar strumming recording into Pitch Class Extractor.
  2. Switch to Chord mode, set the root to G, and choose Major.
  3. The Extracted output contains only the frequencies belonging to a G major chord.
  4. Process the rejected output separately to reshape the non-chord tones.

Parameters

ParameterWhat it controlsRangeSweet spot hint
ModeSingle note or Chord selectionSingle / Chord
NoteWhich pitch class to extract (C through B) — acts as root in Chord mode12 notes
ChordChord quality: Maj, Min, D7, M7, m7, Dim, Aug (visible in Chord mode only)7 types
TuningA4 reference frequency430 — 450 Hz440 Hz standard; adjust for vintage or non-standard tunings
WidthPitch tolerance — how much spectral blur around each target note0 — 100%10% for precise extraction; 30%+ for looser, more forgiving grabs
InvertSwap Extracted and Rejected outputsOn / Off

Micro Transient Extractor

What it does Detects and isolates ultra-short events under 10 ms — clicks, pops, vinyl crackle, digital glitches.

When you’d reach for it You have a recording with vinyl clicks or digital pops and want to grab them for removal, or you want to harvest those tiny artifacts as a percussion layer.

Quick example

  1. Feed a vinyl transfer into Micro Transient Extractor.
  2. Set Duration to 3 ms and Sensitivity to 80%.
  3. The Extracted output is a stream of isolated clicks.
  4. Flip Invert to get the cleaned audio on Out1.

Parameters

ParameterWhat it controlsRangeSweet spot hint
DurationLength of the capture window around each micro-event1 — 10 ms2—5 ms for clicks; 8—10 ms for slightly longer pops
SensitivityDetection threshold (higher = more events caught)0 — 100%60—80% for typical vinyl; lower for sparse glitches
IntervalMinimum gap between detected events1 — 50 ms10 ms prevents merging adjacent crackle
InvertSwap Extracted and Rejected outputsOn / Off

Silence Extractor

What it does Finds quiet or silent regions in your audio based on an energy threshold.

When you’d reach for it You want to isolate the gaps between phrases for creative filling, or trim dead air from a long recording.

Quick example

  1. Feed a spoken-word recording into Silence Extractor.
  2. Set Threshold to -55 dB and Duration to 200 ms.
  3. The Extracted output contains only the pauses.
  4. Flip Invert to keep just the speech with silences removed.

Parameters

ParameterWhat it controlsRangeSweet spot hint
ThresholdEnergy level below which a frame counts as silent-80 — -20 dB-60 dB for clean recordings; -45 dB for noisy sources
SmoothingSoftens the edges of detected silent regions0 — 100%30% for gentle fades; 0% for hard cuts
DurationMinimum length a quiet region must last to count as silence10 — 1000 ms100—200 ms skips brief pauses; 500+ ms catches only long gaps
InvertSwap Extracted and Rejected outputsOn / Off