Extractors

Separate audio into layers — transients, harmonics, noise, formants

X-ray vision for audio. Separate a sound into its ingredients — transients, harmonics, noise, formants — so you can work on each layer independently.

Every extractor splits your audio into two streams: Extracted (the layer you targeted) and Rejected (everything else). Toggle Invert to swap them.

Transient Extractor

What it does Detects and isolates the attack portion of each hit, pluck, or onset in your audio.

When you’d reach for it You want to soften harsh attacks without dulling the sustain, or grab just the snap of a snare to layer elsewhere.

Quick example

Feed a drum loop into Transient Extractor.
Set Duration to 20 ms and Sensitivity around 60%.
Route the Extracted output (attacks only) through a Smooth node to tame them.
Blend the softened attacks back with the Rejected output (sustain).

Parameters

Parameter	What it controls	Range	Sweet spot hint
Duration	Length of the attack window captured around each onset	10 — 100 ms	20—40 ms for drums, 50—80 ms for plucked strings
Sensitivity	How easily onsets are detected (higher = more hits caught)	0 — 100%	40—60% for clean tracks, 70%+ for busy mixes
Interval	Minimum gap between detected transients	10 — 500 ms	50 ms prevents double-triggers on flams
Fade Out	Gradually taper each transient window instead of a hard cut	On / Off	On for natural results
Invert	Swap Extracted and Rejected outputs	On / Off	—

HPSS Extractor

What it does Splits audio into three layers: Harmonics (pitched, sustained tones), Percussives (attacks and broadband hits), and Residual (noise, breath, ambience).

When you’d reach for it You need a clean separation of melody from rhythm, or you want to isolate the room tone hiding underneath a performance.

Quick example

Load a full mix into HPSS Extractor.
Select Harmonics to grab vocals and sustained instruments.
Process the harmonic layer with pitch correction or reverb.
Recombine with the Rejected output (percussive + residual).

Parameters

Parameter	What it controls	Range	Sweet spot hint
Component	Which layer to extract: Harmonics, Percussives, or Residual	3 choices	—
H Kernel	Temporal smoothing width for harmonic detection (must be odd)	11 — 51	31 for general use; lower for fast-changing pitched content
P Kernel	Spectral smoothing width for percussive detection (must be odd)	11 — 51	31 for general use; lower for narrow-band percussion
Power	Mask sharpness — higher values give harder separation	1.0 — 4.0	2.0 for soft Wiener masks, 3.0+ for near-binary splits
Invert	Swap Extracted and Rejected outputs	On / Off	—

Noise Floor Extractor

What it does Isolates continuous, unpitched noise — hiss, room tone, tape noise, breath — from pitched and transient content.

When you’d reach for it You recorded a field recording with steady background hiss and want to pull it out, either to remove it or to use it as a texture.

Quick example

Feed a vocal recording into Noise Floor Extractor.
Set Threshold around -55 dB and Smoothing to 50%.
The Extracted output contains the breath and room hiss.
Flip Invert on to get a cleaner vocal on Out1 instead.

Parameters

Parameter	What it controls	Range	Sweet spot hint
Threshold	Magnitude level below which content is considered noise	-80 — -20 dB	-60 dB for light hiss, -45 dB for heavy noise
Smoothing	Temporal smoothing of the noise mask (reduces flicker)	0 — 100%	40—60% for steady noise; lower for pulsing artifacts
Invert	Swap Extracted and Rejected outputs	On / Off	—

Formant Extractor

What it does Detects and isolates vocal formants (F1 through F4) — the resonant peaks that give vowels their character.

When you’d reach for it You want to reshape a vocal’s vowel quality without touching its pitch, or extract just the “body” frequencies of a voice for layering.

Quick example

Feed a vocal take into Formant Extractor.
Select All to capture every formant region, or pick F1 for vowel height only.
Set Width to 200 Hz for broad capture, or narrow to 80 Hz for surgical work.
Route the Extracted output through a Gain or EQ node to reshape the vocal character.

Parameters

Parameter	What it controls	Range	Sweet spot hint
Formant	Which formant(s) to extract: F1, F2, F3, F4, or All	5 choices	All for general reshaping; F1+F2 individually for vowel design
Width	Bandwidth around each formant center frequency	50 — 500 Hz	100—200 Hz for vocals; wider for instruments with broad resonances
Strength	Intensity of the extraction mask	0 — 100%	100% for clean separation; lower for partial blending
Order	Cepstral analysis resolution (higher = more detail in envelope)	8 — 60	16 for speech; 30—40 for singing with vibrato
Invert	Swap Extracted and Rejected outputs	On / Off	—

Spectral Envelope Extractor

What it does Separates an audio signal into its timbre (spectral envelope) and its pitch content (excitation), based on source-filter theory.

When you’d reach for it You want to transplant the timbre of one sound onto the pitch of another, or isolate pure pitch content for analysis.

Quick example

Feed a cello recording into Spectral Envelope Extractor.
Select Envelope to get the cello’s tonal color without its melodic content.
Use a Blend node to apply that timbral shape onto a synthesizer sound from another branch.

Parameters

Parameter	What it controls	Range	Sweet spot hint
Output	Which component to extract: Envelope (timbre) or Excitation (pitch)	2 choices	—
Order	Cepstral resolution — lower values give a smoother envelope	20 — 80	40 for general use; 20—30 for broad timbral shape; 60+ for detailed formant tracking
Iterations	Refinement passes for the envelope estimate	5 — 20	10 for most material; 15—20 for sounds with sharp spectral peaks
Invert	Swap Extracted and Rejected outputs	On / Off	—

Spectral Peak Extractor

What it does Finds and isolates the N strongest resonant peaks in the frequency spectrum.

When you’d reach for it You have a resonant instrument and want to grab specific overtones for processing, or you need to notch out ringing frequencies.

Quick example

Feed a piano recording into Spectral Peak Extractor.
Set Peaks to 8 and Width Hz to 30.
The Extracted output contains only the strongest overtones.
Flip Invert to remove those resonances from the original instead.

Parameters

Parameter	What it controls	Range	Sweet spot hint
Averaging	How peaks are measured across time: Mean, Median, or Max	3 choices	Mean for sustained tones; Max to catch brief resonances
Peaks	Number of peaks to extract	1 — 20	3—5 for dominant partials; 10+ for a full harmonic portrait
Width Hz	Frequency width of the extraction window around each peak	10 — 200 Hz	30—50 Hz for precise overtone targeting; 100+ for broad resonance bands
Thresh dB	Minimum magnitude for a peak to qualify	-60 — 0 dB	-40 dB ignores low-level noise peaks
Invert	Swap Extracted and Rejected outputs	On / Off	—

Temporal Envelope Extractor

What it does Extracts the slow-moving amplitude shape (dynamics contour) from each frequency band, separating it from fast fluctuations like tremolo or flutter.

When you’d reach for it You want to capture the ADSR-like dynamic shape of a sound to apply it elsewhere, or strip away micro-variations to get a steadier tone.

Quick example

Feed a bowed violin phrase into Temporal Envelope Extractor.
Set Smoothing to 80 ms and mode to RMS.
The Extracted output is the smooth, sustained bow movement.
The Rejected output contains the vibrato and bow noise detail.

Parameters

Parameter	What it controls	Range	Sweet spot hint
Type	Envelope calculation method: Mean (smooth), Max (preserves attacks), or RMS (perceptually balanced)	3 choices	RMS for most material; Max when you need to preserve transient peaks
Smoothing	Window size for the moving average	1 — 500 ms	30—80 ms for musical dynamics; 200+ ms for very broad contours
Invert	Swap Extracted (envelope) and Rejected (detail) outputs	On / Off	—

Pitch Class Extractor

What it does Isolates every occurrence of a chosen musical note — or an entire chord — across all octaves at once.

When you’d reach for it You want to pull all the C notes out of a piano recording regardless of octave, or extract just the notes belonging to an Am7 chord from a complex mix.

Quick example

Feed a guitar strumming recording into Pitch Class Extractor.
Switch to Chord mode, set the root to G, and choose Major.
The Extracted output contains only the frequencies belonging to a G major chord.
Process the rejected output separately to reshape the non-chord tones.

Parameters

Parameter	What it controls	Range	Sweet spot hint
Mode	Single note or Chord selection	Single / Chord	—
Note	Which pitch class to extract (C through B) — acts as root in Chord mode	12 notes	—
Chord	Chord quality: Maj, Min, D7, M7, m7, Dim, Aug (visible in Chord mode only)	7 types	—
Tuning	A4 reference frequency	430 — 450 Hz	440 Hz standard; adjust for vintage or non-standard tunings
Width	Pitch tolerance — how much spectral blur around each target note	0 — 100%	10% for precise extraction; 30%+ for looser, more forgiving grabs
Invert	Swap Extracted and Rejected outputs	On / Off	—

Micro Transient Extractor

What it does Detects and isolates ultra-short events under 10 ms — clicks, pops, vinyl crackle, digital glitches.

When you’d reach for it You have a recording with vinyl clicks or digital pops and want to grab them for removal, or you want to harvest those tiny artifacts as a percussion layer.

Quick example

Feed a vinyl transfer into Micro Transient Extractor.
Set Duration to 3 ms and Sensitivity to 80%.
The Extracted output is a stream of isolated clicks.
Flip Invert to get the cleaned audio on Out1.

Parameters

Parameter	What it controls	Range	Sweet spot hint
Duration	Length of the capture window around each micro-event	1 — 10 ms	2—5 ms for clicks; 8—10 ms for slightly longer pops
Sensitivity	Detection threshold (higher = more events caught)	0 — 100%	60—80% for typical vinyl; lower for sparse glitches
Interval	Minimum gap between detected events	1 — 50 ms	10 ms prevents merging adjacent crackle
Invert	Swap Extracted and Rejected outputs	On / Off	—

Silence Extractor

What it does Finds quiet or silent regions in your audio based on an energy threshold.

When you’d reach for it You want to isolate the gaps between phrases for creative filling, or trim dead air from a long recording.

Quick example

Feed a spoken-word recording into Silence Extractor.
Set Threshold to -55 dB and Duration to 200 ms.
The Extracted output contains only the pauses.
Flip Invert to keep just the speech with silences removed.

Parameters

Parameter	What it controls	Range	Sweet spot hint
Threshold	Energy level below which a frame counts as silent	-80 — -20 dB	-60 dB for clean recordings; -45 dB for noisy sources
Smoothing	Softens the edges of detected silent regions	0 — 100%	30% for gentle fades; 0% for hard cuts
Duration	Minimum length a quiet region must last to count as silence	10 — 1000 ms	100—200 ms skips brief pauses; 500+ ms catches only long gaps
Invert	Swap Extracted and Rejected outputs	On / Off	—