docs(comparison): clarify NoiseRemove compand and document silence detection capabilities

flexiondotorg · flexiondotorg · commit 0daad35eda62 · 2026-03-14T18:33:49.000Z
- Correct Pass 2 NoiseRemove row: compand is disabled when no noise profile
  exists; note that the anlmdn filter is always active
- Qualify compand dependency on noise profile in Section 4 capabilities
- Add two newly implemented Speech-Aware Processing capabilities:
  * Digital silence rejection in room tone candidate selection
  * Voice-activated recording detection from silence fraction

Signed-off-by: Martin Wimpress <code@wimpress.io>
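
Illustrative note (not part of the change): the two silence-handling rules documented in this commit can be sketched as below. This is a minimal sketch that assumes each silence candidate is summarised by a single level in dBFS. The function, class, and constant names are hypothetical and are not taken from the Jivetalking source; only the -115 dBFS cutoff, the 95% fraction, and the 2s/10s tolerances come from the documentation text in the diff.

```python
from dataclasses import dataclass

# Values quoted in the documentation; the names are illustrative only.
DIGITAL_SILENCE_DBFS = -115.0        # candidates below this are digital silence
VOICE_ACTIVATED_FRACTION = 0.95      # share of digital-silence candidates that flags a recording
DEFAULT_TOLERANCE_S = 2.0            # normal speech interruption tolerance
VOICE_ACTIVATED_TOLERANCE_S = 10.0   # widened tolerance for voice-activated recordings


@dataclass
class SilenceReport:
    room_tone_candidates: list[float]  # levels still usable as room tone
    voice_activated: bool
    interruption_tolerance_s: float


def classify_silence(candidate_levels_dbfs: list[float]) -> SilenceReport:
    """Reject digital silence and flag voice-activated recordings (hypothetical helper)."""
    usable = [lvl for lvl in candidate_levels_dbfs if lvl >= DIGITAL_SILENCE_DBFS]
    digital_count = len(candidate_levels_dbfs) - len(usable)

    # With no candidates at all there is nothing to base a flag on.
    fraction = digital_count / len(candidate_levels_dbfs) if candidate_levels_dbfs else 0.0
    voice_activated = fraction >= VOICE_ACTIVATED_FRACTION

    tolerance = VOICE_ACTIVATED_TOLERANCE_S if voice_activated else DEFAULT_TOLERANCE_S
    return SilenceReport(usable, voice_activated, tolerance)


if __name__ == "__main__":
    # 99 digitally silent candidates and one real room-tone candidate.
    report = classify_silence([-120.0] * 99 + [-60.0])
    print(report.voice_activated, report.interruption_tolerance_s, report.room_tone_candidates)
```

In this sketch a recording with no usable room tone yields an empty candidate list, which corresponds to the "no noise profile" case in which the documentation says compand is disabled while `anlmdn` stays active.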
diff --git a/docs/Levelator-Comparison-And-Gap-Analysis.md b/docs/Levelator-Comparison-And-Gap-Analysis.md
@@ -102,7 +102,7 @@ downmix → ds201_highpass → ds201_lowpass → noiseremove → ds201_gate →
 |--------|---------------------|-------|
 | **DS201 Highpass** | Frequency (60-120Hz), poles, mix | Spectral centroid, spectral decrease, noise floor |
 | **DS201 Lowpass** | Cutoff frequency, enable/disable | Content type detection (speech/music/mixed), rolloff, ZCR |
-| **NoiseRemove** | Compand threshold, expansion depth | Measured noise floor + 5 dB, noise severity |
+| **NoiseRemove** | Compand threshold, expansion depth (disabled when no noise profile) | Measured noise floor + 5 dB, noise severity; compand requires an elected silence region — `anlmdn` always active |
 | **DS201 Gate** | Threshold, ratio, attack, release, range, knee | LRA, noise floor, quiet speech estimate, spectral flux, entropy |
 | **LA-2A Compressor** | Threshold, ratio, attack, release, knee, mix | Kurtosis, flux, dynamic range, spectral centroid |
 | **De-esser** | Intensity (0.0-0.6) | Spectral centroid + rolloff |
@@ -120,7 +120,8 @@ downmix → ds201_highpass → ds201_lowpass → noiseremove → ds201_gate →
 
 Jivetalking employs speech profile extraction for adaptive tuning:
 
-- **Silence detection:** Uses 250ms interval sampling with spectral analysis
+- **Silence detection:** Uses 250ms interval sampling with spectral analysis; digital silence (below -115 dBFS) rejected as unsuitable room tone
+- **Voice-activated detection:** Recordings where >= 95% of silence candidates are digital silence (Riverside, Zencastr) are flagged; speech interruption tolerance widens from 2s to 10s for accurate speech region extraction
 - **Speech region detection:** Finds representative speech segments (30s+ duration)
 - **Golden sub-region refinement:** Identifies cleanest sub-windows for noise/speech profiling
 - **Speech metrics:** RMS level, crest factor, spectral centroid, kurtosis, flux for each profile
@@ -219,7 +220,7 @@ Jivetalking employs speech profile extraction for adaptive tuning:
 
 ### Capabilities Jivetalking Has That Levelator Lacked
 
-1. **Noise Reduction:** Non-Local Means denoising with adaptive compand
+1. **Noise Reduction:** Non-Local Means denoising (`anlmdn`) always active; adaptive compand applied when a silence region is elected as the noise profile
 2. **Gating:** Soft expander for inter-speech cleanup
 3. **True Peak Limiting:** Prevents inter-sample peaks
 4. **De-essing:** Automatic sibilance control