Commit 0daad35

docs(comparison): clarify NoiseRemove compand and document silence detection capabilities

- Correct Pass 2 NoiseRemove row: compand disabled when no noise profile exists; note that anlmdn filter always active
- Qualify compand dependency on noise profile in Section 4 capabilities
- Add two newly implemented Speech-Aware Processing capabilities:
  * Digital silence rejection in room tone candidate selection
  * Voice-activated recording detection from silence fraction

Signed-off-by: Martin Wimpress <code@wimpress.io>
1 parent 9269d0e commit 0daad35

1 file changed

Lines changed: 4 additions & 3 deletions

docs/Levelator-Comparison-And-Gap-Analysis.md

@@ -102,7 +102,7 @@ downmix → ds201_highpass → ds201_lowpass → noiseremove → ds201_gate →
 |--------|---------------------|-------|
 | **DS201 Highpass** | Frequency (60-120Hz), poles, mix | Spectral centroid, spectral decrease, noise floor |
 | **DS201 Lowpass** | Cutoff frequency, enable/disable | Content type detection (speech/music/mixed), rolloff, ZCR |
-| **NoiseRemove** | Compand threshold, expansion depth | Measured noise floor + 5 dB, noise severity |
+| **NoiseRemove** | Compand threshold, expansion depth (disabled when no noise profile) | Measured noise floor + 5 dB, noise severity; compand requires an elected silence region — `anlmdn` always active |
 | **DS201 Gate** | Threshold, ratio, attack, release, range, knee | LRA, noise floor, quiet speech estimate, spectral flux, entropy |
 | **LA-2A Compressor** | Threshold, ratio, attack, release, knee, mix | Kurtosis, flux, dynamic range, spectral centroid |
 | **De-esser** | Intensity (0.0-0.6) | Spectral centroid + rolloff |
@@ -120,7 +120,8 @@ downmix → ds201_highpass → ds201_lowpass → noiseremove → ds201_gate →
 
 Jivetalking employs speech profile extraction for adaptive tuning:
 
-- **Silence detection:** Uses 250ms interval sampling with spectral analysis
+- **Silence detection:** Uses 250ms interval sampling with spectral analysis; digital silence (below -115 dBFS) rejected as unsuitable room tone
+- **Voice-activated detection:** Recordings where >= 95% of silence candidates are digital silence (Riverside, Zencastr) are flagged; speech interruption tolerance widens from 2s to 10s for accurate speech region extraction
 - **Speech region detection:** Finds representative speech segments (30s+ duration)
 - **Golden sub-region refinement:** Identifies cleanest sub-windows for noise/speech profiling
 - **Speech metrics:** RMS level, crest factor, spectral centroid, kurtosis, flux for each profile
@@ -219,7 +220,7 @@ Jivetalking employs speech profile extraction for adaptive tuning:
 
 ### Capabilities Jivetalking Has That Levelator Lacked
 
-1. **Noise Reduction:** Non-Local Means denoising with adaptive compand
+1. **Noise Reduction:** Non-Local Means denoising (`anlmdn`) always active; adaptive compand applied when a silence region is elected as the noise profile
 2. **Gating:** Soft expander for inter-speech cleanup
 3. **True Peak Limiting:** Prevents inter-sample peaks
 4. **De-essing:** Automatic sibilance control
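
Reviewer note: the silence-handling rules documented in this change (digital silence below -115 dBFS rejected as room tone, recordings flagged as voice-activated when at least 95% of silence candidates are digital silence, interruption tolerance widening from 2s to 10s) can be read as the decision logic sketched below. This is a minimal illustration only, not the project's code: the type `SilenceCandidate`, its fields, and every function name are hypothetical, and the assumption that the level is an RMS measurement per candidate is mine. Only the three numeric thresholds come from the documentation text.

```go
package main

import "fmt"

// Thresholds taken from the documentation text in this commit.
const (
	digitalSilenceDBFS  = -115.0 // below this, a candidate counts as digital silence
	voiceActivatedRatio = 0.95   // fraction of digital-silence candidates that flags a recording
	defaultToleranceSec = 2.0    // speech interruption tolerance for ordinary recordings
	widenedToleranceSec = 10.0   // tolerance for voice-activated recordings
)

// SilenceCandidate is a hypothetical record for one detected silence region.
type SilenceCandidate struct {
	StartSec float64
	RMSdBFS  float64 // assumed: measured level of the region in dBFS
}

// IsDigitalSilence reports whether a candidate is below the digital silence threshold.
func IsDigitalSilence(c SilenceCandidate) bool {
	return c.RMSdBFS < digitalSilenceDBFS
}

// RoomToneCandidates drops digital silence, which carries no usable room tone.
func RoomToneCandidates(cs []SilenceCandidate) []SilenceCandidate {
	kept := make([]SilenceCandidate, 0, len(cs))
	for _, c := range cs {
		if !IsDigitalSilence(c) {
			kept = append(kept, c)
		}
	}
	return kept
}

// IsVoiceActivated flags a recording when at least 95% of its silence
// candidates are digital silence. An empty candidate list is never flagged.
func IsVoiceActivated(cs []SilenceCandidate) bool {
	if len(cs) == 0 {
		return false
	}
	digital := 0
	for _, c := range cs {
		if IsDigitalSilence(c) {
			digital++
		}
	}
	return float64(digital)/float64(len(cs)) >= voiceActivatedRatio
}

// InterruptionToleranceSec returns the speech interruption tolerance to use.
func InterruptionToleranceSec(cs []SilenceCandidate) float64 {
	if IsVoiceActivated(cs) {
		return widenedToleranceSec
	}
	return defaultToleranceSec
}

func main() {
	cs := []SilenceCandidate{{0, -120}, {12.5, -130}, {40, -62}}
	fmt.Println("room tone candidates:", len(RoomToneCandidates(cs)))
	fmt.Println("voice activated:", IsVoiceActivated(cs))
	fmt.Println("tolerance (s):", InterruptionToleranceSec(cs))
}
```

Treating the two rules as separate predicates keeps room tone selection independent of the recording-level flag, which matches how the documentation lists them as two distinct capabilities.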

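Reviewer note: the corrected NoiseRemove row says that `anlmdn` is always active, that compand is disabled when no noise profile exists, and that the compand threshold derives from the measured noise floor plus 5 dB. A minimal sketch of that rule follows, under stated assumptions: `NoiseProfile`, `NoiseRemovePlan`, and `PlanNoiseRemove` are hypothetical names, a nil profile stands in for "no silence region was elected", and the expansion depth mentioned in the table is left out because the documentation does not say how it is derived.

```go
package main

import "fmt"

// Offset above the measured noise floor, as stated in the NoiseRemove table row.
const compandOffsetDB = 5.0

// NoiseProfile is a hypothetical result of electing a silence region.
type NoiseProfile struct {
	NoiseFloorDB float64 // measured noise floor of the elected region
}

// NoiseRemovePlan is a hypothetical description of what the stage will run.
type NoiseRemovePlan struct {
	AnlmdnEnabled      bool
	CompandEnabled     bool
	CompandThresholdDB float64 // meaningful only when CompandEnabled is true
}

// PlanNoiseRemove always enables anlmdn and enables compand only when a
// noise profile is available (nil means no silence region was elected).
func PlanNoiseRemove(profile *NoiseProfile) NoiseRemovePlan {
	plan := NoiseRemovePlan{AnlmdnEnabled: true}
	if profile == nil {
		return plan
	}
	plan.CompandEnabled = true
	plan.CompandThresholdDB = profile.NoiseFloorDB + compandOffsetDB
	return plan
}

func main() {
	fmt.Printf("%+v\n", PlanNoiseRemove(nil))
	fmt.Printf("%+v\n", PlanNoiseRemove(&NoiseProfile{NoiseFloorDB: -60}))
}
```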