Dual Savitzky-Golay Baseline Ratio (DSGBR) detects spectral peaks in frequency-domain signals. It is built for dense, noisy power spectra from fluid dynamics, vibration analysis, and related experimental work, where spectra slope over several decades and a fixed prominence threshold either drowns in low-frequency power or misses everything at high frequency.
A ten-harmonic frequency comb in a noisy, sloped synthetic spectrum: all ten
harmonics recovered with the default detector parameters, and no detections
away from the comb. The weakest harmonics barely clear the noise scatter, and
the comb crowds together on the log axis — both regimes where fixed-threshold
detectors merge or miss peaks. Generated by examples/readme_figure.py.
pip install dsgbrFor development:
git clone https://github.com/ricardofrantz/dsgbr.git
cd dsgbr
uv pip install -e ".[dev]"import numpy as np
from dsgbr import dsgbr_detector
# Synthetic PSD with known peaks
frequencies = np.linspace(0.001, 1.0, 2048)
psd = np.ones_like(frequencies)
psd[400] = 12.0 # inject a peak
psd[1200] = 8.0 # inject another
peak_f, peak_h = dsgbr_detector(frequencies, psd)
print(f"Detected {peak_f.size} peaks at f = {peak_f}")The defaults find both peaks. To tune, pass a case_info dictionary with
full names or short aliases:
peak_f, peak_h = dsgbr_detector(frequencies, psd, case_info={"RT": 2.5, "SW": 5})DSGBR compares the spectrum against a local estimate of its own background, so acceptance is a ratio rather than an absolute height: a peak three times above its surroundings is detected the same way at any frequency, regardless of the slope between them.
flowchart TD
PSD(["PSD P(f)"]) --> SEARCH["SEARCH
narrow Savitzky-Golay smooth"]
PSD --> BASELINE["BASELINE
wide rolling median"]
SEARCH --> RATIO{"SEARCH / BASELINE
≥ ratio_threshold ?"}
BASELINE --> RATIO
RATIO --> SPACING["greedy spacing selection
(strongest first)"]
SPACING --> REFINE["refine to raw-PSD maxima"]
REFINE --> ULF["ultra-low-frequency
Q-factor guardrail"]
ULF --> BANDS["band-balanced selection
(if > max_peaks)"]
BANDS --> OUT(["peak_frequencies, peak_heights"])
The two series are deliberately different estimators: SEARCH is a narrow
Savitzky-Golay smooth that suppresses single-bin noise while keeping peak
shapes, and BASELINE is a wide rolling median of the raw PSD, which sits
under narrow peaks instead of being dragged up by them. Accepted candidates
then pass spacing rules, are repositioned onto the raw PSD, and survive an
ultra-low-frequency guardrail that rejects broad leakage bumps near the left
edge of the spectrum. The full pipeline, design rationale, and parameter
sensitivity study are in docs/algorithm.md.
| Scenario | DSGBR F1 | tuned find_peaks F1 |
|---|---|---|
| clean_tones | 1.000 ± 0.000 | 1.000 ± 0.000 |
| dense_lowfreq | 0.658 ± 0.246 | 0.440 ± 0.310 |
| steep_slope | 0.967 ± 0.103 | 0.872 ± 0.149 |
| noisy_welch | 0.447 ± 0.251 | 0.296 ± 0.312 |
| no_peaks | 0.000 ± 0.000 | 0.000 ± 0.000 |
Fresh run: uv run python -m benchmarks.compare, 20 evaluation realizations per
scenario after per-scenario tuning of scipy.signal.find_peaks on 8 training
realizations.
DSGBR leads on the sloped, dense, and noisy scenarios above; parity is expected
for flat, well-separated peaks such as clean_tones, and both detectors return
no F1 credit on no_peaks. See docs/algorithm.md Parameter
sensitivity and benchmarks/
for reproduction.
All parameters are set through DetectionConfig or passed as a dictionary
via the case_info argument. Short aliases (RT, SW, BWF, etc.) are
supported for concise configuration.
| Parameter | Alias | Default | Description |
|---|---|---|---|
ratio_threshold |
RT | 3.3 | Min SEARCH/BASELINE ratio for acceptance |
smooth_window |
SW | 3 | Savitzky-Golay window for SEARCH (odd, >= 3) |
baseline_window_frac |
BWF | 0.05 | Baseline window as fraction of data length |
distance_low |
DL | 2 | Min bin separation below switch_frequency |
distance_high |
DH | 1 | Min bin separation above switch_frequency |
switch_frequency |
SF | 0.02 | Frequency threshold for spacing rules |
max_peaks |
MP | 25 | Maximum peaks returned |
smooth_polyorder |
— | 2 | Polynomial order for SG filter |
smooth_on_log |
— | True | Smooth log10(PSD) instead of linear |
baseline_window |
— | None | Fixed baseline window (overrides BWF) |
baseline_on_log |
— | True | Baseline smoothing in log domain |
band_strategy |
— | proportional | Band allocation: proportional or equal |
n_bands |
— | 10 | Number of logarithmic frequency bands |
ulf_fmax |
— | 0.001 | ULF band upper frequency limit |
ulf_min_q |
— | 9.0 | Minimum Q-factor for ULF peaks |
ulf_max_points |
— | 5 | Maximum ULF peaks to retain |
from dsgbr import compute_support_series
support = compute_support_series(frequencies, psd, case_info={"RT": 2.0})
# Plot SEARCH vs BASELINE overlay
import matplotlib.pyplot as plt
plt.semilogy(frequencies, support["search_series"], label="SEARCH")
plt.semilogy(frequencies, support["baseline_series"], label="BASELINE")
plt.semilogy(frequencies, support["rthreshold"], "--", label="Threshold")
plt.legend()
plt.show()from dsgbr import select_peaks_by_frequency_bands
# Reduce 100 peaks to 15, spread across frequency bands
sel_f, sel_h = select_peaks_by_frequency_bands(
peak_frequencies, peak_heights,
max_peaks=15, strategy="proportional", n_bands=8,
)from dsgbr import DetectionConfig
cfg = DetectionConfig(ratio_threshold=2.5, smooth_window=7, max_peaks=10)
print(cfg.to_metadata())| Function / Class | Description |
|---|---|
dsgbr_detector(f, psd, *, case_info, return_support) |
Main detection pipeline |
compute_support_series(f, psd, case_info) |
Return intermediate arrays for visualization |
select_peaks_by_frequency_bands(f, h, *, max_peaks, strategy, n_bands) |
Band-balanced down-selection |
find_nearest_frequency(target, frequencies, heights) |
Closest detected frequency lookup |
DetectionConfig |
Frozen dataclass with 17 parameters |
detect_peaks_case_adaptive(...) |
Deprecated alias for dsgbr_detector |
DSGBR_PARAM_ALIASES |
Short-to-long parameter name mapping |
See examples/ for runnable scripts:
basic_usage.py— minimal detection exampleparameter_tuning.py— sweep ratio_threshold, compare peak countsvisualization.py— SEARCH/BASELINE overlay plotreadme_figure.py— regenerate the figure at the top of this page
If you use DSGBR in your research, please cite:
@software{dsgbr2026,
author = {Frantz, Ricardo},
title = {{DSGBR}: Dual Savitzky--Golay Baseline Ratio spectral peak detector},
year = {2026},
url = {https://github.com/ricardofrantz/dsgbr},
}BSD 3-Clause. See LICENSE.
Contributions are welcome. Please open an issue to discuss changes before submitting a pull request. Run the full QA suite before submitting:
uv pip install -e ".[dev]"
pre-commit run --all-files
pytest --cov=dsgbr
