Audio: Phase Vocoder: Add new component #10541

singalsu · 2026-02-11T13:43:32Z

WIP

The rename avoids possible conflicts with other math libraries. It also prepares for adding a 32-bit square root function. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>

This patch adds a higher-precision 32-bit fractional integer square root function to the SOF math library. The algorithm takes its initial value from a lookup table and refines it with two Newton-Raphson iterations. Both the input and the output are in Q2.30 format. The format was chosen to match the number range of complex-to-polar conversions for Q1.31 complex values. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>
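A minimal sketch of the table-plus-Newton-Raphson idea follows. It is not the code from this patch: the function name `sketch_sqrt_q30()`, the six-entry table, the input normalization step, and the use of a 64-bit division are all assumptions made for illustration.

```c
#include <stdint.h>

/*
 * Hypothetical initial guesses: sqrt() of the midpoints of six intervals
 * covering [0.5, 2.0) in steps of 0.25, in Q2.30. The values are rounded;
 * the Newton-Raphson iterations remove most of the residual error.
 */
static const int32_t sketch_sqrt_guess_q30[6] = {
	848867000,	/* sqrt(0.625) ~ 0.790569 */
	1004393000,	/* sqrt(0.875) ~ 0.935414 */
	1138875000,	/* sqrt(1.125) ~ 1.060660 */
	1259074000,	/* sqrt(1.375) ~ 1.172604 */
	1368758000,	/* sqrt(1.625) ~ 1.274755 */
	1470281000,	/* sqrt(1.875) ~ 1.369306 */
};

/* Square root with Q2.30 input and Q2.30 output; returns 0 for x <= 0 */
int32_t sketch_sqrt_q30(int32_t x)
{
	int64_t y;
	int32_t xn = x;
	int shift = 0;
	int i;

	if (x <= 0)
		return 0;

	/* Normalize to [0.5, 2.0) with an even shift so the result can be
	 * scaled back with half of the shift.
	 */
	while (xn < (1 << 29)) {
		xn <<= 2;
		shift += 2;
	}

	/* Initial value from the table, indexed by the top bits of xn */
	y = sketch_sqrt_guess_q30[(xn >> 28) - 2];

	/* Two Newton-Raphson iterations: y = (y + xn / y) / 2 in Q2.30 */
	for (i = 0; i < 2; i++)
		y = (y + (((int64_t)xn << 30) / y)) >> 1;

	return (int32_t)(y >> (shift / 2));
}
```

With a table this coarse the result is accurate to roughly 2e-5 relative error after two iterations; a finer table would reduce the error further, at the cost of more table memory.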

This patch helps with more generic use of complex numbers outside the FFT domain. It prepares for adding a polar complex number format that is commonly used in frequency domain signal processing. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>

This patch adds the functions sofm_icomplex32_to_polar() and sofm_ipolar32_to_complex(). In polar format, the Q1.31 (real, imag) number pair is converted to (magnitude, angle). The magnitude is in Q2.30 format and the angle is in radians from -pi to +pi in Q3.29 format. The conversion to polar and back loses some precision, so there is currently no support for icomplex16. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>
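The choice of formats can be checked with a small sketch. A Q1.31 pair with both components at full scale has a magnitude of about sqrt(2), which does not fit Q1.31 but fits Q2.30, and an angle of pi radians does not fit Q2.30 but fits Q3.29. The struct and helper names below are hypothetical and are not part of the patch.

```c
#include <stdint.h>

/* Hypothetical polar pair; the field names are assumptions for this sketch */
struct sketch_ipolar32 {
	int32_t magnitude;	/* Q2.30, range [-2.0, 2.0) */
	int32_t angle;		/* Q3.29, range [-4.0, 4.0) radians */
};

/* Convert a real value to a Q format with frac_bits fractional bits,
 * rounding to nearest and saturating to the int32_t range.
 */
static int32_t sketch_to_q(double value, int frac_bits)
{
	double scaled = value * (double)(1LL << frac_bits);

	if (scaled >= 2147483647.0)
		return INT32_MAX;
	if (scaled <= -2147483648.0)
		return INT32_MIN;

	return (int32_t)(scaled < 0.0 ? scaled - 0.5 : scaled + 0.5);
}

/* Convert a Q format value with frac_bits fractional bits back to double */
static double sketch_from_q(int32_t q, int frac_bits)
{
	return (double)q / (double)(1LL << frac_bits);
}
```

For example, `sketch_to_q(1.41421356, 31)` saturates, while `sketch_to_q(1.41421356, 30)` and `sketch_to_q(3.14159265, 29)` stay within the int32_t range.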

The testbench quits after three file module copies without data written. That value is too low for components that accumulate more than one LL period of data before producing output, or that cannot produce output at every copy. The value 10 should better ensure that the testbench run does not end too early. The testbench currently lacks the DP scheduler, so modules that are designed for DP can be run with this workaround. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>
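The quit condition can be pictured with the sketch below. It is not the testbench code: the names are hypothetical, and it assumes that the limit applies to consecutive copies without data, which the commit message does not state.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical limit matching the new value in the commit message */
#define SKETCH_IDLE_COPY_LIMIT 10

struct sketch_idle_tracker {
	int idle_copies;	/* consecutive copies that wrote no data */
};

/* Call after each file module copy; returns true when the run should end.
 * Assumption: any copy that writes data resets the idle count.
 */
static bool sketch_copy_done(struct sketch_idle_tracker *t, size_t bytes_written)
{
	if (bytes_written > 0) {
		t->idle_copies = 0;
		return false;
	}

	t->idle_copies++;
	return t->idle_copies >= SKETCH_IDLE_COPY_LIMIT;
}
```

With a limit of three, a module that buffers several LL periods before its first output could end the run before producing any data; a larger limit gives such modules time to start.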

Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>

This patch adds the Phase Vocoder SOF module. It provides render speed control in the range 0.5-2.0x. The pitch is preserved when the audio waveform is stretched or shortened. The module uses a frequency domain algorithm in the STFT domain to interpolate the magnitude and phase of output IFFT frames from input FFT frames. The render speed can be controlled with an enable/disable switch and an enum control in steps of 0.1, or with finer precision with a bytes control. (WIP) The STFT parameters are configured with a bytes control blob. The default is a 1024-point FFT with a hop of 256 and a Hann window. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>
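One common textbook formulation of the frame interpolation step is sketched below in floating point. It is an assumption that the module follows this formulation; the module itself works with fixed-point polar values, and all names here are hypothetical.

```c
#include <stddef.h>

#define SKETCH_PI	3.14159265358979f
#define SKETCH_TWO_PI	6.28318530717959f

/* Wrap an angle in radians to [-pi, pi) */
static float sketch_wrap_phase(float a)
{
	while (a >= SKETCH_PI)
		a -= SKETCH_TWO_PI;
	while (a < -SKETCH_PI)
		a += SKETCH_TWO_PI;
	return a;
}

/*
 * Produce one output frame at fractional position frac (0..1) between two
 * consecutive analysis frames (mag0, ph0) and (mag1, ph1).
 * phase_acc holds the running synthesis phase per bin across calls.
 */
static void sketch_pv_interpolate(const float *mag0, const float *ph0,
				  const float *mag1, const float *ph1,
				  float frac, float *phase_acc,
				  float *out_mag, float *out_ph, size_t bins)
{
	size_t k;

	for (k = 0; k < bins; k++) {
		/* Magnitude: linear interpolation between the two frames */
		out_mag[k] = (1.0f - frac) * mag0[k] + frac * mag1[k];

		/* Phase: emit the accumulated phase, then advance it by the
		 * wrapped phase difference measured between the input frames.
		 */
		out_ph[k] = phase_acc[k];
		phase_acc[k] = sketch_wrap_phase(phase_acc[k] +
						 sketch_wrap_phase(ph1[k] - ph0[k]));
	}
}
```

In this formulation the caller advances the read position by the speed factor for every output hop. At speed 0.5 each pair of input frames yields two output frames, so the audio is stretched while the per-bin phase advance, and therefore the pitch, stays the same.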

singalsu · 2026-02-11T15:38:05Z

Fixed s16 processing and made some improvements. The load on the MTL platform is 175 to 313 MCPS, depending on the selected speed.

lyakh

Do I understand it correctly that "vocoder" is a family of algorithms / audio processing methods, and this module implements one of them - speed control? Maybe call it vocoder_speed? Or does "Phase Vocoder" actually mean the same - only the speed processing part?

lyakh · 2026-02-12T10:38:27Z

src/audio/phase_vocoder/Kconfig

+# SPDX-License-Identifier: BSD-3-Clause
+
+config COMP_PHASE_VOCODER
+	tristate "Phase Vocoder component"


do you want to make it default m if LIBRARY_DEFAULT_MODULAR=

lyakh · 2026-02-12T10:39:18Z

app/boards/intel_adsp_ace30_ptl.conf

 # tests it can't use extra CONFIGs. See #9410, #8722 and #9386
 CONFIG_COMP_GOOGLE_RTC_AUDIO_PROCESSING=m
 CONFIG_GOOGLE_RTC_AUDIO_PROCESSING_MOCK=y
+CONFIG_COMP_PHASE_VOCODER=y


if you make it modular by default, then please drop =y from all ACE 3.0+ platforms

lyakh · 2026-02-12T10:53:29Z

src/audio/phase_vocoder/phase_vocoder-generic.c

+	fft_buf_ptr = fft->fft_buf;
+	for (j = 0; j < prev_data_size; j++) {
+		fft_buf_ptr->real = prev_data[j];
+		fft_buf_ptr->imag = 0;


I'm wondering if using *fft_buf_ptr = (struct icomplex32){.real = prev_data[j], .imag = 0} would give the compiler a better chance to optimise 64-bit writes, but maybe not.

Thanks, I'll check with a profiler run! As the heaviest open-source component so far, this will need a lot of optimization.
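The two forms discussed in this thread can be compared side by side in a small sketch. The struct below only mirrors the `real` and `imag` fields seen in the diff, and the function names are hypothetical. Both loops store the same values; whether the compiler merges the two 32-bit stores into one 64-bit store depends on the target and the optimization flags, so only a profiler run or a look at the generated code can settle it.

```c
#include <stddef.h>
#include <stdint.h>

/* Stand-in for the complex type used in the diff (assumed layout) */
struct sketch_icomplex32 {
	int32_t real;
	int32_t imag;
};

/* Field-by-field stores, as in the patch */
static void sketch_fill_fieldwise(struct sketch_icomplex32 *dst,
				  const int32_t *src, size_t n)
{
	size_t j;

	for (j = 0; j < n; j++) {
		dst[j].real = src[j];
		dst[j].imag = 0;
	}
}

/* Whole-struct store with a C99 compound literal, as suggested in review */
static void sketch_fill_literal(struct sketch_icomplex32 *dst,
				const int32_t *src, size_t n)
{
	size_t j;

	for (j = 0; j < n; j++)
		dst[j] = (struct sketch_icomplex32){ .real = src[j], .imag = 0 };
}
```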

lyakh · 2026-02-12T10:58:53Z

src/audio/phase_vocoder/phase_vocoder.c

+FILE *stft_debug_ifft_out_fh;
+#endif
+
+__cold static void phase_vocoder_reset_parameters(struct processing_module *mod)


it is called from .reset(), so it shouldn't be __cold?

lyakh · 2026-02-12T11:45:58Z

src/audio/phase_vocoder/phase_vocoder_common.c

+LOG_MODULE_REGISTER(phase_vocoder_common, CONFIG_SOF_LOG_LEVEL);
+
+/*
+ * The main processing function for PHASE_VOCODER


to which function is this comment referring?

singalsu · 2026-02-12T17:09:14Z

> Do I understand it correctly that "vocoder" is a family of algorithms / audio processing methods, and this module implements one of them - speed control? Maybe call it vocoder_speed? Or does "Phase Vocoder" actually mean the same - only the speed processing part?

I need to think about the name. Phase vocoder is more generic than this; the same technique also works for pitch shifting. The history of this algorithm is described by this Google AI response:

_"The phase vocoder algorithm, implemented in the Short-Time Fourier
Transform (STFT) domain, was first presented by James L. Flanagan and
Robert M. Golden in their 1966 paper, "Phase Vocoder," published in
The Journal of the Acoustical Society of America.

Original Paper: J. L. Flanagan and R. M. Golden, "Phase Vocoder,"
Journal of the Acoustical Society of America, vol. 40, no. 6,
pp. 1488, Nov. 1966.

Key Contribution: Flanagan and Golden proposed the phase vocoder as an
analysis-synthesis system that uses a filter bank (interpreted as a
sliding Short-Time Fourier Transform) to analyze speech by determining
the instantaneous phase and amplitude of signals within spectral
bands.

Significance: While earlier vocoders (Dudley, 1930s) used analog
hardware, the 1966 phase vocoder provided a digital, frequency-domain
approach to speech coding and processing, which later became the
foundation for time-stretching and pitch-shifting in audio
engineering."_

Name suggestions are welcome. Later we could also have a more generic architecture in SOF for frequency domain modules, with the STFT half spectrum streamed from module to module. Instead of samples we could stream spectral coefficients, e.g. in the 32-bit (magnitude, phase) format used here, which is quite generic for all frequency domain processing.

singalsu added 7 commits February 11, 2026 13:27

Math: Rename square root function sqrt_int16() to sofm_sqrt_int16()

acbb847

The rename avoids possible conflicts with other math libraries. It also prepares for adding a 32-bit square root function. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>

Tools: Topology: Phase Vocoder: Bench topology to test the component

2199842

Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>

singalsu force-pushed the phase_vocoder branch from 5c1da31 to e816969 Compare February 11, 2026 15:36

lyakh reviewed Feb 12, 2026


2 participants
