A Depth-First Iterative Algorithm for the Conjugate Pair Fast Fourier Transform

10.36227/techrxiv.13489392.v1 ◽

2020 ◽

Author(s):

Alexandre Becoulet ◽

Amandine Verguet

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Memory Bandwidth ◽

Conjugate Pair ◽

Arithmetic Complexity ◽

Function Calls ◽

Novel Approaches ◽

The Cost ◽

Execution Pattern ◽

Bitwise Operations

The Split-Radix Fast Fourier Transform has the same low arithmetic complexity as the related Conjugate Pair Fast Fourier Transform. Both transforms have an irregular datapath structure which is straightforwardly expressed only in recursive forms. Furthermore, the conjugate pair variant has a complicated input indexing pattern which requires existing iterative implementations to rely on precomputed tables. It however allows optimization of the memory bandwidth as it requires a single twiddle factor load per radix-4 butterfly. In existing algorithms, this comes at the cost of using additional precomputed tables or performing recursive function calls. In this paper we present two novel approaches that handle both the butterfly scheduling and the input index generation of the Conjugate Pair Fast Fourier Transform. The proposed algorithm is cache-friendly because it is depth-first, non-recursive and does not rely on precomputed index tables. In order to achieve this, we relate the butterfly execution pattern of the Split-Radix and Conjugate Pair FFTs to the binary carry sequence. Based on this finding, we describe how common integer arithmetic and bitwise operations can be used to perform input reordering and depth-first traversal of the transform datapath with O(1) space complexity.<br>

Download Full-text

Erratum: Conjugate pair fast Fourier transform

Electronics Letters ◽

10.1049/el:19891040 ◽

1989 ◽

Vol 25 (22) ◽

pp. 1547

Author(s):

I. Kamar ◽

Y. Elcherif

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Conjugate Pair

Download Full-text

Conjugate pair fast Fourier transform

Electronics Letters ◽

10.1049/el:19890225 ◽

1989 ◽

Vol 25 (5) ◽

pp. 324 ◽

Cited By ~ 14

Author(s):

I. Kamar ◽

Y. Elcherif

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Conjugate Pair

Download Full-text

Reducing the Cost of Implementing Filters in LoRa Devices

Sensors ◽

10.3390/s19184037 ◽

2019 ◽

Vol 19 (18) ◽

pp. 4037

Author(s):

Shania Stewart ◽

Ha H. Nguyen ◽

Robert Barton ◽

Jerome Henry

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

System Performance ◽

Pulse Shaping ◽

Error Rates ◽

Lookup Table ◽

Significant Performance ◽

High Level ◽

The Cost ◽

The Impact

This paper presents two methods to optimize LoRa (Low-Power Long-Range) devices so that implementing multiplier-less pulse shaping filters is more economical. Basic chirp waveforms can be generated more efficiently using the method of chirp segmentation so that only a quarter of the samples needs to be stored in the ROM. Quantization can also be applied to the basic chirp samples in order to reduce the number of unique input values to the filter, which in turn reduces the size of the lookup table for multiplier-less filter implementation. Various tests were performed on a simulated LoRa system in order to evaluate the impact of the quantization error on the system performance. By examining the occupied bandwidth, fast Fourier transform used for symbol demodulation, and bit-error rates, it is shown that even performing a high level of quantization does not cause significant performance degradation. Therefore, the memory requirements of LoRa devices can be significantly reduced by using the methods of chirp segmentation and quantization so as to improve the feasibility of implementing multiplier-less filters in LoRa devices.

Download Full-text

Comment: Conjugate pair fast Fourier transform

Electronics Letters ◽

10.1049/el:19920721 ◽

1992 ◽

Vol 28 (12) ◽

pp. 1143 ◽

Cited By ~ 7

Author(s):

A.M. Krot ◽

H.B. Minervina

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Conjugate Pair

Download Full-text

Comment: Conjugate pair fast Fourier transform

Electronics Letters ◽

10.1049/el:19900351 ◽

1990 ◽

Vol 26 (8) ◽

pp. 541 ◽

Cited By ~ 6

Author(s):

H.-S. Qian ◽

Z.-J. Zhao

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Conjugate Pair

Download Full-text

Memory Bandwidth Efficient Two-Dimensional Fast Fourier Transform Algorithm and Implementation for Large Problem Sizes

2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines ◽

10.1109/fccm.2012.40 ◽

2012 ◽

Cited By ~ 18

Author(s):

Berkin Akin ◽

Peter A. Milder ◽

Franz Franchetti ◽

James C. Hoe

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Memory Bandwidth ◽

Two Dimensional ◽

Fast Fourier Transform Algorithm ◽

Large Problem

Download Full-text

Spectral analysis of the f0F2 data obtained at five PRIME sites during a 15 minute campaign in June 1993

Annals of Geophysics ◽

10.4401/ag-4011 ◽

1996 ◽

Vol 39 (4) ◽

Author(s):

Y. Tulunay ◽

S. A. Baykal ◽

Y. G. Yigit ◽

I. Stanislawska ◽

A. Rokicki ◽

...

Keyword(s):

Spectral Analysis ◽

Fourier Transform ◽

Fast Fourier Transform ◽

Confidence Level ◽

Original Data ◽

Easy Method ◽

The Cost

During the COST 238: PRIME project there was a campaign of 15-min intervals of the f0F2 soundings at Kandilli, Rome, Sofia, Poitiers and Lannion. The campaign took place for one month in June 1993. The spectral analysis of the data using a Fast Fourier Transform (FFT) algorithm proved a relatively easy method to reconstruct the original data at the confidence level = 0.05.

Download Full-text

TWO PARALLEL 1-D FFT ALGORITHMS WITHOUT ALL-TO-ALL COMMUNICATION

Parallel Processing Letters ◽

10.1142/s012962640600254x ◽

2006 ◽

Vol 16 (02) ◽

pp. 153-164

Author(s):

RAMI AL NA'MNEH ◽

W. DAVID PAN ◽

SEONG-MOO YOO

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

Parallel Systems ◽

Parallel Computers ◽

Limiting Factor ◽

Beowulf Cluster ◽

Simulation Results ◽

The Cost ◽

Fft Algorithms

Computing the 1-D Fast Fourier Transform (FFT) using the conventional six-step FFT algorithm on parallel computers requires intensive all-to-all communication due to the necessity of matrix transpose in three steps. This all-to-all communication is a limiting factor in improving the performance of FFT in its parallel implementations. In this paper, we present two parallel algorithms for implementing the 1-D FFT without all-to-all communication between processors, at the expense of increased inner-processor computation as compared to the conventional six-step FFT algorithm. Our analysis reveals the advantage of these two algorithms over the six-step FFT algorithm in parallel systems where the cost of inter-processor communication outweighs the cost of inner-processor computation. As a case study, we choose a 32-node Beowulf cluster with fast processors (running at 2 GHz) but relatively slow inter-processor communication (over a 100 Mbit/s switch). Simulation results on this cluster demonstrate that the proposed no-communication FFT algorithms can achieve a speedup ranging from 1.1 to 1.5 over the six-step FFT algorithm.

Download Full-text

Modules for Pipelined Mixed Radix FFT Processors

International Journal of Reconfigurable Computing ◽

10.1155/2016/3561317 ◽

2016 ◽

Vol 2016 ◽

pp. 1-7

Author(s):

Anatolij Sergiyenko ◽

Anastasia Serhienko

Keyword(s):

Fourier Transform ◽

Fast Fourier Transform ◽

High Speed ◽

Sampling Frequency ◽

Data Sampling ◽

Clock Frequency ◽

Ip Cores ◽

Point Fast Fourier Transform ◽

The Cost

A set of soft IP cores for the Winogradr-point fast Fourier transform (FFT) is considered. The cores are designed by the method of spatial SDF mapping into the hardware, which provides the minimized hardware volume at the cost of slowdown of the algorithm byrtimes. Their clock frequency is equal to the data sampling frequency. The cores are intended for the high-speed pipelined FFT processors, which are implemented in FPGA.

Download Full-text