Abstract
Recently, neural network (NN)-based fundamental frequency estimators have shown great promise in complex real-world problems such as audio processing, where model-based approaches tend to oversimplify. However, valuable insights can be gained by studying NNs under simpler and more controlled conditions. In this paper, we study the threshold effect, i.e., the rapid decrease in estimation accuracy with decreasing signal-to-noise ratio (SNR), of NN-based fundamental frequency estimators. We use the multiharmonic signal model and simulated data, which allows us to compare the networks with the threshold behavior of the maximum likelihood estimator (MLE) and with the Cramer-Rao lower bound (CRLB) above the threshold SNR. Finally, good practices for the design and training of such networks can be derived from our studies.
Original language | English |
---|---|
Title of host publication | Proceedings of the Asilomar Conference on Signals, Systems, and Computers (ACSSC 2024) |
Publisher | IEEE |
Number of pages | 6 |
Publication status | Published - Oct 2024 |
Fields of science
- 102019 Machine learning
- 202 Electrical Engineering, Electronics, Information Engineering
- 202037 Signal processing
JKU Focus areas
- Digital Transformation