# Investigating kernel shapes and skip connections for deep learning-based   harmonic-percussive separation

**Authors:** Carlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven Ahlb\"ack

arXiv: 1905.01899 · 2019-07-31

## TL;DR

This paper introduces an efficient deep learning encoder-decoder network for harmonic-percussive source separation, utilizing dense skip connections and varied kernel sizes to improve performance and reduce model complexity.

## Contribution

It presents a novel architecture with dense skip connections and variable kernel sizes that achieves state-of-the-art separation with fewer parameters and less training time.

## Key findings

- Achieves state-of-the-art HPSS performance.
- Reduces model parameters significantly.
- Maintains high-level feature learning.

## Abstract

In this paper we propose an efficient deep learning encoder-decoder network for performing Harmonic-Percussive Source Separation (HPSS). It is shown that we are able to greatly reduce the number of model trainable parameters by using a dense arrangement of skip connections between the model layers. We also explore the utilisation of different kernel sizes for the 2D filters of the convolutional layers with the objective of allowing the network to learn the different time-frequency patterns associated with percussive and harmonic sources more efficiently. The training and evaluation of the separation has been done using the training and test sets of the MUSDB18 dataset. Results show that the proposed deep network achieves automatic learning of high-level features and maintains HPSS performance at a state-of-the-art level while reducing the number of parameters and training time.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.01899/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1905.01899/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/1905.01899/full.md

---
Source: https://tomesphere.com/paper/1905.01899