This short article equates Daniel Falbel‘s ‘ Easy Audio Category’ short article from tensorflow/keras
to torch/torchaudio
The primary objective is to present torchaudio and show its contributions to the torch
environment. Here, we concentrate on a popular dataset, the audio loader and the spectrogram transformer. A fascinating side item is the parallel in between torch and tensorflow, revealing in some cases the distinctions, in some cases the resemblances in between them.
Downloading and Importing
torchaudio has the speechcommand_dataset
integrated in. It strains background_noise by default and lets us pick in between variations v0.01
and v0.02
# set an existing folder here to cache the dataset
DATASETS_PATH <% choose
(
class , pred_class)
, freq = x
$ n , col
= ifelse (
x
$
right, " lightblue"
, [1]" red"
)
, border = ifelse
([1] x
$
right , " lightblue"[1]
, " red")[, 1:10]
,[ CPUFloatType{1,10} ]
alpha = 0.6
,
conceal = x
$
n<