https://thegradient.pub/an-illustrated-tour-of-applying-bert-to-speech-data/
An Illustrated Tour of Applying BERT to Speech Data
Jonathan Boigne
10.May.2022
“Could we replace the text input in BERT with a speech sequence, mask a part of it, and similarly train the model to recover what is missing? Unfortunately, it is not as straightforward as one might assume.”