Convert the liepa2 corpus to fairseq
Jan 13, 2023
Raw, 8? 16? byte header, big endian PCM 44.1k
Jan 3, 2023
I wanted to know if it could be used to write an ARPA LM. It cannot
Dec 6, 2022
For Whisper's output
Nov 11, 2022
Because life's too short to install Kaldi again
Nov 10, 2022
wav2vec2 espeak phonetic model
Oct 18, 2022
Convert chunks to a tree
Oct 3, 2022
Writing the tsv/ltr files; from Kaggle
May 7, 2022
Resampled wav, more normalised text. From Kaggle
May 4, 2022
Notebook to split the audio from the liepa2 corpus
May 4, 2022
From Kaggle
May 4, 2022
LJSpeech comes with a normalised version, but it needs some extra work
May 2, 2022
Snippet to get Hunspell-based hyphenisation from pyphen
Apr 26, 2022
For use with PSST
Apr 4, 2022
With wav2vec2 and HuggingFace transformers
Mar 8, 2022