I can't remember what this was for; I'm sure I'll be reminded
Dec 15, 2023
Reading old data
Oct 17, 2023
Between English Wiktionary and phoneme recognition output
Oct 10, 2023
Fairseq data preparation for Waxholm phonetic transcriptions
Aug 10, 2023
Mostly, it's the push_to_hub part that I'll forget
Jan 21, 2023
Convert the liepa2 corpus to fairseq
Jan 13, 2023
Raw, 8? 16? byte header, big endian PCM 44.1k
Jan 3, 2023
I wanted to know if it could be used to write an ARPA LM. It cannot
Dec 6, 2022
For Whisper's output
Nov 11, 2022
Because life's too short to install Kaldi again
Nov 10, 2022
wav2vec2 espeak phonetic model
Oct 18, 2022
Convert chunks to a tree
Oct 3, 2022
Normalisation, adding boilerplate
Jun 30, 2022
Writing the tsv/ltr files; from Kaggle
May 7, 2022
Resampled wav, more normalised text. From Kaggle
May 4, 2022