Task list, 28/9/2021
Daily todo
Today
-
separation script: spleeter: see run_spleeter.py
-
Extend abair xml to return list of timestamps; segment long recordings: notebook
-
Add LM and timings: see here, repo, file, this issue, parlance/ctcdecode, wav2vec2_kenlm.py
-
Fingerprint for known audio: dejavu
-
Pass over input data, with this or something similar
-
MFA, based on this
Look into:
- Add official ASR CTC example to examples/pytorch/speech-recognition
- Rewrite padding logic from pure python to numpy
- Non-Adversarial Unsupervised Word Translation
- Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
- grtzsohalf/Audio-Phonetic-and-Semantic-Embedding
- SpeechToolsWorkers
Personal
--match-filter "license='Creative Commons Attribution license (reuse allowed)'"
Longer term
-
Scrape more Ros na Rún
-
Compare this with stuff from last year
-
Segmentation: run_cleanup_segmentation.sh, tedlium, AMI
Look at: