Interesting links, 9/11/2021
Misc. interesting things.
babysor/MockingBird — Chinese voice cloning
Irish lemmatiser for SpaCy, commit + data, commit
kylebgorman/SOTA-taggers — Code for Gorman & Bedrick’s “We need to talk about standard splits” (ACL ‘19)
kylebgorman/swipe — A pitch tracker using Camacho’s SWIPE’ algorithm, written in C
microsoft/UniSpeech — UniSpeech - Large Scale Self-Supervised Learning for Speech
facebookresearch/speech-resynthesis — An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. arXiv — not open source
flashlight/InferenceAndAlignmentCTC.ipynb
Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition, pdf
@inproceedings{ma21_interspeech,
author={Guodong Ma and Pengfei Hu and Jian Kang and Shen Huang and Hao Huang},
title={Leveraging {P}hone {M}ask {T}raining for {P}honetic-{R}eduction-{R}obust {E2E} {U}yghur {S}peech {R}ecognition},
year=2021,
booktitle={Proc. Interspeech 2021},
pages={306--310},
doi={10.21437/Interspeech.2021-964}
}
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training, pdf
@inproceedings{hsu21_interspeech,
author={Wei-Ning Hsu and Anuroop Sriram and Alexei Baevski and Tatiana Likhomanenko and Qiantong Xu and Vineel Pratap and Jacob Kahn and Ann Lee and Ronan Collobert and Gabriel Synnaeve and Michael Auli},
title={Robust wav2vec 2.0&: {A}nalyzing {D}omain {S}hift in {S}elf-{S}upervised {P}re-{T}raining},
year=2021,
booktitle={Proc. Interspeech 2021},
pages={721--725},
doi={10.21437/Interspeech.2021-236}
}
wav2vec-C: A Self-Supervised Model for Speech Representation Learning, pdf
@inproceedings{sadhu21_interspeech,
author={Samik Sadhu and Di He and Che-Wei Huang and Sri Harish Mallidi and Minhua Wu and Ariya Rastrow and Andreas Stolcke and Jasha Droppo and Roland Maas},
title={wav2vec-{C}: {A} {S}elf-{S}upervised {M}odel for {S}peech {R}epresentation {L}earning},
year=2021,
booktitle={Proc. Interspeech 2021},
pages={711--715},
doi={10.21437/Interspeech.2021-717}
}
audino A Modern Annotation Tool for Audio and Speech, midas-research/audino
Description d’un parler irlandais de Kerry
Getting to Know the Mel Spectrogram
The Most Important Music Theory And How It Helps You Play Better
Diatonic Triads | Diatonic 7th Chords |
---|---|
C: C E G B | Cmaj7 |
Dm: D F A C | Dm7 |
Em: E G B D | Em7 |
F: F A C E | Fmaj7 |
G: G B D F | G7 |
Am: A C E G | Am7 |
Bo: B D F A | Bm7b5 (Bø) |