Interesting links, 17/11/2021
Misc. interesting things.
lumaku/ctc-segmentation — Segment an audio file and obtain utterance alignments. (Python package)
Multilingual Transfer of Acoustic Word Embeddings Improves When Training on Languages Related to the Target Zero-Resource Language, pdf
@inproceedings{jacobs21_interspeech,
author={Christiaan Jacobs and Herman Kamper},
title={Multilingual {T}ransfer of {A}coustic {W}ord {E}mbeddings {I}mproves when {T}raining on {L}anguages {R}elated to the {T}arget {Z}ero-{R}esource {L}anguage},
year=2021,
booktitle={Proc. Interspeech 2021},
pages={1549--1553},
doi={10.21437/Interspeech.2021-461}
}
Towards Unsupervised Phone and Word Segmentation Using Self-Supervised Vector-Quantized Neural Networks, pdf
@inproceedings{kamper21_interspeech,
author={Herman Kamper and Benjamin van Niekerk},
title={Towards {U}nsupervised {P}hone and {W}ord {S}egmentation Using {S}elf-{S}upervised {V}ector-{Q}uantized {N}eural {N}etworks},
year=2021,
booktitle={Proc. Interspeech 2021},
pages={1539--1543},
doi={10.21437/Interspeech.2021-50}
}
bshall/VectorQuantizedCPC — Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
worldveil/dejavu — Audio fingerprinting and recognition in Python
Self-Supervised End-to-End ASR for Low Resource L2 Swedish, pdf, data to appear in Kielipankki
@inproceedings{alghezi21_interspeech,
author={Ragheb Al-Ghezi and Yaroslav Getman and Aku Rouhe and Raili Hildén and Mikko Kurimo},
title={Self-{S}upervised {E}nd-to-{E}nd {ASR} for {L}ow {R}esource {L2} {S}wedish},
year=2021,
booktitle={Proc. Interspeech 2021},
pages={1429--1433},
doi={10.21437/Interspeech.2021-1710}
}
xinjli/allosaurus — Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
cldf/cldf[https://github.com/cldf/cldf] CLDF — Cross-Linguistic Data Formats
kylebgorman/perceptronix — Sparse and dense linear models, for C++ and Python, with funny optimizations
AI - Here for Good — National Artificial Intelligence Strategy for Ireland
neulab/awesome-align — A neural word aligner based on multilingual BERT
xinjli/allosaurus — Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
neuspell/neuspell — A Neural Spelling Correction Toolkit
Gender in Irish between continuity and change
Re-open
kaldi-long-audio-alignment/build-trigram.sh
voxpopuli/voxpopuli/segmentation
voxpopuli/get_segment_pyannote_speaker.py
amsehili/auditok — An audio/acoustic activity detection and audio segmentation tool
kaldi/make_biased_lm_graphs.sh at master
kaldi/learn_lexicon_greedy.sh at master
kaldi/egs/wsj/s5/steps/segmentation at master
Wymysorys
A Andrason and T Krol WYMYSORYS GRAMMAR
Language attitudes in Wilamowice part 2 wym
Józef Gara - Słownik języka wilamowskiego
Józef Gara - Zbiór wierszy o wilamowskich obrzędach i obyczajach.pdf
Vilamovian terms with IPA pronunciation - Wiktionary