Grapheme-to-Phoneme Transduction for Cross-Language ASR, preprint

uiuc-sst/g2ps

Zero-shot Cross-Lingual Phonetic Recognition with External Language Embedding

tkipf/gcn — Implementation of Graph Convolutional Networks in TensorFlow

hpcaitech/ColossalAI

fairseq - add TTS

mgaido91/FBK-fairseq-ST

lumaku/ctc-segmentation — Segment an audio file and obtain utterance alignments

microsoft/unilm

microsoft/layoutxlm-base

microsoft/icecaps — Intelligent Conversation Engine: Code and Pre-trained Systems. Version 0.2.0.

chenzhuo1011/libri_css — Continuous speech separation

microsoft/UniSpeech — UniSpeech - Large Scale Self-Supervised Learning for Speech, transformers