Interesting links, 20/02/2022
Misc. interesting things.
autopilot-rs/autopy — A simple, cross-platform GUI automation module for Python and Rust.
JupyterLite: Jupyter ❤️ WebAssembly ❤️ Python
How we made Jupyter Notebooks collaborative with Yjs
VertaAI/modeldb — Open Source ML Model Versioning, Metadata, and Experiment Management
pfnet-research/sngan_projection
Spectral Normalization for Generative Adversarial Networks, code
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Spoken dialogue data collected in the WAXHOLM project, dataset, phoneset
NST Pronunciation Lexicon for Swedish
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
SlimIPL: Language-Model-Free Iterative Pseudo-Labeling
Joint Masked CPC and CTC Training for ASR
The Curious Case of Neural Text Degeneration
LibriVox - An Sgéaluidhe Gaedhealach by Dúbhglas de h-Íde, scan
mSLAM: Massively multilingual joint pre-training for speech and text
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
google-research/mozolm — MozoLM: A language model (LM) serving library
spaces/microsoft/wavlm-speaker-verification
Multistream CNN for Robust Acoustic Modeling, code, script
Star Temporal Classification: Sequence Classification with Partially Labeled Data
Differentiable Allophone Graphs for Language-Universal Speech Recognition, twitter thread
BPE-Dropout: Simple and Effective Subword Regularization
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies, code
FastPitchFormant: Source-Filter Based Decomposed Modeling for Speech Synthesis