Interesting links, 11/10/2024
Misc. interesting things.
Adapting WavLM for Speech Emotion Recognition
kaldiasr/kaldi
docker run -it --runtime=nvidia kaldiasr/kaldi:gpu-latest
DoDi’s Visual Basic 4 Decompiler
libyal — libraries for many obscure file formats
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching, code, model not open.
Even with a vast amount of data, the samples on their demo page still contain errors.
Google Research deleted EEG stuff
tracel-ai/burn — Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.
lutzroeder/netron — Visualizer for neural network, deep learning and machine learning models