Interesting links, 05/03/2024
Misc. interesting things.
Current
printf "file '%s'\n" *.wav > mylist.txt
ffmpeg -f concat -i mylist.txt -c copy output.mkv
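The printf one-liner above breaks if a file name contains a single quote, because the quote ends the quoted path in the list file early. A minimal sketch of a more defensive version follows, assuming a POSIX shell; `make_concat_list` is a hypothetical helper name, not something ffmpeg provides. It writes each quote as `'\''`, the escape that ffmpeg's quoting rules accept inside a single-quoted string.

```shell
#!/bin/sh
# Hypothetical helper: write an ffmpeg concat list for every .wav file
# in the current directory to the file named by the first argument.
make_concat_list() {
    list=$1
    : > "$list"                      # start with an empty list file
    for f in *.wav; do
        # Skip the unexpanded pattern when no .wav files exist.
        [ -e "$f" ] || continue
        # Inside a single-quoted path, a literal ' is written as '\''.
        escaped=$(printf '%s' "$f" | sed "s/'/'\\\\''/g")
        printf "file '%s'\n" "$escaped" >> "$list"
    done
}
```

After `make_concat_list mylist.txt`, the list can be passed to `ffmpeg -f concat -safe 0 -i mylist.txt -c copy output.mkv`. The `-safe 0` option is probably needed here, since the concat demuxer's default safety check rejects names containing spaces or quotes.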
The TORGO Database: Acoustic and articulatory speech from speakers with dysarthria
kyegomez/AudioFlamingo — Implementation of the model “AudioFlamingo” from the paper: “Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities”
Peppa Malac - Az álruha, Peppa Pig - Dressing Up
lucidrains/RETRO-pytorch — Implementation of RETRO, Deepmind’s Retrieval based Attention net, in Pytorch
The Illustrated Retrieval Transformer
SpiRit-LM: Interleaved Spoken and Written Language Model
espnet - Support external dataset library
DeAL: Decoding-time Alignment for Large Language Models
Welcome Aya-101 🚀
> Follows instructions in 101 languages!
> 12.9 B parameters
> Outperforms mT0 & Bloomz
> Released under Apache 2.0
> Training + Evaluation data released too!
> mt5-xxl architecture!
GG @CohereForAI ♥️
Model ckpt: https://t.co/FSK4E89YbV pic.twitter.com/1ykAHWOP8A
— Vaibhav (VB) Srivastav (@reach_vb) February 13, 2024
Textually Pretrained Speech Language Models, code, project
Master 6 French Tenses In Just 10 Minutes
Can’t stop watching this😂 pic.twitter.com/jlALwxpPPH
— Historic Vids (@historyinmemes) March 1, 2024
RTE 2FM Classic Irish Track Uncovered - Sally by Kerbdog with Cormac Battle
CNChTu/FCPE — Fast Context-based Pitch Estimation
Scalable Diffusion Models with Transformers
Meta, github: facebookresearch/DiT
, not open source.