NabuCasa/voice-datasets

espnet/owsm_v3 — OWSM v3 model for ESPnet.

basiralab/IMANGraphNet — Non-isomorphic Inter-modality Graph Alignment and Synthesis for Holistic Brain Mapping

eth-sri/astarix — AStarix: Fast and Optimal Sequence-to-Graph Aligner

IDEA-Research/GroundingDINO — Official implementation of the paper “Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection”

YODAS - 420k hours of speech

neurodata/goat — A paper (in progress) on graph matching via optimal transport. On arXiv at https://arxiv.org/abs/2111.05366

Quechua dialectal recordings


[Znachor](https://pl.wikisource.org/wiki/Znachor_(Do%C5%82%C4%99ga-Mostowicz,_1938)), Wolne Lektury

British phonetic transcriptions