Interesting links, 6/9/2021
Misc. interesting things.
Kungbib/swedish-bert-models. Paper: Playing with Words at the National Library of Sweden – Making a Swedish BERT Huggingface: KBLab
NST Swedish Dictation (22 kHz)
SCRIBE - Spoken Corpus of British English
The available audio recordings and annotations were released on eleven CD-ROMs (labelled SCRIBE_0 to SCRIBE_11) in April
- These were originally distributed by the Speech Group at the National Physical Laboratory, but after this was closed down the disks were passed to the MOD Speech Research Unit at Malvern which passed the disks on to a private contractor (who kept them in his garage).
google-research/text-to-text-transfer-transformer
Scraping notes:
Cogg: Áiseanna Tacaíochta don Oideachas Speisialta, Bain Súp As, Leabhair Dhigiteacha
TG4: an-scoil/reamhobair, cursai-idirnaisiunta/reamhobair/, fadhbanna/reamhobair/, cursai-timpeallachta, cursai-airgid/mir-a-do, ponc/ponc-reamhobair
Club Leabhar: PODCHRAOLTAÍ LÉIRMHEASTÓIREACHTA AR LEABHAIR NA MÍOSA, AGALLAIMH ATÁ DÉANTA AGAINN LE HÚDAIR AGUS LE CRITICEOIRÍ LITEARTHA, Tintin mar charachtar an scéil