Kungbib/swedish-bert-models. Paper: Playing with Words at the National Library of Sweden – Making a Swedish BERT Huggingface: KBLab

NST Swedish Dictation (22 kHz)

SCRIBE - Spoken Corpus of British English

The available audio recordings and annotations were released on eleven CD-ROMs (labelled SCRIBE_0 to SCRIBE_11) in April

  1. These were originally distributed by the Speech Group at the National Physical Laboratory, but after this was closed down the disks were passed to the MOD Speech Research Unit at Malvern which passed the disks on to a private contractor (who kept them in his garage).

google/cld3

google-research/text-to-text-transfer-transformer

superb benchmark models


Scraping notes:

Gaelchultúr eolaire

Cogg: Áiseanna Tacaíochta don Oideachas Speisialta, Bain Súp As, Leabhair Dhigiteacha

TG4: an-scoil/reamhobair, cursai-idirnaisiunta/reamhobair/, fadhbanna/reamhobair/, cursai-timpeallachta, cursai-airgid/mir-a-do, ponc/ponc-reamhobair

Fís agus Foghlaim

Comhar

Coiscéim

Club Leabhar: PODCHRAOLTAÍ LÉIRMHEASTÓIREACHTA AR LEABHAIR NA MÍOSA, AGALLAIMH ATÁ DÉANTA AGAINN LE HÚDAIR AGUS LE CRITICEOIRÍ LITEARTHA, Tintin mar charachtar an scéil