Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation, code

@misc{bartelds2023making,
      title={Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation},
      author={Martijn Bartelds and Nay San and Bradley McDonnell and Dan Jurafsky and Martijn Wieling},
      year={2023},
      eprint={2305.10951},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

FLamE: Few-shot Learning from Natural Language Explanations

Vintage Voice Effect in Audacity

GenSim: Generative Models for Supersizing Robotic Simulation Tasks

JoseLlarena/Britfone — British English pronunciation dictionary

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech, code

@misc{nguyen2023xphonebert,
      title={XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech},
      author={Linh The Nguyen and Thinh Pham and Dat Quoc Nguyen},
      year={2023},
      eprint={2305.19709},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Wavelet Diffusion Models are fast and scalable Image Generators, code

@misc{phung2023wavelet,
      title={Wavelet Diffusion Models are fast and scalable Image Generators},
      author={Hao Phung and Quan Dao and Anh Tran},
      year={2023},
      eprint={2211.16152},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}