Interesting links, 08/07/2023
Misc. interesting things.
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation, code
@misc{bartelds2023making,
title={Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation},
author={Martijn Bartelds and Nay San and Bradley McDonnell and Dan Jurafsky and Martijn Wieling},
year={2023},
eprint={2305.10951},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
FLamE: Few-shot Learning from Natural Language Explanations
Vintage Voice Effect in Audacity
GenSim: Generative Models for Supersizing Robotic Simulation Tasks
JoseLlarena/Britfone — British English pronunciation dictionary
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech, code
@misc{nguyen2023xphonebert,
title={XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech},
author={Linh The Nguyen and Thinh Pham and Dat Quoc Nguyen},
year={2023},
eprint={2305.19709},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Wavelet Diffusion Models are fast and scalable Image Generators, code
@misc{phung2023wavelet,
title={Wavelet Diffusion Models are fast and scalable Image Generators},
author={Hao Phung and Quan Dao and Anh Tran},
year={2023},
eprint={2211.16152},
archivePrefix={arXiv},
primaryClass={cs.CV}
}