OOCDC 是一种基于维基百科数据的在线-离线结合的数据策展方法 (An Online-Offline Combined Data Curation Method Based on Wikipedia Data).
方法介绍
concept_dataset/
├── concept_data_info.json
├── data
├── extract_DEIE_data.py
├── index_cache.pkl
├…
推荐开源项目:CSS10——十种语言单说话人语音数据集 css10 CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages 项目地址: https://gitcode.com/gh_mirrors/cs/css10
项目介绍
CSS10(Collection of Single Speaker Speech…