Skip to content

Commit dd9b764

Browse files
authored
Merge pull request #1672 from iftaken/dev_hym
update README, test=doc
2 parents 6eed542 + bc23f62 commit dd9b764

File tree

2 files changed

+34
-2
lines changed

2 files changed

+34
-2
lines changed

README.md

Lines changed: 21 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -280,10 +280,14 @@ paddlespeech_client cls --server_ip 127.0.0.1 --port 8090 --input input.wav
280280
For more information about server command lines, please see: [speech server demos](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos/speech_server)
281281

282282

283+
<a name="ModelList"></a>
284+
283285
## Model List
284286

285287
PaddleSpeech supports a series of most popular models. They are summarized in [released models](./docs/source/released_model.md) and attached with available pretrained models.
286288

289+
<a name="SpeechToText"></a>
290+
287291
**Speech-to-Text** contains *Acoustic Model*, *Language Model*, and *Speech Translation*, with the following details:
288292

289293
<table style="width:100%">
@@ -357,6 +361,8 @@ PaddleSpeech supports a series of most popular models. They are summarized in [r
357361
</tbody>
358362
</table>
359363

364+
<a name="TextToSpeech"></a>
365+
360366
**Text-to-Speech** in PaddleSpeech mainly contains three modules: *Text Frontend*, *Acoustic Model* and *Vocoder*. Acoustic Model and Vocoder models are listed as follow:
361367

362368
<table>
@@ -473,6 +479,8 @@ PaddleSpeech supports a series of most popular models. They are summarized in [r
473479
</tbody>
474480
</table>
475481

482+
<a name="AudioClassification"></a>
483+
476484
**Audio Classification**
477485

478486
<table style="width:100%">
@@ -496,6 +504,8 @@ PaddleSpeech supports a series of most popular models. They are summarized in [r
496504
</tbody>
497505
</table>
498506

507+
<a name="SpeakerVerification"></a>
508+
499509
**Speaker Verification**
500510

501511
<table style="width:100%">
@@ -519,6 +529,8 @@ PaddleSpeech supports a series of most popular models. They are summarized in [r
519529
</tbody>
520530
</table>
521531

532+
<a name="PunctuationRestoration"></a>
533+
522534
**Punctuation Restoration**
523535

524536
<table style="width:100%">
@@ -559,10 +571,18 @@ Normally, [Speech SoTA](https://paperswithcode.com/area/speech), [Audio SoTA](ht
559571
- [Advanced Usage](./docs/source/tts/advanced_usage.md)
560572
- [Chinese Rule Based Text Frontend](./docs/source/tts/zh_text_frontend.md)
561573
- [Test Audio Samples](https://paddlespeech.readthedocs.io/en/latest/tts/demo.html)
574+
- Speaker Verification
575+
- [Audio Searching](./demos/audio_searching/README.md)
576+
- [Speaker Verification](./demos/speaker_verification/README.md)
562577
- [Audio Classification](./demos/audio_tagging/README.md)
563-
- [Speaker Verification](./demos/speaker_verification/README.md)
564578
- [Speech Translation](./demos/speech_translation/README.md)
579+
- [Speech Server](./demos/speech_server/README.md)
565580
- [Released Models](./docs/source/released_model.md)
581+
- [Speech-to-Text](#SpeechToText)
582+
- [Text-to-Speech](#TextToSpeech)
583+
- [Audio Classification](#AudioClassification)
584+
- [Speaker Verification](#SpeakerVerification)
585+
- [Punctuation Restoration](#PunctuationRestoration)
566586
- [Community](#Community)
567587
- [Welcome to contribute](#contribution)
568588
- [License](#License)

README_cn.md

Lines changed: 13 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -273,6 +273,8 @@ paddlespeech_client cls --server_ip 127.0.0.1 --port 8090 --input input.wav
273273
## 模型列表
274274
PaddleSpeech 支持很多主流的模型,并提供了预训练模型,详情请见[模型列表](./docs/source/released_model.md)
275275

276+
<a name="语音识别模型"></a>
277+
276278
PaddleSpeech 的 **语音转文本** 包含语音识别声学模型、语音识别语言模型和语音翻译, 详情如下:
277279

278280
<table style="width:100%">
@@ -347,6 +349,7 @@ PaddleSpeech 的 **语音转文本** 包含语音识别声学模型、语音识
347349
</table>
348350

349351
<a name="语音合成模型"></a>
352+
350353
PaddleSpeech 的 **语音合成** 主要包含三个模块:文本前端、声学模型和声码器。声学模型和声码器模型如下:
351354

352355
<table>
@@ -488,6 +491,8 @@ PaddleSpeech 的 **语音合成** 主要包含三个模块:文本前端、声
488491
</table>
489492

490493

494+
<a name="声纹识别模型"></a>
495+
491496
**声纹识别**
492497

493498
<table style="width:100%">
@@ -511,6 +516,8 @@ PaddleSpeech 的 **语音合成** 主要包含三个模块:文本前端、声
511516
</tbody>
512517
</table>
513518

519+
<a name="标点恢复模型"></a>
520+
514521
**标点恢复**
515522

516523
<table style="width:100%">
@@ -556,13 +563,18 @@ PaddleSpeech 的 **语音合成** 主要包含三个模块:文本前端、声
556563
- [进阶用法](./docs/source/tts/advanced_usage.md)
557564
- [中文文本前端](./docs/source/tts/zh_text_frontend.md)
558565
- [测试语音样本](https://paddlespeech.readthedocs.io/en/latest/tts/demo.html)
566+
- 声纹识别
567+
- [声纹识别](./demos/speaker_verification/README_cn.md)
568+
- [音频检索](./demos/audio_searching/README_cn.md)
559569
- [声音分类](./demos/audio_tagging/README_cn.md)
560-
- [声纹识别](./demos/speaker_verification/README_cn.md)
561570
- [语音翻译](./demos/speech_translation/README_cn.md)
571+
- [服务化部署](./demos/speech_server/README_cn.md)
562572
- [模型列表](#模型列表)
563573
- [语音识别](#语音识别模型)
564574
- [语音合成](#语音合成模型)
565575
- [声音分类](#声音分类模型)
576+
- [声纹识别](#声纹识别模型)
577+
- [标点恢复](#标点恢复模型)
566578
- [技术交流群](#技术交流群)
567579
- [欢迎贡献](#欢迎贡献)
568580
- [License](#License)

0 commit comments

Comments
 (0)