Speech Processing and Prosody

Published on 2019-10-08

Denis Jouvet

The prosody of the speech signal conveys information beyond the linguistic content of the message: it structures the utterance and also carries information about the speaker’s attitude and the speaker’s emoti

TSD 2019 - Ljubljana

Presentation

00:00 Speech Processing And Prosody

00:34 Speech Processing And Prosody - 2

02:09 Phone duration

03:16 Automatic speech-text alignment

05:14 Automatic speech-text alignment - 2

07:20 Example of speech segmentation

09:30 Analysis of final consonantal clusters

10:29 Comparing frequency estimations

12:00 Speech-text alignment - 3

14:23 Fundamental frequency (F0)

17:29 F0 detection – time domain

18:19 F0 detection – frequency domain

19:07 F0 detection – comments

20:01 Performance evaluation measures

21:24 Evaluation on clean data

22:15 Evaluation on clean data - 2

22:42 Evaluation on clean data - 3

23:33 Evaluation on simulated noisy data

24:52 Evaluation on simulated noisy data - 2

25:03 Voicing decision errors - 3

26:00 Evaluation on real noisy data

27:29 Comparing performance on real and simulated noisy data

28:09 Comparing performance on real and simulated noisy data - 2

28:38 F0 detection

29:55 Phone energy

31:04 Normalizing prosodic features

32:43 Confidence scoring

33:27 Outline

33:49 Computer assisted language learning

35:04 Precision of phone boundaries

36:14 Example of audio & textual prosodic feedback

37:33 Structuring speech utterances

38:21 Detection of prosodic boundaries

39:05 Examples of prosodic trees

40:01 Prosodic groups and punctuation

41:15 Sentence modality

42:20 Detection of sentence modality

44:17 Discourse particles

45:51 Speech corpora

46:38 Data annotation

47:23 Examples

47:41 DP / non-DP analysis for word “alors”

48:44 DP / non-DP with respect to speech type

49:11 Analysis of a few prosodic correlates

49:38 Frequency of occurrence of pauses before the word

50:42 Automatic classification and detection experiments

51:45 Automatic classification and detection using prosodic features

52:22 Automatic classification and detection using fundamental frequency values

52:39 Automatic classification and detection

52:54 F0 patterns

53:27 F0 patterns - 2

53:37 F0 patterns - 3

54:21 Expressive speech

56:05 Prosody of emotional speech

57:22 Expressive speech synthesis

58:24 Conclusion