5roop committed on
Commit
132d375
1 Parent(s): ffa29b0

Update README.md

Files changed (1)
  1. README.md +32 -1
README.md CHANGED
@@ -16,7 +16,38 @@ metrics:
 
  This model classifies individual 20ms frames of audio based on presence of filled pauses ("eee", "errm", ...).
 
- It was trained on human-annotated Slovenian speech corpus ROG-Artur and achieves F1 of 0.952868.
+ It was trained on the human-annotated Slovenian speech corpus ROG-Artur and achieves an F1 score of 0.952868 on the test split of the same dataset.
+
+ Evaluation on 800 human-annotated instances from ParlaSpeech-HR and ParlaSpeech-RS produced the following metrics:
+
+ ```
+ Performance on RS:
+ Classification report for human vs model on event level:
+               precision    recall  f1-score   support
+
+            0       0.97      0.87      0.92       234
+            1       0.95      0.99      0.97       542
+
+     accuracy                           0.95       776
+    macro avg       0.96      0.93      0.94       776
+ weighted avg       0.95      0.95      0.95       776
+
+ Performance on HR:
+ Classification report for human vs model on event level:
+               precision    recall  f1-score   support
+
+            0       0.94      0.84      0.89       242
+            1       0.93      0.98      0.95       531
+
+     accuracy                           0.93       773
+    macro avg       0.93      0.91      0.92       773
+ weighted avg       0.93      0.93      0.93       773
+ ```
+ The reported metrics are computed on the event level: if a true and a
+ predicted filled pause overlap at least partially, we count the pair
+ as one True Positive event.
+
+
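The partial-overlap criterion described above can be sketched roughly as follows. This is an illustration only, not the evaluation code behind the reported numbers: the `(start, end)` interval representation in seconds, the function names, and the example intervals are all assumptions, and the sketch counts matched events rather than reproducing the per-class report.

```python
from typing import List, Tuple

# Hypothetical representation: one filled pause = (start_s, end_s).
Interval = Tuple[float, float]


def overlaps(a: Interval, b: Interval) -> bool:
    """Return True if the two intervals share any stretch of time."""
    return a[0] < b[1] and b[0] < a[1]


def event_level_counts(true_events: List[Interval],
                       pred_events: List[Interval]) -> Tuple[int, int, int]:
    """Count TP, FP and FN events under the partial-overlap criterion."""
    # A true event is a hit if at least one prediction overlaps it.
    tp = sum(any(overlaps(t, p) for p in pred_events) for t in true_events)
    fn = len(true_events) - tp
    # A prediction that overlaps no true event is a false positive.
    fp = sum(not any(overlaps(p, t) for t in true_events) for p in pred_events)
    return tp, fp, fn


def precision_recall_f1(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Standard precision, recall and F1, guarding against empty denominators."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Made-up intervals: two true events are hit, one is missed,
# and one prediction matches nothing.
true_events = [(0.50, 0.90), (2.00, 2.40), (5.00, 5.30)]
pred_events = [(0.70, 1.10), (3.00, 3.20), (5.05, 5.25)]

tp, fp, fn = event_level_counts(true_events, pred_events)
precision, recall, f1 = precision_recall_f1(tp, fp, fn)
```

Any overlap, however short, counts as a match in this sketch, so it is more lenient than frame-level scoring, where every 20ms frame would have to agree.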
 
  # Example use:
  ```python