w11wo commited on
Commit
3d6215f
1 Parent(s): a7dce86

Added Models

Browse files
README.md CHANGED
@@ -1,3 +1,37 @@
1
  ---
 
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language: id
3
  license: apache-2.0
4
+ tags:
5
+ - icefall
6
+ - sherpa-ncnn
7
+ - phoneme-recognition
8
+ - automatic-speech-recognition
9
+ datasets:
10
+ - mozilla-foundation/common_voice_13_0
11
+ - indonesian-nlp/librivox-indonesia
12
+ - google/fleurs
13
  ---
14
+
15
+ # Sherpa-ncnn Pruned Stateless Zipformer RNN-T Streaming ID
16
+
17
+ Sherpa-ncnn Pruned Stateless Zipformer RNN-T Streaming ID is an automatic speech recognition model trained on the following datasets:
18
+
19
+ - [Common Voice ID](https://huggingface.co/datasets/mozilla-foundation/common_voice_13_0)
20
+ - [LibriVox Indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia)
21
+ - [FLEURS ID](https://huggingface.co/datasets/google/fleurs)
22
+
23
+ Instead of being trained to predict sequences of words, this model was trained to predict sequence of phonemes, e.g. `['p', 'ə', 'r', 'b', 'u', 'a', 't', 'a', 'n', 'ɲ', 'a']`. Therefore, the model's [vocabulary](https://huggingface.co/bookbot/pruned-transducer-stateless7-streaming-id/blob/main/data/lang_phone/tokens.txt) contains the different IPA phonemes found in [g2p ID](https://github.com/bookbot-kids/g2p_id).
24
+
25
+ This model was converted from the TorchScript version of [Pruned Stateless Zipformer RNN-T Streaming ID](https://huggingface.co/bookbot/pruned-transducer-stateless7-streaming-id) to ncnn format.
26
+
27
+ ## Converting from TorchScript
28
+
29
+ Refer to the [official instructions](https://icefall.readthedocs.io/en/latest/model-export/export-ncnn-zipformer.html) for conversion to ncnn, which includes installation of `csukuangfj`'s [ncnn](https://github.com/csukuangfj/ncnn) fork.
30
+
31
+ ## Frameworks
32
+
33
+ - [k2](https://github.com/k2-fsa/k2)
34
+ - [icefall](https://github.com/bookbot-hive/icefall)
35
+ - [lhotse](https://github.com/bookbot-hive/lhotse)
36
+ - [sherpa-ncnn](https://github.com/k2-fsa/sherpa-ncnn)
37
+ - [ncnn](https://github.com/csukuangfj/ncnn)
decoder_jit_trace-pnnx.ncnn.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:40f08148bbb20bb7f7160ce2419dbd9603b68dbdb3873354ae57a9390f095112
3
+ size 41992
decoder_jit_trace-pnnx.ncnn.param ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ 7767517
2
+ 6 6
3
+ Input in0 0 1 in0
4
+ Embed embed_1 1 1 in0 1 0=512 1=33 2=0 3=16896
5
+ Permute permute_2 1 1 1 2 0=1
6
+ ConvolutionDepthWise1D convdw1d_4 1 1 2 3 0=512 1=2 2=1 3=1 4=0 5=0 6=4096 7=128
7
+ Permute permute_3 1 1 3 4 0=1
8
+ ReLU relu_0 1 1 4 out0
encoder_jit_trace-pnnx.ncnn.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:35390708cdae1a2685132fb166724da46cec964f9c922701c4ae6211a5c9ee17
3
+ size 139191256
encoder_jit_trace-pnnx.ncnn.param ADDED
The diff for this file is too large to render. See raw diff
 
joiner_jit_trace-pnnx.ncnn.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6dc86722065c4fcf7d6fd49750da297b36178705e6594a8aefb8a01ac0bdd655
3
+ size 955536
joiner_jit_trace-pnnx.ncnn.param ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ 7767517
2
+ 7 7
3
+ Input in0 0 1 in0
4
+ Input in1 0 1 in1
5
+ InnerProduct linear_2 1 1 in1 2 0=512 1=1 2=262144
6
+ InnerProduct linear_1 1 1 in0 3 0=512 1=1 2=196608
7
+ BinaryOp add_0 2 1 3 2 4 0=0
8
+ TanH tanh_0 1 1 4 5
9
+ InnerProduct linear_3 1 1 5 out0 0=33 1=1 2=16896
tokens.txt ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ <eps> 0
2
+ ɡ 1
3
+ o 2
4
+ d 3
5
+ ʃ 4
6
+ v 5
7
+ t 6
8
+ <UNK> 7
9
+ x 8
10
+ r 9
11
+ ʔ 10
12
+ b 11
13
+ s 12
14
+ p 13
15
+ i 14
16
+ dʒ 15
17
+ | 16
18
+ ə 17
19
+ z 18
20
+ f 19
21
+ n 20
22
+ m 21
23
+ ɲ 22
24
+ tʃ 23
25
+ ŋ 24
26
+ k 25
27
+ j 26
28
+ l 27
29
+ h 28
30
+ w 29
31
+ a 30
32
+ u 31
33
+ e 32
34
+ #0 33