openvino-ci commited on
Commit
19f6d87
1 Parent(s): ac6ec73

Upload folder using huggingface_hub

Browse files
README.md CHANGED
@@ -1,99 +1,65 @@
1
  ---
2
  license: mit
3
- language:
4
- - en
5
  ---
6
-
7
  # dolly-v2-3b-int8-ov
8
-
9
- * Model creator: [Databricks](https://huggingface.co/databricks)
10
  * Original model: [dolly-v2-3b](https://huggingface.co/databricks/dolly-v2-3b)
11
 
12
  ## Description
13
-
14
- This is [dolly-v2-3b](https://huggingface.co/databricks/dolly-v2-3b) model converted to the [OpenVINO™ IR](https://docs.openvino.ai/2024/documentation/openvino-ir-format.html) (Intermediate Representation) format with weights compressed to int8 by [NNCF](https://github.com/openvinotoolkit/nncf).
15
 
16
  ## Quantization Parameters
17
 
18
  Weight compression was performed using `nncf.compress_weights` with the following parameters:
19
 
20
- * mode: **INT8_ASYM**
21
- * sensitivity_metric: **weight_quantization_error**
22
 
23
  For more information on quantization, check the [OpenVINO model optimization guide](https://docs.openvino.ai/2024/openvino-workflow/model-optimization-guide/weight-compression.html).
24
 
 
25
  ## Compatibility
26
 
27
  The provided OpenVINO™ IR model is compatible with:
28
 
29
- * OpenVINO version 2024.2.0 and higher
30
- * Optimum Intel 1.17.0 and higher
31
 
32
- ## Running Model Inference with [Optimum Intel](https://huggingface.co/docs/optimum/intel/index)
33
 
34
  1. Install packages required for using [Optimum Intel](https://huggingface.co/docs/optimum/intel/index) integration with the OpenVINO backend:
35
 
36
- ```
37
- pip install optimum[openvino]
38
- ```
39
-
40
- 2. Run model inference:
41
-
42
- ```
43
- from transformers import AutoTokenizer
44
- from optimum.intel.openvino import OVModelForCausalLM
45
-
46
- model_id = "OpenVINO/dolly-v2-3b-int8-ov"
47
- tokenizer = AutoTokenizer.from_pretrained(model_id)
48
- model = OVModelForCausalLM.from_pretrained(model_id)
49
-
50
- inputs = tokenizer("What is OpenVINO?", return_tensors="pt")
51
-
52
- outputs = model.generate(**inputs, max_length=200)
53
- text = tokenizer.batch_decode(outputs)[0]
54
- print(text)
55
- ```
56
-
57
- For more examples and possible optimizations, refer to the [OpenVINO Large Language Model Inference Guide](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide.html).
58
-
59
- ## Running Model Inference with [OpenVINO GenAI](https://github.com/openvinotoolkit/openvino.genai)
60
-
61
- 1. Install packages required for using OpenVINO GenAI.
62
- ```
63
- pip install openvino-genai huggingface_hub
64
  ```
65
-
66
- 2. Download model from HuggingFace Hub
67
-
68
  ```
69
- import huggingface_hub as hf_hub
70
-
71
- model_id = "OpenVINO/dolly-v2-3b-int8-ov"
72
- model_path = "dolly-v2-3b-int8-ov"
73
 
74
- hf_hub.snapshot_download(model_id, local_dir=model_path)
75
 
76
  ```
 
 
77
 
78
- 3. Run model inference:
 
 
79
 
80
- ```
81
- import openvino_genai as ov_genai
82
 
83
- device = "CPU"
84
- pipe = ov_genai.LLMPipeline(model_path, device)
85
- print(pipe.generate("What is OpenVINO?", max_length=200))
86
  ```
87
 
88
- More GenAI usage examples can be found in OpenVINO GenAI library [docs](https://github.com/openvinotoolkit/openvino.genai/blob/master/src/README.md) and [samples](https://github.com/openvinotoolkit/openvino.genai?tab=readme-ov-file#openvino-genai-samples)
89
 
90
  ## Limitations
91
 
92
- Check the original model card for [limitations](https://huggingface.co/databricks/dolly-v2-3b#known-limitations).
93
 
94
  ## Legal information
95
 
96
- The original model is distributed under [MIT](https://choosealicense.com/licenses/mit/) license. More details can be found in [original model card](https://huggingface.co/databricks/dolly-v2-3b).
97
 
98
  ## Disclaimer
99
 
 
1
  ---
2
  license: mit
3
+ license_link: https://choosealicense.com/licenses/mit/
 
4
  ---
 
5
  # dolly-v2-3b-int8-ov
6
+ * Model creator: [Databricks](https://huggingface.co/databricks)
 
7
  * Original model: [dolly-v2-3b](https://huggingface.co/databricks/dolly-v2-3b)
8
 
9
  ## Description
10
+ This is [dolly-v2-3b](https://huggingface.co/databricks/dolly-v2-3b) model converted to the [OpenVINO™ IR](https://docs.openvino.ai/2024/documentation/openvino-ir-format.html) (Intermediate Representation) format with weights compressed to INT8 by [NNCF](https://github.com/openvinotoolkit/nncf).
 
11
 
12
  ## Quantization Parameters
13
 
14
  Weight compression was performed using `nncf.compress_weights` with the following parameters:
15
 
16
+ * mode: **int8_asym**
17
+ * ratio: **1**
18
 
19
  For more information on quantization, check the [OpenVINO model optimization guide](https://docs.openvino.ai/2024/openvino-workflow/model-optimization-guide/weight-compression.html).
20
 
21
+
22
  ## Compatibility
23
 
24
  The provided OpenVINO™ IR model is compatible with:
25
 
26
+ * OpenVINO version 2024.4.0 and higher
27
+ * Optimum Intel 1.20.0 and higher
28
 
29
+ ## Running Model Inference
30
 
31
  1. Install packages required for using [Optimum Intel](https://huggingface.co/docs/optimum/intel/index) integration with the OpenVINO backend:
32
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  ```
34
+ pip install optimum[openvino]
 
 
35
  ```
 
 
 
 
36
 
37
+ 2. Run model inference:
38
 
39
  ```
40
+ from transformers import AutoTokenizer
41
+ from optimum.intel.openvino import OVModelForCausalLM
42
 
43
+ model_id = "OpenVINO/dolly-v2-3b-int8-ov"
44
+ tokenizer = AutoTokenizer.from_pretrained(model_id)
45
+ model = OVModelForCausalLM.from_pretrained(model_id)
46
 
47
+ inputs = tokenizer("What is OpenVINO?", return_tensors="pt")
 
48
 
49
+ outputs = model.generate(**inputs, max_length=200)
50
+ text = tokenizer.batch_decode(outputs)[0]
51
+ print(text)
52
  ```
53
 
54
+ For more examples and possible optimizations, refer to the [OpenVINO Large Language Model Inference Guide](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide.html).
55
 
56
  ## Limitations
57
 
58
+ Check the original model card for [original model card](https://huggingface.co/databricks/dolly-v2-3b) for limitations.
59
 
60
  ## Legal information
61
 
62
+ The original model is distributed under [mit](https://choosealicense.com/licenses/mit/) license. More details can be found in [original model card](https://huggingface.co/databricks/dolly-v2-3b).
63
 
64
  ## Disclaimer
65
 
config.json CHANGED
@@ -25,13 +25,15 @@
25
  "model_type": "gpt_neox",
26
  "num_attention_heads": 32,
27
  "num_hidden_layers": 32,
 
28
  "rope_scaling": null,
 
29
  "rotary_emb_base": 10000,
30
  "rotary_pct": 0.25,
31
  "tie_word_embeddings": false,
32
- "torch_dtype": "float32",
33
- "transformers_version": "4.40.1",
34
  "use_cache": true,
35
  "use_parallel_residual": true,
36
  "vocab_size": 50280
37
- }
 
25
  "model_type": "gpt_neox",
26
  "num_attention_heads": 32,
27
  "num_hidden_layers": 32,
28
+ "partial_rotary_factor": 0.25,
29
  "rope_scaling": null,
30
+ "rope_theta": 10000,
31
  "rotary_emb_base": 10000,
32
  "rotary_pct": 0.25,
33
  "tie_word_embeddings": false,
34
+ "torch_dtype": "bfloat16",
35
+ "transformers_version": "4.45.2",
36
  "use_cache": true,
37
  "use_parallel_residual": true,
38
  "vocab_size": 50280
39
+ }
generation_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "_from_model_config": true,
3
+ "bos_token_id": 0,
4
+ "eos_token_id": 0,
5
+ "transformers_version": "4.45.2"
6
+ }
openvino_detokenizer.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e3d0218341805b3876fc9c8e95c98d75cc9ddd0fa34fac6df212e790b6f91a08
3
- size 558494
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f1e43770f23d5b9dbfc8bf99bbea4fe501870adf36235dff20156f6c0a129a47
3
+ size 514078
openvino_detokenizer.xml CHANGED
@@ -1,16 +1,16 @@
1
  <?xml version="1.0"?>
2
  <net name="detokenizer" version="11">
3
  <layers>
4
- <layer id="0" name="Parameter_191" type="Parameter" version="opset1">
5
  <data shape="?,?" element_type="i64" />
6
  <output>
7
- <port id="0" precision="I64" names="Parameter_191">
8
  <dim>-1</dim>
9
  <dim>-1</dim>
10
  </port>
11
  </output>
12
  </layer>
13
- <layer id="1" name="Convert_207" type="Convert" version="opset1">
14
  <data destination_type="i32" />
15
  <input>
16
  <port id="0" precision="I64">
@@ -25,19 +25,19 @@
25
  </port>
26
  </output>
27
  </layer>
28
- <layer id="2" name="Constant_89" type="Const" version="opset1">
29
- <data element_type="u8" shape="558445" offset="0" size="558445" />
30
  <output>
31
  <port id="0" precision="U8">
32
- <dim>558445</dim>
33
  </port>
34
  </output>
35
  </layer>
36
- <layer id="3" name="StringTensorUnpack_90" type="StringTensorUnpack" version="extension">
37
  <data mode="begins_ends" />
38
  <input>
39
  <port id="0" precision="U8">
40
- <dim>558445</dim>
41
  </port>
42
  </input>
43
  <output>
@@ -52,7 +52,7 @@
52
  </port>
53
  </output>
54
  </layer>
55
- <layer id="4" name="VocabDecoder_192" type="VocabDecoder" version="extension">
56
  <data skip_tokens="0, 1, 50277, 50278, 50279" />
57
  <input>
58
  <port id="0" precision="I32">
@@ -87,7 +87,7 @@
87
  </port>
88
  </output>
89
  </layer>
90
- <layer id="5" name="CharsToBytes_193" type="CharsToBytes" version="extension">
91
  <input>
92
  <port id="0" precision="I32">
93
  <dim>-1</dim>
@@ -101,70 +101,17 @@
101
  <port id="3" precision="I32">
102
  <dim>-1</dim>
103
  </port>
104
- <port id="4" precision="U8">
105
- <dim>-1</dim>
106
- </port>
107
  </input>
108
  <output>
109
- <port id="5" precision="I32">
110
- <dim>-1</dim>
111
- </port>
112
- <port id="6" precision="I32">
113
- <dim>-1</dim>
114
- </port>
115
- <port id="7" precision="U8">
116
- <dim>-1</dim>
117
- </port>
118
- </output>
119
- </layer>
120
- <layer id="6" name="Constant_195" type="Const" version="opset1">
121
- <data element_type="u8" shape="47" offset="558445" size="47" />
122
- <output>
123
- <port id="0" precision="U8">
124
- <dim>47</dim>
125
- </port>
126
- </output>
127
- </layer>
128
- <layer id="7" name="Constant_197" type="Const" version="opset1">
129
- <data element_type="u8" shape="2" offset="558492" size="2" />
130
- <output>
131
- <port id="0" precision="U8">
132
- <dim>2</dim>
133
- </port>
134
- </output>
135
- </layer>
136
- <layer id="8" name="RegexNormalization_198" type="RegexNormalization" version="extension">
137
- <data global_replace="true" />
138
- <input>
139
- <port id="0" precision="I32">
140
- <dim>-1</dim>
141
- </port>
142
- <port id="1" precision="I32">
143
- <dim>-1</dim>
144
- </port>
145
- <port id="2" precision="U8">
146
  <dim>-1</dim>
147
  </port>
148
- <port id="3" precision="U8">
149
- <dim>47</dim>
150
- </port>
151
- <port id="4" precision="U8">
152
- <dim>2</dim>
153
- </port>
154
- </input>
155
- <output>
156
  <port id="5" precision="I32">
157
  <dim>-1</dim>
158
  </port>
159
- <port id="6" precision="I32">
160
- <dim>-1</dim>
161
- </port>
162
- <port id="7" precision="U8">
163
- <dim>-1</dim>
164
- </port>
165
  </output>
166
  </layer>
167
- <layer id="9" name="StringTensorPack_199" type="StringTensorPack" version="extension">
168
  <data mode="begins_ends" />
169
  <input>
170
  <port id="0" precision="I32">
@@ -183,7 +130,7 @@
183
  </port>
184
  </output>
185
  </layer>
186
- <layer id="10" name="Result_200" type="Result" version="opset1">
187
  <input>
188
  <port id="0" precision="STRING">
189
  <dim>-1</dim>
@@ -198,22 +145,19 @@
198
  <edge from-layer="3" from-port="1" to-layer="4" to-port="1" />
199
  <edge from-layer="3" from-port="2" to-layer="4" to-port="2" />
200
  <edge from-layer="3" from-port="3" to-layer="4" to-port="3" />
201
- <edge from-layer="4" from-port="8" to-layer="5" to-port="4" />
202
- <edge from-layer="4" from-port="7" to-layer="5" to-port="3" />
203
- <edge from-layer="4" from-port="6" to-layer="5" to-port="2" />
204
- <edge from-layer="4" from-port="5" to-layer="5" to-port="1" />
205
  <edge from-layer="4" from-port="4" to-layer="5" to-port="0" />
206
- <edge from-layer="5" from-port="5" to-layer="8" to-port="0" />
207
- <edge from-layer="5" from-port="6" to-layer="8" to-port="1" />
208
- <edge from-layer="5" from-port="7" to-layer="8" to-port="2" />
209
- <edge from-layer="6" from-port="0" to-layer="8" to-port="3" />
210
- <edge from-layer="7" from-port="0" to-layer="8" to-port="4" />
211
- <edge from-layer="8" from-port="5" to-layer="9" to-port="0" />
212
- <edge from-layer="8" from-port="6" to-layer="9" to-port="1" />
213
- <edge from-layer="8" from-port="7" to-layer="9" to-port="2" />
214
- <edge from-layer="9" from-port="3" to-layer="10" to-port="0" />
215
  </edges>
216
  <rt_info>
 
217
  <eos_token_id value="0" />
 
 
218
  </rt_info>
219
  </net>
 
1
  <?xml version="1.0"?>
2
  <net name="detokenizer" version="11">
3
  <layers>
4
+ <layer id="0" name="Parameter_62647" type="Parameter" version="opset1">
5
  <data shape="?,?" element_type="i64" />
6
  <output>
7
+ <port id="0" precision="I64" names="Parameter_62647">
8
  <dim>-1</dim>
9
  <dim>-1</dim>
10
  </port>
11
  </output>
12
  </layer>
13
+ <layer id="1" name="Convert_62658" type="Convert" version="opset1">
14
  <data destination_type="i32" />
15
  <input>
16
  <port id="0" precision="I64">
 
25
  </port>
26
  </output>
27
  </layer>
28
+ <layer id="2" name="Constant_62617" type="Const" version="opset1">
29
+ <data element_type="u8" shape="514078" offset="0" size="514078" />
30
  <output>
31
  <port id="0" precision="U8">
32
+ <dim>514078</dim>
33
  </port>
34
  </output>
35
  </layer>
36
+ <layer id="3" name="StringTensorUnpack_62618" type="StringTensorUnpack" version="extension">
37
  <data mode="begins_ends" />
38
  <input>
39
  <port id="0" precision="U8">
40
+ <dim>514078</dim>
41
  </port>
42
  </input>
43
  <output>
 
52
  </port>
53
  </output>
54
  </layer>
55
+ <layer id="4" name="VocabDecoder_62648" type="VocabDecoder" version="extension">
56
  <data skip_tokens="0, 1, 50277, 50278, 50279" />
57
  <input>
58
  <port id="0" precision="I32">
 
87
  </port>
88
  </output>
89
  </layer>
90
+ <layer id="5" name="FuzeRagged_62649" type="FuzeRagged" version="extension">
91
  <input>
92
  <port id="0" precision="I32">
93
  <dim>-1</dim>
 
101
  <port id="3" precision="I32">
102
  <dim>-1</dim>
103
  </port>
 
 
 
104
  </input>
105
  <output>
106
+ <port id="4" precision="I32">
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
107
  <dim>-1</dim>
108
  </port>
 
 
 
 
 
 
 
 
109
  <port id="5" precision="I32">
110
  <dim>-1</dim>
111
  </port>
 
 
 
 
 
 
112
  </output>
113
  </layer>
114
+ <layer id="6" name="StringTensorPack_62650" type="StringTensorPack" version="extension">
115
  <data mode="begins_ends" />
116
  <input>
117
  <port id="0" precision="I32">
 
130
  </port>
131
  </output>
132
  </layer>
133
+ <layer id="7" name="Result_62651" type="Result" version="opset1">
134
  <input>
135
  <port id="0" precision="STRING">
136
  <dim>-1</dim>
 
145
  <edge from-layer="3" from-port="1" to-layer="4" to-port="1" />
146
  <edge from-layer="3" from-port="2" to-layer="4" to-port="2" />
147
  <edge from-layer="3" from-port="3" to-layer="4" to-port="3" />
 
 
 
 
148
  <edge from-layer="4" from-port="4" to-layer="5" to-port="0" />
149
+ <edge from-layer="4" from-port="5" to-layer="5" to-port="1" />
150
+ <edge from-layer="4" from-port="6" to-layer="5" to-port="2" />
151
+ <edge from-layer="4" from-port="7" to-layer="5" to-port="3" />
152
+ <edge from-layer="4" from-port="8" to-layer="6" to-port="2" />
153
+ <edge from-layer="5" from-port="4" to-layer="6" to-port="0" />
154
+ <edge from-layer="5" from-port="5" to-layer="6" to-port="1" />
155
+ <edge from-layer="6" from-port="3" to-layer="7" to-port="0" />
 
 
156
  </edges>
157
  <rt_info>
158
+ <bos_token_id value="0" />
159
  <eos_token_id value="0" />
160
+ <original_tokenizer_class value="&lt;class 'transformers.models.gpt_neox.tokenization_gpt_neox_fast.GPTNeoXTokenizerFast'>" />
161
+ <pad_token_id value="0" />
162
  </rt_info>
163
  </net>
openvino_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:de5171039a87f9b7585e4f2e5fe335e80663abb5ff88770efe4612b912d59ee5
3
- size 2783023114
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2e6603c8164e579a9cd59c84e14449d68fb63e697917c93d864f5fd7427ad25f
3
+ size 2779990920
openvino_model.xml CHANGED
The diff for this file is too large to render. See raw diff
 
openvino_tokenizer.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bf4bb4307e428d32680d2973f34247c51210fad1f4c6408b2087dfdfb053e210
3
- size 1166376
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c378d88077ae7c7e13ef61745e1ceef76412338e9a7398445c09632413e52abe
3
+ size 1227935
openvino_tokenizer.xml CHANGED
@@ -1,21 +1,21 @@
1
  <?xml version="1.0"?>
2
  <net name="tokenizer" version="11">
3
  <layers>
4
- <layer id="0" name="Parameter_1" type="Parameter" version="opset1">
5
  <data shape="?" element_type="string" />
6
  <output>
7
- <port id="0" precision="STRING" names="Parameter_1">
8
  <dim>-1</dim>
9
  </port>
10
  </output>
11
  </layer>
12
- <layer id="1" name="Constant_8" type="Const" version="opset1">
13
  <data element_type="i64" shape="" offset="0" size="8" />
14
  <output>
15
  <port id="0" precision="I64" />
16
  </output>
17
  </layer>
18
- <layer id="2" name="StringTensorUnpack_2" type="StringTensorUnpack" version="extension">
19
  <data mode="begins_ends" />
20
  <input>
21
  <port id="0" precision="STRING">
@@ -34,32 +34,7 @@
34
  </port>
35
  </output>
36
  </layer>
37
- <layer id="3" name="NormalizeUnicode_3" type="NormalizeUnicode" version="extension">
38
- <data normalization_form="NFC" />
39
- <input>
40
- <port id="0" precision="I32">
41
- <dim>-1</dim>
42
- </port>
43
- <port id="1" precision="I32">
44
- <dim>-1</dim>
45
- </port>
46
- <port id="2" precision="U8">
47
- <dim>-1</dim>
48
- </port>
49
- </input>
50
- <output>
51
- <port id="3" precision="I32">
52
- <dim>-1</dim>
53
- </port>
54
- <port id="4" precision="I32">
55
- <dim>-1</dim>
56
- </port>
57
- <port id="5" precision="U8">
58
- <dim>-1</dim>
59
- </port>
60
- </output>
61
- </layer>
62
- <layer id="4" name="ShapeOf_4" type="ShapeOf" version="opset3">
63
  <data output_type="i64" />
64
  <input>
65
  <port id="0" precision="I32">
@@ -72,19 +47,19 @@
72
  </port>
73
  </output>
74
  </layer>
75
- <layer id="5" name="Constant_5" type="Const" version="opset1">
76
  <data element_type="i64" shape="" offset="0" size="8" />
77
  <output>
78
  <port id="0" precision="I64" />
79
  </output>
80
  </layer>
81
- <layer id="6" name="Constant_6" type="Const" version="opset1">
82
  <data element_type="i64" shape="" offset="0" size="8" />
83
  <output>
84
  <port id="0" precision="I64" />
85
  </output>
86
  </layer>
87
- <layer id="7" name="Gather_7" type="Gather" version="opset8">
88
  <data batch_dims="0" />
89
  <input>
90
  <port id="0" precision="I64">
@@ -97,13 +72,13 @@
97
  <port id="3" precision="I64" />
98
  </output>
99
  </layer>
100
- <layer id="8" name="Constant_9" type="Const" version="opset1">
101
  <data element_type="i64" shape="" offset="8" size="8" />
102
  <output>
103
  <port id="0" precision="I64" />
104
  </output>
105
  </layer>
106
- <layer id="9" name="Range_10" type="Range" version="opset4">
107
  <data output_type="i32" />
108
  <input>
109
  <port id="0" precision="I64" />
@@ -116,19 +91,19 @@
116
  </port>
117
  </output>
118
  </layer>
119
- <layer id="10" name="Constant_12" type="Const" version="opset1">
120
  <data element_type="i64" shape="" offset="8" size="8" />
121
  <output>
122
  <port id="0" precision="I64" />
123
  </output>
124
  </layer>
125
- <layer id="11" name="Constant_13" type="Const" version="opset1">
126
  <data element_type="i64" shape="" offset="8" size="8" />
127
  <output>
128
  <port id="0" precision="I64" />
129
  </output>
130
  </layer>
131
- <layer id="12" name="Add_14" type="Add" version="opset1">
132
  <data auto_broadcast="numpy" />
133
  <input>
134
  <port id="0" precision="I64" />
@@ -138,13 +113,13 @@
138
  <port id="2" precision="I64" />
139
  </output>
140
  </layer>
141
- <layer id="13" name="Constant_15" type="Const" version="opset1">
142
  <data element_type="i64" shape="" offset="8" size="8" />
143
  <output>
144
  <port id="0" precision="I64" />
145
  </output>
146
  </layer>
147
- <layer id="14" name="Range_16" type="Range" version="opset4">
148
  <data output_type="i32" />
149
  <input>
150
  <port id="0" precision="I64" />
@@ -157,16 +132,15 @@
157
  </port>
158
  </output>
159
  </layer>
160
- <layer id="15" name="Constant_79" type="Const" version="opset1">
161
- <data element_type="u8" shape="673" offset="16" size="673" />
162
  <output>
163
  <port id="0" precision="U8">
164
- <dim>673</dim>
165
  </port>
166
  </output>
167
  </layer>
168
- <layer id="16" name="RegexSplit_80" type="RegexSplit" version="extension">
169
- <data behaviour="isolate" invert="false" max_splits="-1" />
170
  <input>
171
  <port id="0" precision="I32">
172
  <dim>-1</dim>
@@ -184,7 +158,7 @@
184
  <dim>-1</dim>
185
  </port>
186
  <port id="5" precision="U8">
187
- <dim>673</dim>
188
  </port>
189
  </input>
190
  <output>
@@ -203,45 +177,13 @@
203
  <port id="10" precision="U8">
204
  <dim>-1</dim>
205
  </port>
206
- </output>
207
- </layer>
208
- <layer id="17" name="Constant_85" type="Const" version="opset1">
209
- <data element_type="u8" shape="64" offset="689" size="64" />
210
- <output>
211
- <port id="0" precision="U8">
212
- <dim>64</dim>
213
- </port>
214
- </output>
215
- </layer>
216
- <layer id="18" name="Constant_82" type="Const" version="opset1">
217
- <data element_type="u8" shape="447" offset="753" size="447" />
218
- <output>
219
- <port id="0" precision="U8">
220
- <dim>447</dim>
221
- </port>
222
- </output>
223
- </layer>
224
- <layer id="19" name="StringTensorUnpack_83" type="StringTensorUnpack" version="extension">
225
- <data mode="begins_ends" />
226
- <input>
227
- <port id="0" precision="U8">
228
- <dim>447</dim>
229
- </port>
230
- </input>
231
- <output>
232
- <port id="1" precision="I32">
233
- <dim>-1</dim>
234
- </port>
235
- <port id="2" precision="I32">
236
- <dim>-1</dim>
237
- </port>
238
- <port id="3" precision="U8">
239
  <dim>-1</dim>
240
  </port>
241
  </output>
242
  </layer>
243
- <layer id="20" name="RegexSplit_86" type="RegexSplit" version="extension">
244
- <data behaviour="isolate" invert="false" max_splits="-1" />
245
  <input>
246
  <port id="0" precision="I32">
247
  <dim>-1</dim>
@@ -249,47 +191,38 @@
249
  <port id="1" precision="I32">
250
  <dim>-1</dim>
251
  </port>
252
- <port id="2" precision="I32">
253
- <dim>-1</dim>
254
- </port>
255
- <port id="3" precision="I32">
256
- <dim>-1</dim>
257
- </port>
258
- <port id="4" precision="U8">
259
- <dim>-1</dim>
260
- </port>
261
- <port id="5" precision="U8">
262
- <dim>64</dim>
263
- </port>
264
- <port id="6" precision="I32">
265
- <dim>-1</dim>
266
- </port>
267
- <port id="7" precision="I32">
268
  <dim>-1</dim>
269
  </port>
270
- <port id="8" precision="U8">
271
  <dim>-1</dim>
272
  </port>
273
  </input>
274
  <output>
275
- <port id="9" precision="I32">
276
  <dim>-1</dim>
277
  </port>
278
- <port id="10" precision="I32">
279
  <dim>-1</dim>
280
  </port>
281
- <port id="11" precision="I32">
282
  <dim>-1</dim>
283
  </port>
284
- <port id="12" precision="I32">
285
  <dim>-1</dim>
286
  </port>
287
- <port id="13" precision="U8">
288
- <dim>-1</dim>
 
 
 
 
 
289
  </port>
290
  </output>
291
  </layer>
292
- <layer id="21" name="BytesToChars_87" type="BytesToChars" version="extension">
 
293
  <input>
294
  <port id="0" precision="I32">
295
  <dim>-1</dim>
@@ -306,38 +239,47 @@
306
  <port id="4" precision="U8">
307
  <dim>-1</dim>
308
  </port>
 
 
 
 
 
 
309
  </input>
310
  <output>
311
- <port id="5" precision="I32">
312
  <dim>-1</dim>
313
  </port>
314
- <port id="6" precision="I32">
315
  <dim>-1</dim>
316
  </port>
317
- <port id="7" precision="I32">
318
  <dim>-1</dim>
319
  </port>
320
- <port id="8" precision="I32">
321
  <dim>-1</dim>
322
  </port>
323
- <port id="9" precision="U8">
 
 
 
324
  <dim>-1</dim>
325
  </port>
326
  </output>
327
  </layer>
328
- <layer id="22" name="Constant_89" type="Const" version="opset1">
329
- <data element_type="u8" shape="558445" offset="1200" size="558445" />
330
  <output>
331
  <port id="0" precision="U8">
332
- <dim>558445</dim>
333
  </port>
334
  </output>
335
  </layer>
336
- <layer id="23" name="StringTensorUnpack_90" type="StringTensorUnpack" version="extension">
337
  <data mode="begins_ends" />
338
  <input>
339
  <port id="0" precision="U8">
340
- <dim>558445</dim>
341
  </port>
342
  </input>
343
  <output>
@@ -352,19 +294,19 @@
352
  </port>
353
  </output>
354
  </layer>
355
- <layer id="24" name="Constant_170" type="Const" version="opset1">
356
- <data element_type="u8" shape="606619" offset="559645" size="606619" />
357
  <output>
358
  <port id="0" precision="U8">
359
- <dim>606619</dim>
360
  </port>
361
  </output>
362
  </layer>
363
- <layer id="25" name="StringTensorUnpack_171" type="StringTensorUnpack" version="extension">
364
  <data mode="begins_ends" />
365
  <input>
366
  <port id="0" precision="U8">
367
- <dim>606619</dim>
368
  </port>
369
  </input>
370
  <output>
@@ -379,25 +321,19 @@
379
  </port>
380
  </output>
381
  </layer>
382
- <layer id="26" name="Constant_98" type="Const" version="opset1">
383
- <data element_type="i64" shape="" offset="0" size="8" />
384
- <output>
385
- <port id="0" precision="I64" />
386
- </output>
387
- </layer>
388
- <layer id="27" name="Constant_92" type="Const" version="opset1">
389
- <data element_type="u8" shape="447" offset="753" size="447" />
390
  <output>
391
  <port id="0" precision="U8">
392
- <dim>447</dim>
393
  </port>
394
  </output>
395
  </layer>
396
- <layer id="28" name="StringTensorUnpack_93" type="StringTensorUnpack" version="extension">
397
  <data mode="begins_ends" />
398
  <input>
399
  <port id="0" precision="U8">
400
- <dim>447</dim>
401
  </port>
402
  </input>
403
  <output>
@@ -412,150 +348,43 @@
412
  </port>
413
  </output>
414
  </layer>
415
- <layer id="29" name="ShapeOf_94" type="ShapeOf" version="opset3">
416
- <data output_type="i64" />
417
- <input>
418
- <port id="0" precision="I32">
419
- <dim>-1</dim>
420
- </port>
421
- </input>
422
- <output>
423
- <port id="1" precision="I64">
424
- <dim>1</dim>
425
- </port>
426
- </output>
427
- </layer>
428
- <layer id="30" name="Constant_95" type="Const" version="opset1">
429
- <data element_type="i64" shape="" offset="0" size="8" />
430
- <output>
431
- <port id="0" precision="I64" />
432
- </output>
433
- </layer>
434
- <layer id="31" name="Constant_96" type="Const" version="opset1">
435
- <data element_type="i64" shape="" offset="0" size="8" />
436
  <output>
437
- <port id="0" precision="I64" />
438
- </output>
439
- </layer>
440
- <layer id="32" name="Gather_97" type="Gather" version="opset8">
441
- <data batch_dims="0" />
442
- <input>
443
- <port id="0" precision="I64">
444
- <dim>1</dim>
445
  </port>
446
- <port id="1" precision="I64" />
447
- <port id="2" precision="I64" />
448
- </input>
449
- <output>
450
- <port id="3" precision="I64" />
451
  </output>
452
  </layer>
453
- <layer id="33" name="Constant_99" type="Const" version="opset1">
454
- <data element_type="i64" shape="" offset="8" size="8" />
455
- <output>
456
- <port id="0" precision="I64" />
457
- </output>
458
- </layer>
459
- <layer id="34" name="Range_100" type="Range" version="opset4">
460
- <data output_type="i32" />
461
  <input>
462
- <port id="0" precision="I64" />
463
- <port id="1" precision="I64" />
464
- <port id="2" precision="I64" />
465
- </input>
466
- <output>
467
- <port id="3" precision="I32">
468
- <dim>-1</dim>
469
  </port>
470
- </output>
471
- </layer>
472
- <layer id="35" name="Constant_102" type="Const" version="opset1">
473
- <data element_type="i64" shape="" offset="8" size="8" />
474
- <output>
475
- <port id="0" precision="I64" />
476
- </output>
477
- </layer>
478
- <layer id="36" name="Constant_103" type="Const" version="opset1">
479
- <data element_type="i64" shape="" offset="8" size="8" />
480
- <output>
481
- <port id="0" precision="I64" />
482
- </output>
483
- </layer>
484
- <layer id="37" name="Add_104" type="Add" version="opset1">
485
- <data auto_broadcast="numpy" />
486
- <input>
487
- <port id="0" precision="I64" />
488
- <port id="1" precision="I64" />
489
  </input>
490
  <output>
491
- <port id="2" precision="I64" />
492
- </output>
493
- </layer>
494
- <layer id="38" name="Constant_105" type="Const" version="opset1">
495
- <data element_type="i64" shape="" offset="8" size="8" />
496
- <output>
497
- <port id="0" precision="I64" />
498
- </output>
499
- </layer>
500
- <layer id="39" name="Range_106" type="Range" version="opset4">
501
- <data output_type="i32" />
502
- <input>
503
- <port id="0" precision="I64" />
504
- <port id="1" precision="I64" />
505
- <port id="2" precision="I64" />
506
- </input>
507
- <output>
508
- <port id="3" precision="I32">
509
- <dim>-1</dim>
510
- </port>
511
- </output>
512
- </layer>
513
- <layer id="40" name="BytesToChars_168" type="BytesToChars" version="extension">
514
- <input>
515
- <port id="0" precision="I32">
516
- <dim>-1</dim>
517
- </port>
518
  <port id="1" precision="I32">
519
  <dim>-1</dim>
520
  </port>
521
  <port id="2" precision="I32">
522
  <dim>-1</dim>
523
  </port>
524
- <port id="3" precision="I32">
525
- <dim>-1</dim>
526
- </port>
527
- <port id="4" precision="U8">
528
- <dim>-1</dim>
529
- </port>
530
- </input>
531
- <output>
532
- <port id="5" precision="I32">
533
- <dim>-1</dim>
534
- </port>
535
- <port id="6" precision="I32">
536
- <dim>-1</dim>
537
- </port>
538
- <port id="7" precision="I32">
539
- <dim>-1</dim>
540
- </port>
541
- <port id="8" precision="I32">
542
- <dim>-1</dim>
543
- </port>
544
- <port id="9" precision="U8">
545
  <dim>-1</dim>
546
  </port>
547
  </output>
548
  </layer>
549
- <layer id="41" name="Constant_172" type="Const" version="opset1">
550
- <data element_type="i32" shape="26" offset="1166264" size="104" />
551
  <output>
552
  <port id="0" precision="I32">
553
- <dim>26</dim>
554
  </port>
555
  </output>
556
  </layer>
557
- <layer id="42" name="BPETokenizer_173" type="BPETokenizer" version="extension">
558
- <data unk_token="" fuse_unk="false" suffix_indicator="" end_suffix="" byte_fallback="false" />
559
  <input>
560
  <port id="0" precision="I32">
561
  <dim>-1</dim>
@@ -600,22 +429,31 @@
600
  <dim>-1</dim>
601
  </port>
602
  <port id="14" precision="I32">
603
- <dim>26</dim>
604
  </port>
605
- </input>
606
- <output>
607
  <port id="15" precision="I32">
608
  <dim>-1</dim>
609
  </port>
610
- <port id="16" precision="I32">
611
  <dim>-1</dim>
612
  </port>
613
  <port id="17" precision="I32">
 
 
 
 
 
 
 
 
 
 
 
614
  <dim>-1</dim>
615
  </port>
616
  </output>
617
  </layer>
618
- <layer id="43" name="Subtract_174" type="Subtract" version="opset1">
619
  <data auto_broadcast="numpy" />
620
  <input>
621
  <port id="0" precision="I32">
@@ -631,13 +469,13 @@
631
  </port>
632
  </output>
633
  </layer>
634
- <layer id="44" name="Constant_175" type="Const" version="opset1">
635
- <data element_type="i32" shape="" offset="1166368" size="4" />
636
  <output>
637
  <port id="0" precision="I32" />
638
  </output>
639
  </layer>
640
- <layer id="45" name="Minimum_176" type="Minimum" version="opset1">
641
  <data auto_broadcast="numpy" />
642
  <input>
643
  <port id="0" precision="I32">
@@ -651,7 +489,7 @@
651
  </port>
652
  </output>
653
  </layer>
654
- <layer id="46" name="Add_177" type="Add" version="opset1">
655
  <data auto_broadcast="numpy" />
656
  <input>
657
  <port id="0" precision="I32">
@@ -667,15 +505,15 @@
667
  </port>
668
  </output>
669
  </layer>
670
- <layer id="47" name="Constant_178" type="Const" version="opset1">
671
- <data element_type="i32" shape="1" offset="1166372" size="4" />
672
  <output>
673
  <port id="0" precision="I32">
674
  <dim>1</dim>
675
  </port>
676
  </output>
677
  </layer>
678
- <layer id="48" name="CombineSegments_179" type="CombineSegments" version="extension">
679
  <input>
680
  <port id="0" precision="I32">
681
  <dim>-1</dim>
@@ -711,7 +549,7 @@
711
  </port>
712
  </output>
713
  </layer>
714
- <layer id="49" name="Subtract_180" type="Subtract" version="opset1">
715
  <data auto_broadcast="numpy" />
716
  <input>
717
  <port id="0" precision="I32">
@@ -727,13 +565,13 @@
727
  </port>
728
  </output>
729
  </layer>
730
- <layer id="50" name="Constant_181" type="Const" version="opset1">
731
- <data element_type="i32" shape="" offset="1166372" size="4" />
732
  <output>
733
  <port id="0" precision="I32" />
734
  </output>
735
  </layer>
736
- <layer id="51" name="ReduceMax_182" type="ReduceMax" version="opset1">
737
  <data keep_dims="false" />
738
  <input>
739
  <port id="0" precision="I32">
@@ -745,14 +583,14 @@
745
  <port id="2" precision="I32" />
746
  </output>
747
  </layer>
748
- <layer id="52" name="Constant_183" type="Const" version="opset1">
749
- <data element_type="i32" shape="" offset="1166372" size="4" />
750
  <output>
751
  <port id="0" precision="I32" />
752
  </output>
753
  </layer>
754
- <layer id="53" name="RaggedToDense_184" type="RaggedToDense" version="extension">
755
- <data pad_right="true" />
756
  <input>
757
  <port id="0" precision="I32">
758
  <dim>-1</dim>
@@ -777,7 +615,7 @@
777
  </port>
778
  </output>
779
  </layer>
780
- <layer id="54" name="Convert_185" type="Convert" version="opset1">
781
  <data destination_type="i32" />
782
  <input>
783
  <port id="0" precision="BOOL">
@@ -792,7 +630,7 @@
792
  </port>
793
  </output>
794
  </layer>
795
- <layer id="55" name="Convert_185" type="Convert" version="opset1">
796
  <data destination_type="i64" />
797
  <input>
798
  <port id="0" precision="I32">
@@ -807,7 +645,7 @@
807
  </port>
808
  </output>
809
  </layer>
810
- <layer id="57" name="RaggedToDense_184.0" type="Convert" version="opset1">
811
  <data destination_type="i64" />
812
  <input>
813
  <port id="0" precision="I32">
@@ -822,7 +660,7 @@
822
  </port>
823
  </output>
824
  </layer>
825
- <layer id="58" name="Result_188" type="Result" version="opset1">
826
  <input>
827
  <port id="0" precision="I64">
828
  <dim>-1</dim>
@@ -830,7 +668,7 @@
830
  </port>
831
  </input>
832
  </layer>
833
- <layer id="56" name="Result_190" type="Result" version="opset1">
834
  <input>
835
  <port id="0" precision="I64">
836
  <dim>-1</dim>
@@ -841,103 +679,86 @@
841
  </layers>
842
  <edges>
843
  <edge from-layer="0" from-port="0" to-layer="2" to-port="0" />
844
- <edge from-layer="1" from-port="0" to-layer="9" to-port="0" />
845
  <edge from-layer="2" from-port="1" to-layer="3" to-port="0" />
846
- <edge from-layer="2" from-port="2" to-layer="3" to-port="1" />
847
- <edge from-layer="2" from-port="3" to-layer="3" to-port="2" />
848
- <edge from-layer="3" from-port="5" to-layer="16" to-port="4" />
849
- <edge from-layer="3" from-port="4" to-layer="16" to-port="3" />
850
- <edge from-layer="3" from-port="3" to-layer="16" to-port="2" />
851
- <edge from-layer="3" from-port="3" to-layer="4" to-port="0" />
852
- <edge from-layer="4" from-port="1" to-layer="7" to-port="0" />
853
- <edge from-layer="5" from-port="0" to-layer="7" to-port="1" />
854
- <edge from-layer="6" from-port="0" to-layer="7" to-port="2" />
855
- <edge from-layer="7" from-port="3" to-layer="9" to-port="1" />
856
- <edge from-layer="7" from-port="3" to-layer="12" to-port="0" />
857
- <edge from-layer="8" from-port="0" to-layer="9" to-port="2" />
858
- <edge from-layer="9" from-port="3" to-layer="16" to-port="0" />
859
- <edge from-layer="10" from-port="0" to-layer="14" to-port="0" />
860
- <edge from-layer="11" from-port="0" to-layer="12" to-port="1" />
861
- <edge from-layer="12" from-port="2" to-layer="14" to-port="1" />
862
- <edge from-layer="13" from-port="0" to-layer="14" to-port="2" />
863
- <edge from-layer="14" from-port="3" to-layer="16" to-port="1" />
864
- <edge from-layer="15" from-port="0" to-layer="16" to-port="5" />
865
- <edge from-layer="16" from-port="6" to-layer="20" to-port="0" />
866
- <edge from-layer="16" from-port="7" to-layer="20" to-port="1" />
867
- <edge from-layer="16" from-port="8" to-layer="20" to-port="2" />
868
- <edge from-layer="16" from-port="9" to-layer="20" to-port="3" />
869
- <edge from-layer="16" from-port="10" to-layer="20" to-port="4" />
870
- <edge from-layer="17" from-port="0" to-layer="20" to-port="5" />
871
- <edge from-layer="18" from-port="0" to-layer="19" to-port="0" />
872
- <edge from-layer="19" from-port="1" to-layer="20" to-port="6" />
873
- <edge from-layer="19" from-port="2" to-layer="20" to-port="7" />
874
- <edge from-layer="19" from-port="3" to-layer="20" to-port="8" />
875
- <edge from-layer="20" from-port="9" to-layer="21" to-port="0" />
876
- <edge from-layer="20" from-port="13" to-layer="21" to-port="4" />
877
- <edge from-layer="20" from-port="12" to-layer="21" to-port="3" />
878
- <edge from-layer="20" from-port="10" to-layer="21" to-port="1" />
879
- <edge from-layer="20" from-port="11" to-layer="21" to-port="2" />
880
- <edge from-layer="21" from-port="9" to-layer="42" to-port="4" />
881
- <edge from-layer="21" from-port="8" to-layer="42" to-port="3" />
882
- <edge from-layer="21" from-port="7" to-layer="42" to-port="2" />
883
- <edge from-layer="21" from-port="6" to-layer="42" to-port="1" />
884
- <edge from-layer="21" from-port="5" to-layer="42" to-port="0" />
885
- <edge from-layer="22" from-port="0" to-layer="23" to-port="0" />
886
- <edge from-layer="23" from-port="3" to-layer="42" to-port="7" />
887
- <edge from-layer="23" from-port="2" to-layer="42" to-port="6" />
888
- <edge from-layer="23" from-port="1" to-layer="42" to-port="5" />
889
- <edge from-layer="24" from-port="0" to-layer="25" to-port="0" />
890
- <edge from-layer="25" from-port="3" to-layer="42" to-port="10" />
891
- <edge from-layer="25" from-port="2" to-layer="42" to-port="9" />
892
- <edge from-layer="25" from-port="1" to-layer="42" to-port="8" />
893
- <edge from-layer="26" from-port="0" to-layer="34" to-port="0" />
894
- <edge from-layer="27" from-port="0" to-layer="28" to-port="0" />
895
- <edge from-layer="28" from-port="2" to-layer="40" to-port="3" />
896
- <edge from-layer="28" from-port="1" to-layer="29" to-port="0" />
897
- <edge from-layer="28" from-port="3" to-layer="40" to-port="4" />
898
- <edge from-layer="28" from-port="1" to-layer="40" to-port="2" />
899
- <edge from-layer="29" from-port="1" to-layer="32" to-port="0" />
900
- <edge from-layer="30" from-port="0" to-layer="32" to-port="1" />
901
- <edge from-layer="31" from-port="0" to-layer="32" to-port="2" />
902
- <edge from-layer="32" from-port="3" to-layer="34" to-port="1" />
903
- <edge from-layer="32" from-port="3" to-layer="37" to-port="0" />
904
- <edge from-layer="33" from-port="0" to-layer="34" to-port="2" />
905
- <edge from-layer="34" from-port="3" to-layer="40" to-port="0" />
906
- <edge from-layer="35" from-port="0" to-layer="39" to-port="0" />
 
 
 
 
907
  <edge from-layer="36" from-port="0" to-layer="37" to-port="1" />
908
- <edge from-layer="37" from-port="2" to-layer="39" to-port="1" />
909
- <edge from-layer="38" from-port="0" to-layer="39" to-port="2" />
910
- <edge from-layer="39" from-port="3" to-layer="40" to-port="1" />
911
- <edge from-layer="40" from-port="7" to-layer="42" to-port="11" />
912
- <edge from-layer="40" from-port="8" to-layer="42" to-port="12" />
913
- <edge from-layer="40" from-port="9" to-layer="42" to-port="13" />
914
- <edge from-layer="41" from-port="0" to-layer="42" to-port="14" />
915
- <edge from-layer="42" from-port="17" to-layer="48" to-port="2" />
916
- <edge from-layer="42" from-port="15" to-layer="48" to-port="0" />
917
- <edge from-layer="42" from-port="15" to-layer="46" to-port="0" />
918
- <edge from-layer="42" from-port="15" to-layer="43" to-port="1" />
919
- <edge from-layer="42" from-port="16" to-layer="43" to-port="0" />
920
- <edge from-layer="43" from-port="2" to-layer="45" to-port="0" />
921
- <edge from-layer="44" from-port="0" to-layer="45" to-port="1" />
922
- <edge from-layer="45" from-port="2" to-layer="46" to-port="1" />
923
- <edge from-layer="46" from-port="2" to-layer="48" to-port="1" />
924
- <edge from-layer="47" from-port="0" to-layer="48" to-port="3" />
925
- <edge from-layer="48" from-port="5" to-layer="49" to-port="0" />
926
- <edge from-layer="48" from-port="4" to-layer="49" to-port="1" />
927
- <edge from-layer="48" from-port="4" to-layer="53" to-port="0" />
928
- <edge from-layer="48" from-port="5" to-layer="53" to-port="1" />
929
- <edge from-layer="48" from-port="6" to-layer="53" to-port="2" />
930
- <edge from-layer="49" from-port="2" to-layer="51" to-port="0" />
931
- <edge from-layer="50" from-port="0" to-layer="51" to-port="1" />
932
- <edge from-layer="51" from-port="2" to-layer="53" to-port="3" />
933
- <edge from-layer="52" from-port="0" to-layer="53" to-port="4" />
934
- <edge from-layer="53" from-port="6" to-layer="54" to-port="0" />
935
- <edge from-layer="53" from-port="5" to-layer="57" to-port="0" />
936
- <edge from-layer="54" from-port="1" to-layer="55" to-port="0" />
937
- <edge from-layer="55" from-port="1" to-layer="56" to-port="0" />
938
- <edge from-layer="57" from-port="1" to-layer="58" to-port="0" />
939
  </edges>
940
  <rt_info>
 
941
  <eos_token_id value="0" />
 
 
942
  </rt_info>
943
  </net>
 
1
  <?xml version="1.0"?>
2
  <net name="tokenizer" version="11">
3
  <layers>
4
+ <layer id="0" name="Parameter_62535" type="Parameter" version="opset1">
5
  <data shape="?" element_type="string" />
6
  <output>
7
+ <port id="0" precision="STRING" names="Parameter_62535">
8
  <dim>-1</dim>
9
  </port>
10
  </output>
11
  </layer>
12
+ <layer id="1" name="Constant_62541" type="Const" version="opset1">
13
  <data element_type="i64" shape="" offset="0" size="8" />
14
  <output>
15
  <port id="0" precision="I64" />
16
  </output>
17
  </layer>
18
+ <layer id="2" name="StringTensorUnpack_62536" type="StringTensorUnpack" version="extension">
19
  <data mode="begins_ends" />
20
  <input>
21
  <port id="0" precision="STRING">
 
34
  </port>
35
  </output>
36
  </layer>
37
+ <layer id="3" name="ShapeOf_62537" type="ShapeOf" version="opset3">
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
38
  <data output_type="i64" />
39
  <input>
40
  <port id="0" precision="I32">
 
47
  </port>
48
  </output>
49
  </layer>
50
+ <layer id="4" name="Constant_62538" type="Const" version="opset1">
51
  <data element_type="i64" shape="" offset="0" size="8" />
52
  <output>
53
  <port id="0" precision="I64" />
54
  </output>
55
  </layer>
56
+ <layer id="5" name="Constant_62539" type="Const" version="opset1">
57
  <data element_type="i64" shape="" offset="0" size="8" />
58
  <output>
59
  <port id="0" precision="I64" />
60
  </output>
61
  </layer>
62
+ <layer id="6" name="Gather_62540" type="Gather" version="opset8">
63
  <data batch_dims="0" />
64
  <input>
65
  <port id="0" precision="I64">
 
72
  <port id="3" precision="I64" />
73
  </output>
74
  </layer>
75
+ <layer id="7" name="Constant_62542" type="Const" version="opset1">
76
  <data element_type="i64" shape="" offset="8" size="8" />
77
  <output>
78
  <port id="0" precision="I64" />
79
  </output>
80
  </layer>
81
+ <layer id="8" name="Range_62543" type="Range" version="opset4">
82
  <data output_type="i32" />
83
  <input>
84
  <port id="0" precision="I64" />
 
91
  </port>
92
  </output>
93
  </layer>
94
+ <layer id="9" name="Constant_62544" type="Const" version="opset1">
95
  <data element_type="i64" shape="" offset="8" size="8" />
96
  <output>
97
  <port id="0" precision="I64" />
98
  </output>
99
  </layer>
100
+ <layer id="10" name="Constant_62545" type="Const" version="opset1">
101
  <data element_type="i64" shape="" offset="8" size="8" />
102
  <output>
103
  <port id="0" precision="I64" />
104
  </output>
105
  </layer>
106
+ <layer id="11" name="Add_62546" type="Add" version="opset1">
107
  <data auto_broadcast="numpy" />
108
  <input>
109
  <port id="0" precision="I64" />
 
113
  <port id="2" precision="I64" />
114
  </output>
115
  </layer>
116
+ <layer id="12" name="Constant_62547" type="Const" version="opset1">
117
  <data element_type="i64" shape="" offset="8" size="8" />
118
  <output>
119
  <port id="0" precision="I64" />
120
  </output>
121
  </layer>
122
+ <layer id="13" name="Range_62548" type="Range" version="opset4">
123
  <data output_type="i32" />
124
  <input>
125
  <port id="0" precision="I64" />
 
132
  </port>
133
  </output>
134
  </layer>
135
+ <layer id="14" name="Constant_62610" type="Const" version="opset1">
136
+ <data element_type="u8" shape="763" offset="16" size="763" />
137
  <output>
138
  <port id="0" precision="U8">
139
+ <dim>763</dim>
140
  </port>
141
  </output>
142
  </layer>
143
+ <layer id="15" name="SpecialTokensSplit_62611" type="SpecialTokensSplit" version="extension">
 
144
  <input>
145
  <port id="0" precision="I32">
146
  <dim>-1</dim>
 
158
  <dim>-1</dim>
159
  </port>
160
  <port id="5" precision="U8">
161
+ <dim>763</dim>
162
  </port>
163
  </input>
164
  <output>
 
177
  <port id="10" precision="U8">
178
  <dim>-1</dim>
179
  </port>
180
+ <port id="11" precision="BOOL">
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
181
  <dim>-1</dim>
182
  </port>
183
  </output>
184
  </layer>
185
+ <layer id="16" name="NormalizeUnicode_62612" type="NormalizeUnicode" version="extension">
186
+ <data normalization_form="NFC" />
187
  <input>
188
  <port id="0" precision="I32">
189
  <dim>-1</dim>
 
191
  <port id="1" precision="I32">
192
  <dim>-1</dim>
193
  </port>
194
+ <port id="2" precision="U8">
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
195
  <dim>-1</dim>
196
  </port>
197
+ <port id="3" precision="BOOL">
198
  <dim>-1</dim>
199
  </port>
200
  </input>
201
  <output>
202
+ <port id="4" precision="I32">
203
  <dim>-1</dim>
204
  </port>
205
+ <port id="5" precision="I32">
206
  <dim>-1</dim>
207
  </port>
208
+ <port id="6" precision="U8">
209
  <dim>-1</dim>
210
  </port>
211
+ <port id="7" precision="BOOL">
212
  <dim>-1</dim>
213
  </port>
214
+ </output>
215
+ </layer>
216
+ <layer id="17" name="Constant_62614" type="Const" version="opset1">
217
+ <data element_type="u8" shape="64" offset="779" size="64" />
218
+ <output>
219
+ <port id="0" precision="U8">
220
+ <dim>64</dim>
221
  </port>
222
  </output>
223
  </layer>
224
+ <layer id="18" name="RegexSplit_62615" type="RegexSplit" version="extension">
225
+ <data behaviour="isolate" invert="false" max_splits="-1" />
226
  <input>
227
  <port id="0" precision="I32">
228
  <dim>-1</dim>
 
239
  <port id="4" precision="U8">
240
  <dim>-1</dim>
241
  </port>
242
+ <port id="5" precision="BOOL">
243
+ <dim>-1</dim>
244
+ </port>
245
+ <port id="6" precision="U8">
246
+ <dim>64</dim>
247
+ </port>
248
  </input>
249
  <output>
250
+ <port id="7" precision="I32">
251
  <dim>-1</dim>
252
  </port>
253
+ <port id="8" precision="I32">
254
  <dim>-1</dim>
255
  </port>
256
+ <port id="9" precision="I32">
257
  <dim>-1</dim>
258
  </port>
259
+ <port id="10" precision="I32">
260
  <dim>-1</dim>
261
  </port>
262
+ <port id="11" precision="U8">
263
+ <dim>-1</dim>
264
+ </port>
265
+ <port id="12" precision="BOOL">
266
  <dim>-1</dim>
267
  </port>
268
  </output>
269
  </layer>
270
+ <layer id="19" name="Constant_62617" type="Const" version="opset1">
271
+ <data element_type="u8" shape="514078" offset="843" size="514078" />
272
  <output>
273
  <port id="0" precision="U8">
274
+ <dim>514078</dim>
275
  </port>
276
  </output>
277
  </layer>
278
+ <layer id="20" name="StringTensorUnpack_62618" type="StringTensorUnpack" version="extension">
279
  <data mode="begins_ends" />
280
  <input>
281
  <port id="0" precision="U8">
282
+ <dim>514078</dim>
283
  </port>
284
  </input>
285
  <output>
 
294
  </port>
295
  </output>
296
  </layer>
297
+ <layer id="21" name="Constant_62623" type="Const" version="opset1">
298
+ <data element_type="u8" shape="362936" offset="514921" size="362936" />
299
  <output>
300
  <port id="0" precision="U8">
301
+ <dim>362936</dim>
302
  </port>
303
  </output>
304
  </layer>
305
+ <layer id="22" name="StringTensorUnpack_62624" type="StringTensorUnpack" version="extension">
306
  <data mode="begins_ends" />
307
  <input>
308
  <port id="0" precision="U8">
309
+ <dim>362936</dim>
310
  </port>
311
  </input>
312
  <output>
 
321
  </port>
322
  </output>
323
  </layer>
324
+ <layer id="23" name="Constant_62626" type="Const" version="opset1">
325
+ <data element_type="u8" shape="349500" offset="877857" size="349500" />
 
 
 
 
 
 
326
  <output>
327
  <port id="0" precision="U8">
328
+ <dim>349500</dim>
329
  </port>
330
  </output>
331
  </layer>
332
+ <layer id="24" name="StringTensorUnpack_62627" type="StringTensorUnpack" version="extension">
333
  <data mode="begins_ends" />
334
  <input>
335
  <port id="0" precision="U8">
336
+ <dim>349500</dim>
337
  </port>
338
  </input>
339
  <output>
 
348
  </port>
349
  </output>
350
  </layer>
351
+ <layer id="25" name="Constant_62620" type="Const" version="opset1">
352
+ <data element_type="u8" shape="462" offset="1227357" size="462" />
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
353
  <output>
354
+ <port id="0" precision="U8">
355
+ <dim>462</dim>
 
 
 
 
 
 
356
  </port>
 
 
 
 
 
357
  </output>
358
  </layer>
359
+ <layer id="26" name="StringTensorUnpack_62621" type="StringTensorUnpack" version="extension">
360
+ <data mode="begins_ends" />
 
 
 
 
 
 
361
  <input>
362
+ <port id="0" precision="U8">
363
+ <dim>462</dim>
 
 
 
 
 
364
  </port>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
365
  </input>
366
  <output>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
367
  <port id="1" precision="I32">
368
  <dim>-1</dim>
369
  </port>
370
  <port id="2" precision="I32">
371
  <dim>-1</dim>
372
  </port>
373
+ <port id="3" precision="U8">
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
374
  <dim>-1</dim>
375
  </port>
376
  </output>
377
  </layer>
378
+ <layer id="27" name="Constant_62628" type="Const" version="opset1">
379
+ <data element_type="i32" shape="27" offset="1227819" size="108" />
380
  <output>
381
  <port id="0" precision="I32">
382
+ <dim>27</dim>
383
  </port>
384
  </output>
385
  </layer>
386
+ <layer id="28" name="BPETokenizer_62629" type="BPETokenizer" version="extension">
387
+ <data unk_token="" fuse_unk="false" suffix_indicator="" end_suffix="" byte_fallback="false" cache_capacity="20000" />
388
  <input>
389
  <port id="0" precision="I32">
390
  <dim>-1</dim>
 
429
  <dim>-1</dim>
430
  </port>
431
  <port id="14" precision="I32">
432
+ <dim>-1</dim>
433
  </port>
 
 
434
  <port id="15" precision="I32">
435
  <dim>-1</dim>
436
  </port>
437
+ <port id="16" precision="U8">
438
  <dim>-1</dim>
439
  </port>
440
  <port id="17" precision="I32">
441
+ <dim>27</dim>
442
+ </port>
443
+ </input>
444
+ <output>
445
+ <port id="18" precision="I32">
446
+ <dim>-1</dim>
447
+ </port>
448
+ <port id="19" precision="I32">
449
+ <dim>-1</dim>
450
+ </port>
451
+ <port id="20" precision="I32">
452
  <dim>-1</dim>
453
  </port>
454
  </output>
455
  </layer>
456
+ <layer id="29" name="Subtract_62630" type="Subtract" version="opset1">
457
  <data auto_broadcast="numpy" />
458
  <input>
459
  <port id="0" precision="I32">
 
469
  </port>
470
  </output>
471
  </layer>
472
+ <layer id="30" name="Constant_62631" type="Const" version="opset1">
473
+ <data element_type="i32" shape="" offset="1227927" size="4" />
474
  <output>
475
  <port id="0" precision="I32" />
476
  </output>
477
  </layer>
478
+ <layer id="31" name="Minimum_62632" type="Minimum" version="opset1">
479
  <data auto_broadcast="numpy" />
480
  <input>
481
  <port id="0" precision="I32">
 
489
  </port>
490
  </output>
491
  </layer>
492
+ <layer id="32" name="Subtract_62633" type="Subtract" version="opset1">
493
  <data auto_broadcast="numpy" />
494
  <input>
495
  <port id="0" precision="I32">
 
505
  </port>
506
  </output>
507
  </layer>
508
+ <layer id="33" name="Constant_62634" type="Const" version="opset1">
509
+ <data element_type="i32" shape="1" offset="1227931" size="4" />
510
  <output>
511
  <port id="0" precision="I32">
512
  <dim>1</dim>
513
  </port>
514
  </output>
515
  </layer>
516
+ <layer id="34" name="CombineSegments_62635" type="CombineSegments" version="extension">
517
  <input>
518
  <port id="0" precision="I32">
519
  <dim>-1</dim>
 
549
  </port>
550
  </output>
551
  </layer>
552
+ <layer id="35" name="Subtract_62636" type="Subtract" version="opset1">
553
  <data auto_broadcast="numpy" />
554
  <input>
555
  <port id="0" precision="I32">
 
565
  </port>
566
  </output>
567
  </layer>
568
+ <layer id="36" name="Constant_62637" type="Const" version="opset1">
569
+ <data element_type="i32" shape="" offset="1227931" size="4" />
570
  <output>
571
  <port id="0" precision="I32" />
572
  </output>
573
  </layer>
574
+ <layer id="37" name="ReduceMax_62638" type="ReduceMax" version="opset1">
575
  <data keep_dims="false" />
576
  <input>
577
  <port id="0" precision="I32">
 
583
  <port id="2" precision="I32" />
584
  </output>
585
  </layer>
586
+ <layer id="38" name="Constant_62639" type="Const" version="opset1">
587
+ <data element_type="i32" shape="" offset="1227931" size="4" />
588
  <output>
589
  <port id="0" precision="I32" />
590
  </output>
591
  </layer>
592
+ <layer id="39" name="RaggedToDense_62640" type="RaggedToDense" version="extension">
593
+ <data pad_right="false" />
594
  <input>
595
  <port id="0" precision="I32">
596
  <dim>-1</dim>
 
615
  </port>
616
  </output>
617
  </layer>
618
+ <layer id="40" name="Convert_62641" type="Convert" version="opset1">
619
  <data destination_type="i32" />
620
  <input>
621
  <port id="0" precision="BOOL">
 
630
  </port>
631
  </output>
632
  </layer>
633
+ <layer id="41" name="Convert_62641" type="Convert" version="opset1">
634
  <data destination_type="i64" />
635
  <input>
636
  <port id="0" precision="I32">
 
645
  </port>
646
  </output>
647
  </layer>
648
+ <layer id="43" name="RaggedToDense_62640.0" type="Convert" version="opset1">
649
  <data destination_type="i64" />
650
  <input>
651
  <port id="0" precision="I32">
 
660
  </port>
661
  </output>
662
  </layer>
663
+ <layer id="44" name="Result_62644" type="Result" version="opset1">
664
  <input>
665
  <port id="0" precision="I64">
666
  <dim>-1</dim>
 
668
  </port>
669
  </input>
670
  </layer>
671
+ <layer id="42" name="Result_62646" type="Result" version="opset1">
672
  <input>
673
  <port id="0" precision="I64">
674
  <dim>-1</dim>
 
679
  </layers>
680
  <edges>
681
  <edge from-layer="0" from-port="0" to-layer="2" to-port="0" />
682
+ <edge from-layer="1" from-port="0" to-layer="8" to-port="0" />
683
  <edge from-layer="2" from-port="1" to-layer="3" to-port="0" />
684
+ <edge from-layer="2" from-port="3" to-layer="15" to-port="4" />
685
+ <edge from-layer="2" from-port="2" to-layer="15" to-port="3" />
686
+ <edge from-layer="2" from-port="1" to-layer="15" to-port="2" />
687
+ <edge from-layer="3" from-port="1" to-layer="6" to-port="0" />
688
+ <edge from-layer="4" from-port="0" to-layer="6" to-port="1" />
689
+ <edge from-layer="5" from-port="0" to-layer="6" to-port="2" />
690
+ <edge from-layer="6" from-port="3" to-layer="11" to-port="0" />
691
+ <edge from-layer="6" from-port="3" to-layer="8" to-port="1" />
692
+ <edge from-layer="7" from-port="0" to-layer="8" to-port="2" />
693
+ <edge from-layer="8" from-port="3" to-layer="15" to-port="0" />
694
+ <edge from-layer="9" from-port="0" to-layer="13" to-port="0" />
695
+ <edge from-layer="10" from-port="0" to-layer="11" to-port="1" />
696
+ <edge from-layer="11" from-port="2" to-layer="13" to-port="1" />
697
+ <edge from-layer="12" from-port="0" to-layer="13" to-port="2" />
698
+ <edge from-layer="13" from-port="3" to-layer="15" to-port="1" />
699
+ <edge from-layer="14" from-port="0" to-layer="15" to-port="5" />
700
+ <edge from-layer="15" from-port="9" to-layer="16" to-port="1" />
701
+ <edge from-layer="15" from-port="7" to-layer="18" to-port="1" />
702
+ <edge from-layer="15" from-port="6" to-layer="18" to-port="0" />
703
+ <edge from-layer="15" from-port="11" to-layer="16" to-port="3" />
704
+ <edge from-layer="15" from-port="10" to-layer="16" to-port="2" />
705
+ <edge from-layer="15" from-port="8" to-layer="16" to-port="0" />
706
+ <edge from-layer="16" from-port="4" to-layer="18" to-port="2" />
707
+ <edge from-layer="16" from-port="5" to-layer="18" to-port="3" />
708
+ <edge from-layer="16" from-port="6" to-layer="18" to-port="4" />
709
+ <edge from-layer="16" from-port="7" to-layer="18" to-port="5" />
710
+ <edge from-layer="17" from-port="0" to-layer="18" to-port="6" />
711
+ <edge from-layer="18" from-port="11" to-layer="28" to-port="4" />
712
+ <edge from-layer="18" from-port="10" to-layer="28" to-port="3" />
713
+ <edge from-layer="18" from-port="9" to-layer="28" to-port="2" />
714
+ <edge from-layer="18" from-port="8" to-layer="28" to-port="1" />
715
+ <edge from-layer="18" from-port="7" to-layer="28" to-port="0" />
716
+ <edge from-layer="19" from-port="0" to-layer="20" to-port="0" />
717
+ <edge from-layer="20" from-port="1" to-layer="28" to-port="5" />
718
+ <edge from-layer="20" from-port="2" to-layer="28" to-port="6" />
719
+ <edge from-layer="20" from-port="3" to-layer="28" to-port="7" />
720
+ <edge from-layer="21" from-port="0" to-layer="22" to-port="0" />
721
+ <edge from-layer="22" from-port="1" to-layer="28" to-port="8" />
722
+ <edge from-layer="22" from-port="2" to-layer="28" to-port="9" />
723
+ <edge from-layer="22" from-port="3" to-layer="28" to-port="10" />
724
+ <edge from-layer="23" from-port="0" to-layer="24" to-port="0" />
725
+ <edge from-layer="24" from-port="1" to-layer="28" to-port="11" />
726
+ <edge from-layer="24" from-port="2" to-layer="28" to-port="12" />
727
+ <edge from-layer="24" from-port="3" to-layer="28" to-port="13" />
728
+ <edge from-layer="25" from-port="0" to-layer="26" to-port="0" />
729
+ <edge from-layer="26" from-port="1" to-layer="28" to-port="14" />
730
+ <edge from-layer="26" from-port="2" to-layer="28" to-port="15" />
731
+ <edge from-layer="26" from-port="3" to-layer="28" to-port="16" />
732
+ <edge from-layer="27" from-port="0" to-layer="28" to-port="17" />
733
+ <edge from-layer="28" from-port="19" to-layer="29" to-port="0" />
734
+ <edge from-layer="28" from-port="18" to-layer="29" to-port="1" />
735
+ <edge from-layer="28" from-port="19" to-layer="32" to-port="0" />
736
+ <edge from-layer="28" from-port="20" to-layer="34" to-port="2" />
737
+ <edge from-layer="28" from-port="19" to-layer="34" to-port="1" />
738
+ <edge from-layer="29" from-port="2" to-layer="31" to-port="0" />
739
+ <edge from-layer="30" from-port="0" to-layer="31" to-port="1" />
740
+ <edge from-layer="31" from-port="2" to-layer="32" to-port="1" />
741
+ <edge from-layer="32" from-port="2" to-layer="34" to-port="0" />
742
+ <edge from-layer="33" from-port="0" to-layer="34" to-port="3" />
743
+ <edge from-layer="34" from-port="5" to-layer="39" to-port="1" />
744
+ <edge from-layer="34" from-port="6" to-layer="39" to-port="2" />
745
+ <edge from-layer="34" from-port="4" to-layer="39" to-port="0" />
746
+ <edge from-layer="34" from-port="4" to-layer="35" to-port="1" />
747
+ <edge from-layer="34" from-port="5" to-layer="35" to-port="0" />
748
+ <edge from-layer="35" from-port="2" to-layer="37" to-port="0" />
749
  <edge from-layer="36" from-port="0" to-layer="37" to-port="1" />
750
+ <edge from-layer="37" from-port="2" to-layer="39" to-port="3" />
751
+ <edge from-layer="38" from-port="0" to-layer="39" to-port="4" />
752
+ <edge from-layer="39" from-port="6" to-layer="40" to-port="0" />
753
+ <edge from-layer="39" from-port="5" to-layer="43" to-port="0" />
754
+ <edge from-layer="40" from-port="1" to-layer="41" to-port="0" />
755
+ <edge from-layer="41" from-port="1" to-layer="42" to-port="0" />
756
+ <edge from-layer="43" from-port="1" to-layer="44" to-port="0" />
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
757
  </edges>
758
  <rt_info>
759
+ <bos_token_id value="0" />
760
  <eos_token_id value="0" />
761
+ <original_tokenizer_class value="&lt;class 'transformers.models.gpt_neox.tokenization_gpt_neox_fast.GPTNeoXTokenizerFast'>" />
762
+ <pad_token_id value="0" />
763
  </rt_info>
764
  </net>
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json CHANGED
@@ -234,7 +234,7 @@
234
  "### Response:"
235
  ],
236
  "bos_token": "<|endoftext|>",
237
- "clean_up_tokenization_spaces": true,
238
  "eos_token": "<|endoftext|>",
239
  "model_max_length": 1000000000000000019884624838656,
240
  "pad_token": "<|endoftext|>",
 
234
  "### Response:"
235
  ],
236
  "bos_token": "<|endoftext|>",
237
+ "clean_up_tokenization_spaces": false,
238
  "eos_token": "<|endoftext|>",
239
  "model_max_length": 1000000000000000019884624838656,
240
  "pad_token": "<|endoftext|>",