1-800-BAD-CODE committed
Commit b51be78 • Parent(s): 5e360b5
Update README.md

README.md CHANGED
---

# Model Overview

This model accepts as input lower-cased, unpunctuated English text and performs punctuation restoration, true-casing (capitalization), and sentence boundary detection (segmentation) in one pass.

In contrast to many similar models, this model can predict punctuated acronyms (e.g., "U.S.") via a special "acronym" class, as well as arbitrarily-capitalized words (NATO, McDonald's, etc.) via multi-label true-casing predictions.

# Usage

The easiest way to use this model is to install [punctuators](https://github.com/1-800-BAD-CODE/punctuators):

```
pip install punctuators
```

Running the following script should load this model and run some texts:

<details open>
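The script itself is collapsed in the rendered README; the sketch below shows what such a script likely looks like. The class name `PunctCapSegModelONNX` and the pretrained alias `pcs_en` are assumptions here, so check the punctuators README for the exact names.

```python
from typing import List

from punctuators.models import PunctCapSegModelONNX

# Load the model. "pcs_en" is assumed to be the pretrained alias for this model;
# the first call downloads the ONNX graph and tokenizer to the local cache.
m: PunctCapSegModelONNX = PunctCapSegModelONNX.from_pretrained("pcs_en")

# Lower-cased, unpunctuated inputs, as the model expects.
input_texts: List[str] = [
    "hello friend how's it going it's 5 am and i'm already up",
    "i live in the northern us it's cold here",
]

# infer() returns, for each input, a list of punctuated, true-cased sentences.
results: List[List[str]] = m.infer(input_texts)

for input_text, output_texts in zip(input_texts, results):
    print(f"Input: {input_text}")
    print("Outputs:")
    for sentence in output_texts:
        print(f"  {sentence}")
    print()
```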
@@ -185,3 +183,31 @@

Examples longer than the model's maximum length (256) were truncated.
The number of affected sentences can be estimated from the "full stop" support: with 2,000 examples and 10 sentences per example, we expect 20,000 full stop targets total.
## Results

# Fun Facts

Some fun facts are examined in this section.

## Embeddings

Let's examine the embeddings (see graph above) to see if the model meaningfully employed them.

We show here the cosine similarity between the embeddings of each token:

|         | NULL  | ACRONYM | .     | ,     | ?    |
| -       | -     | -       | -     | -     | -    |
| NULL    | 1.00  |         |       |       |      |
| ACRONYM | -0.93 | 1.00    |       |       |      |
| .       | -1.00 | 0.94    | 1.00  |       |      |
| ,       | 1.00  | -0.94   | -1.00 | 1.00  |      |
| ?       | -1.00 | 0.93    | 1.00  | -1.00 | 1.00 |
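As an aside, a similarity matrix like this is easy to compute from the raw embedding vectors. The sketch below is illustrative only: `emb` is a hypothetical NumPy array holding one embedding per token, not an artifact shipped with this repo.

```python
import numpy as np

# Hypothetical stand-in for the punctuation-token embeddings
# (one row per token: NULL, ACRONYM, ".", ",", "?").
tokens = ["NULL", "ACRONYM", ".", ",", "?"]
rng = np.random.default_rng(0)
emb = rng.standard_normal((len(tokens), 8))  # placeholder values

# Normalize rows to unit length; pairwise cosine similarity is then
# just the matrix of dot products.
unit = emb / np.linalg.norm(emb, axis=1, keepdims=True)
cos_sim = unit @ unit.T

for label, row in zip(tokens, cos_sim):
    print(label, np.round(row, 2))
```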
Recall that these embeddings are used to predict sentence boundaries... thus we should expect full stops to cluster.

Indeed, we see that `NULL` and `COMMA` are exactly the same, because neither has any implication on sentence boundaries.

Next, we see that periods and question marks are exactly the same, and exactly the opposite of `NULL`.
This is expected, since these tokens typically imply sentence boundaries, whereas `NULL` and commas do not.

Lastly, we see that `ACRONYM` is quite similar, but not identical, to periods and question marks, and almost, but not quite, the opposite of `NULL` and commas.
Intuition suggests this is because acronyms can act as full stops ("I live in the northern U.S. It's cold here.") or not ("It's 5 a.m. and I'm tired").