Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
huggingface
/
data-measurements-tool
like
98
Build error
App
Files
Files
Community
6
main
data-measurements-tool
/
cache_dir
/
c4_en_train_text
14 contributors
History:
8 commits
sasha
HF staff
pushing fig tok length PNGs
1a4c18a
almost 3 years ago
pmi_files
A variety of cache
almost 3 years ago
text_dset
Cache from rollback
almost 3 years ago
dset_peek.json
Safe
216 kB
LFS
Finishing c4 en train text cache
almost 3 years ago
dup_counts_df.feather
Safe
1.19 kB
LFS
dup counts cache
almost 3 years ago
fig_tok_length.json
Safe
1.52 MB
LFS
Finishing c4 en train text cache
almost 3 years ago
fig_tok_length.png
Safe
40.1 kB
LFS
pushing fig tok length PNGs
almost 3 years ago
general_stats.json
Safe
39 Bytes
LFS
Finishing c4 en train text cache
almost 3 years ago
general_stats_dict.json
Safe
90 Bytes
LFS
Finishing c4 en train text cache
almost 3 years ago
length_df.feather
Safe
270 MB
LFS
Finishing c4 en train text cache
almost 3 years ago
length_stats.json
Safe
63 Bytes
LFS
Finishing c4 en train text cache
almost 3 years ago
node_figure.json
Safe
91.5 kB
LFS
cache clusters
almost 3 years ago
node_list.th
Safe
23.5 MB
LFS
cache clusters
almost 3 years ago
npmi_terms.json
Safe
92 Bytes
LFS
A variety of cache
almost 3 years ago
sorted_top_vocab.feather
Safe
4.16 kB
LFS
Finishing c4 en train text cache
almost 3 years ago
text_dup_counts_df.feather
Safe
1.19 kB
LFS
A variety of cache
almost 3 years ago
vocab_counts.feather
Safe
7.46 MB
LFS
c4 en train text cache
almost 3 years ago
zipf_basic_stats.json
Safe
58.8 kB
LFS
A variety of cache
almost 3 years ago
zipf_fig.json
Safe
13.9 MB
LFS
c4 en train text cache
almost 3 years ago