etrop commited on
Commit
91c65e2
1 Parent(s): 4f8c857

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ ## Model Overview
2
+ AgroNt is a DNA language model trained on primarily edible plant genomes. More specifically, AgroNT uses the transformer architecture with self-attention and a masked language modeling
3
+ objective to leverage highly available genotype data from 48 different plant speices. AgroNt contains 1 billion parameters and has a context window of 1000 tokens. AgroNt uses a non-overlapping
4
+ 6-mer tokenizer to convert genomic nucletoide sequences to tokens. As a result the 1000 tokens correspond to approximately 6000 base pairs.
5
+
6
+
7
+ ## Using the Model from HF
8
+ '''python
9
+ Will update once it it public
10
+ '''