jena-shreyas
/

florence_ft

@@ -1,9 +1,8 @@
 ---
 base_model: HuggingFaceM4/Florence-2-DocVQA
 tags:
 - generated_from_trainer
-metrics:
-- accuracy
 model-index:
 - name: florence_ft
   results: []
@@ -12,13 +11,11 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/jenashreyas/florence_entity_extraction_ft/runs/zobmh6l8)
 # florence_ft
 This model is a fine-tuned version of [HuggingFaceM4/Florence-2-DocVQA](https://huggingface.co/HuggingFaceM4/Florence-2-DocVQA) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0500
-- Accuracy: 0.0
 ## Model description
@@ -37,124 +34,106 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 2
-- num_epochs: 100
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 1.0   | 7    | 1.4280          | 0.0      |
-| 2.3602        | 2.0   | 14   | 1.2109          | 0.0      |
-| 1.0245        | 3.0   | 21   | 1.2202          | 0.0      |
-| 1.0245        | 4.0   | 28   | 1.2606          | 0.0      |
-| 0.929         | 5.0   | 35   | 1.2130          | 0.0      |
-| 0.8584        | 6.0   | 42   | 1.1525          | 0.0      |
-| 0.8584        | 7.0   | 49   | 1.0689          | 0.0      |
-| 0.7889        | 8.0   | 56   | 1.0324          | 0.0      |
-| 0.7719        | 9.0   | 63   | 1.0274          | 0.0      |
-| 0.7284        | 10.0  | 70   | 1.0232          | 0.0      |
-| 0.7284        | 11.0  | 77   | 1.0275          | 0.0      |
-| 0.6993        | 12.0  | 84   | 1.0357          | 0.0      |
-| 0.688         | 13.0  | 91   | 1.0444          | 0.0      |
-| 0.688         | 14.0  | 98   | 1.0427          | 0.0      |
-| 0.6764        | 15.0  | 105  | 1.0408          | 0.0      |
-| 0.6521        | 16.0  | 112  | 1.0217          | 0.0      |
-| 0.6521        | 17.0  | 119  | 1.0021          | 0.0      |
-| 0.6511        | 18.0  | 126  | 1.0037          | 0.0      |
-| 0.637         | 19.0  | 133  | 1.0241          | 0.0      |
-| 0.6348        | 20.0  | 140  | 1.0314          | 0.0      |
-| 0.6348        | 21.0  | 147  | 1.0510          | 0.0      |
-| 0.6165        | 22.0  | 154  | 1.0596          | 0.0      |
-| 0.6245        | 23.0  | 161  | 1.0526          | 0.0      |
-| 0.6245        | 24.0  | 168  | 1.0489          | 0.0      |
-| 0.6107        | 25.0  | 175  | 1.0402          | 0.0      |
-| 0.6012        | 26.0  | 182  | 1.0453          | 0.0      |
-| 0.6012        | 27.0  | 189  | 1.0450          | 0.0      |
-| 0.5995        | 28.0  | 196  | 1.0416          | 0.0      |
-| 0.5975        | 29.0  | 203  | 1.0469          | 0.0      |
-| 0.5834        | 30.0  | 210  | 1.0590          | 0.0      |
-| 0.5834        | 31.0  | 217  | 1.0518          | 0.0      |
-| 0.585         | 32.0  | 224  | 1.0644          | 0.0      |
-| 0.5846        | 33.0  | 231  | 1.0692          | 0.0      |
-| 0.5846        | 34.0  | 238  | 1.0526          | 0.0      |
-| 0.5842        | 35.0  | 245  | 1.0608          | 0.0      |
-| 0.5783        | 36.0  | 252  | 1.0644          | 0.0      |
-| 0.5783        | 37.0  | 259  | 1.0479          | 0.0      |
-| 0.5899        | 38.0  | 266  | 1.0503          | 0.0      |
-| 0.5766        | 39.0  | 273  | 1.0502          | 0.0      |
-| 0.575         | 40.0  | 280  | 1.0606          | 0.0      |
-| 0.575         | 41.0  | 287  | 1.0568          | 0.0      |
-| 0.569         | 42.0  | 294  | 1.0587          | 0.0      |
-| 0.5673        | 43.0  | 301  | 1.0670          | 0.0      |
-| 0.5673        | 44.0  | 308  | 1.0699          | 0.0      |
-| 0.5663        | 45.0  | 315  | 1.0731          | 0.0      |
-| 0.5681        | 46.0  | 322  | 1.0819          | 0.0      |
-| 0.5681        | 47.0  | 329  | 1.0885          | 0.0      |
-| 0.5578        | 48.0  | 336  | 1.0928          | 0.0      |
-| 0.5641        | 49.0  | 343  | 1.0937          | 0.0      |
-| 0.5657        | 50.0  | 350  | 1.0815          | 0.0      |
-| 0.5657        | 51.0  | 357  | 1.0746          | 0.0      |
-| 0.5583        | 52.0  | 364  | 1.0672          | 0.0      |
-| 0.5664        | 53.0  | 371  | 1.0643          | 0.0      |
-| 0.5664        | 54.0  | 378  | 1.0648          | 0.0      |
-| 0.5614        | 55.0  | 385  | 1.0605          | 0.0      |
-| 0.5592        | 56.0  | 392  | 1.0610          | 0.0      |
-| 0.5592        | 57.0  | 399  | 1.0587          | 0.0      |
-| 0.5542        | 58.0  | 406  | 1.0614          | 0.0      |
-| 0.5629        | 59.0  | 413  | 1.0573          | 0.0      |
-| 0.549         | 60.0  | 420  | 1.0573          | 0.0      |
-| 0.549         | 61.0  | 427  | 1.0559          | 0.0      |
-| 0.5573        | 62.0  | 434  | 1.0581          | 0.0      |
-| 0.5656        | 63.0  | 441  | 1.0548          | 0.0      |
-| 0.5656        | 64.0  | 448  | 1.0515          | 0.0      |
-| 0.5489        | 65.0  | 455  | 1.0517          | 0.0      |
-| 0.5531        | 66.0  | 462  | 1.0514          | 0.0      |
-| 0.5531        | 67.0  | 469  | 1.0546          | 0.0      |
-| 0.5463        | 68.0  | 476  | 1.0553          | 0.0      |
-| 0.5527        | 69.0  | 483  | 1.0580          | 0.0      |
-| 0.554         | 70.0  | 490  | 1.0559          | 0.0      |
-| 0.554         | 71.0  | 497  | 1.0555          | 0.0      |
-| 0.5524        | 72.0  | 504  | 1.0566          | 0.0      |
-| 0.5498        | 73.0  | 511  | 1.0560          | 0.0      |
-| 0.5498        | 74.0  | 518  | 1.0569          | 0.0      |
-| 0.5592        | 75.0  | 525  | 1.0565          | 0.0      |
-| 0.561         | 76.0  | 532  | 1.0515          | 0.0      |
-| 0.561         | 77.0  | 539  | 1.0494          | 0.0      |
-| 0.5473        | 78.0  | 546  | 1.0507          | 0.0      |
-| 0.5493        | 79.0  | 553  | 1.0506          | 0.0      |
-| 0.5532        | 80.0  | 560  | 1.0491          | 0.0      |
-| 0.5532        | 81.0  | 567  | 1.0498          | 0.0      |
-| 0.5484        | 82.0  | 574  | 1.0481          | 0.0      |
-| 0.5523        | 83.0  | 581  | 1.0511          | 0.0      |
-| 0.5523        | 84.0  | 588  | 1.0498          | 0.0      |
-| 0.5496        | 85.0  | 595  | 1.0504          | 0.0      |
-| 0.5485        | 86.0  | 602  | 1.0499          | 0.0      |
-| 0.5485        | 87.0  | 609  | 1.0501          | 0.0      |
-| 0.5418        | 88.0  | 616  | 1.0501          | 0.0      |
-| 0.5547        | 89.0  | 623  | 1.0521          | 0.0      |
-| 0.5435        | 90.0  | 630  | 1.0511          | 0.0      |
-| 0.5435        | 91.0  | 637  | 1.0502          | 0.0      |
-| 0.5488        | 92.0  | 644  | 1.0506          | 0.0      |
-| 0.5472        | 93.0  | 651  | 1.0506          | 0.0      |
-| 0.5472        | 94.0  | 658  | 1.0503          | 0.0      |
-| 0.5521        | 95.0  | 665  | 1.0507          | 0.0      |
-| 0.5485        | 96.0  | 672  | 1.0509          | 0.0      |
-| 0.5485        | 97.0  | 679  | 1.0500          | 0.0      |
-| 0.5611        | 98.0  | 686  | 1.0514          | 0.0      |
-| 0.5517        | 99.0  | 693  | 1.0508          | 0.0      |
-| 0.5574        | 100.0 | 700  | 1.0500          | 0.0      |
 ### Framework versions
-- Transformers 4.42.0
-- Pytorch 2.1.2+cu121
-- Datasets 2.16.1
 - Tokenizers 0.19.1

 ---
+library_name: transformers
 base_model: HuggingFaceM4/Florence-2-DocVQA
 tags:
 - generated_from_trainer
 model-index:
 - name: florence_ft
   results: []
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 # florence_ft
 This model is a fine-tuned version of [HuggingFaceM4/Florence-2-DocVQA](https://huggingface.co/HuggingFaceM4/Florence-2-DocVQA) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0833
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-06
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.05
+- num_epochs: 1
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 4.4629        | 0.0123 | 25   | 4.6140          |
+| 4.0165        | 0.0245 | 50   | 3.9075          |
+| 3.0887        | 0.0368 | 75   | 2.4186          |
+| 1.3752        | 0.0491 | 100  | 1.4240          |
+| 1.1205        | 0.0613 | 125  | 1.2705          |
+| 1.0809        | 0.0736 | 150  | 1.2144          |
+| 1.0946        | 0.0859 | 175  | 1.1813          |
+| 1.0311        | 0.0982 | 200  | 1.1653          |
+| 1.0611        | 0.1104 | 225  | 1.1503          |
+| 1.0209        | 0.1227 | 250  | 1.1423          |
+| 1.052         | 0.1350 | 275  | 1.1384          |
+| 1.0129        | 0.1472 | 300  | 1.1273          |
+| 0.9764        | 0.1595 | 325  | 1.1218          |
+| 0.9707        | 0.1718 | 350  | 1.1155          |
+| 1.0024        | 0.1840 | 375  | 1.1151          |
+| 1.0446        | 0.1963 | 400  | 1.1112          |
+| 0.9691        | 0.2086 | 425  | 1.1081          |
+| 1.0018        | 0.2209 | 450  | 1.1040          |
+| 0.9806        | 0.2331 | 475  | 1.0989          |
+| 1.0555        | 0.2454 | 500  | 1.0980          |
+| 0.9778        | 0.2577 | 525  | 1.0981          |
+| 0.988         | 0.2699 | 550  | 1.0962          |
+| 0.988         | 0.2822 | 575  | 1.0939          |
+| 0.9572        | 0.2945 | 600  | 1.0969          |
+| 0.9802        | 0.3067 | 625  | 1.0952          |
+| 0.9504        | 0.3190 | 650  | 1.0933          |
+| 1.0194        | 0.3313 | 675  | 1.0948          |
+| 0.9697        | 0.3436 | 700  | 1.0935          |
+| 0.96          | 0.3558 | 725  | 1.0903          |
+| 0.9665        | 0.3681 | 750  | 1.0924          |
+| 0.9895        | 0.3804 | 775  | 1.0920          |
+| 1.004         | 0.3926 | 800  | 1.0914          |
+| 1.0054        | 0.4049 | 825  | 1.0909          |
+| 0.9514        | 0.4172 | 850  | 1.0890          |
+| 0.9996        | 0.4294 | 875  | 1.0906          |
+| 0.99          | 0.4417 | 900  | 1.0896          |
+| 0.9427        | 0.4540 | 925  | 1.0887          |
+| 1.0014        | 0.4663 | 950  | 1.0883          |
+| 0.9639        | 0.4785 | 975  | 1.0864          |
+| 1.0073        | 0.4908 | 1000 | 1.0877          |
+| 0.9895        | 0.5031 | 1025 | 1.0863          |
+| 0.9594        | 0.5153 | 1050 | 1.0841          |
+| 0.9559        | 0.5276 | 1075 | 1.0849          |
+| 1.0034        | 0.5399 | 1100 | 1.0849          |
+| 0.9795        | 0.5521 | 1125 | 1.0844          |
+| 0.9661        | 0.5644 | 1150 | 1.0834          |
+| 0.9533        | 0.5767 | 1175 | 1.0830          |
+| 0.976         | 0.5890 | 1200 | 1.0830          |
+| 0.9932        | 0.6012 | 1225 | 1.0846          |
+| 1.0067        | 0.6135 | 1250 | 1.0861          |
+| 0.9543        | 0.6258 | 1275 | 1.0854          |
+| 0.9733        | 0.6380 | 1300 | 1.0844          |
+| 0.9673        | 0.6503 | 1325 | 1.0837          |
+| 0.9378        | 0.6626 | 1350 | 1.0837          |
+| 0.9713        | 0.6748 | 1375 | 1.0840          |
+| 0.9913        | 0.6871 | 1400 | 1.0838          |
+| 0.9302        | 0.6994 | 1425 | 1.0837          |
+| 0.9873        | 0.7117 | 1450 | 1.0836          |
+| 0.9618        | 0.7239 | 1475 | 1.0835          |
+| 1.0042        | 0.7362 | 1500 | 1.0835          |
+| 0.9627        | 0.7485 | 1525 | 1.0827          |
+| 0.9635        | 0.7607 | 1550 | 1.0827          |
+| 0.9658        | 0.7730 | 1575 | 1.0828          |
+| 0.9446        | 0.7853 | 1600 | 1.0832          |
+| 0.9844        | 0.7975 | 1625 | 1.0833          |
+| 0.9641        | 0.8098 | 1650 | 1.0837          |
+| 1.0           | 0.8221 | 1675 | 1.0835          |
+| 0.9514        | 0.8344 | 1700 | 1.0837          |
+| 1.0094        | 0.8466 | 1725 | 1.0835          |
+| 0.9379        | 0.8589 | 1750 | 1.0834          |
+| 0.9617        | 0.8712 | 1775 | 1.0835          |
+| 0.9674        | 0.8834 | 1800 | 1.0836          |
+| 0.9867        | 0.8957 | 1825 | 1.0838          |
+| 0.9442        | 0.9080 | 1850 | 1.0832          |
+| 0.9603        | 0.9202 | 1875 | 1.0838          |
+| 0.9766        | 0.9325 | 1900 | 1.0833          |
+| 0.9806        | 0.9448 | 1925 | 1.0835          |
+| 0.9676        | 0.9571 | 1950 | 1.0835          |
+| 0.9856        | 0.9693 | 1975 | 1.0838          |
+| 0.9339        | 0.9816 | 2000 | 1.0836          |
+| 0.9553        | 0.9939 | 2025 | 1.0833          |
 ### Framework versions
+- Transformers 4.44.2
+- Pytorch 2.0.1+cu117
 - Tokenizers 0.19.1

generation_config.json CHANGED Viewed

@@ -1,4 +1,4 @@
 {
   "num_beams": 3,
-  "transformers_version": "4.42.0"
 }

 {
   "num_beams": 3,
+  "transformers_version": "4.44.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:64f9bcac28ca87041eb9c091f8e9317aaab192e67f506f0a60abf315a35a2119
 size 1646021682

 version https://git-lfs.github.com/spec/v1
+oid sha256:a5ef8ea56dd4e0b50f3e040b402348281ad3d1acfc49ce45a645e28222489f97
 size 1646021682