jadechoghari committed on
Commit
7362041
1 Parent(s): ff19e29
.gradio/cached_examples/13/Image Output/95caf8d5a60bd2528198/image.webp ADDED
.gradio/cached_examples/13/indices.csv ADDED
@@ -0,0 +1 @@
+ 3
.gradio/cached_examples/13/log.csv ADDED
@@ -0,0 +1,120 @@
+ Image Output,Parsed screen elements,timestamp
+ "{""path"": "".gradio/cached_examples/13/Image Output/95caf8d5a60bd2528198/image.webp"", ""url"": ""/gradio_api/file=/tmp/gradio/44a88abb1387b1884333fe3086277174feca52b852454c46389bd04589700217/image.webp"", ""size"": null, ""orig_name"": ""image.webp"", ""mime_type"": null, ""is_stream"": false, ""meta"": {""_type"": ""gradio.FileData""}}","Text Box ID 0: Task Manager
+ Text Box ID 1: storage
+ Text Box ID 2: InPrivate
+ Text Box ID 3: Google
+ Text Box ID 4: https:/ WWW.googlecom
+ Text Box ID 5: Processes
+ Text Box ID 6: Run new task
+ Text Box ID 7: End task
+ Text Box ID 8: New folder
+ Text Box ID 9: Google
+ Text Box ID 10: finetune/ldm-ft__
+ Text Box ID 11: Gradio
+ Text Box ID 12: Pipelines
+ Text Box ID 13: Recent
+ Text Box ID 14: haotian-liu/LLaVA:
+ Text Box ID 15: Processes
+ Text Box ID 16: 67%
+ Text Box ID 17: 5496
+ Text Box ID 18: Status
+ Text Box ID 19: CPU
+ Text Box ID 20: Memory
+ Text Box ID 21: Disk
+ Text Box ID 22: About
+ Text Box ID 23: Store
+ Text Box ID 24: Gmail
+ Text Box ID 25: Images
+ Text Box ID 26: Sign in
+ Text Box ID 27: Performance
+ Text Box ID 28: Microscft
+ Text Box ID 29: 15.686
+ Text Box ID 30: 1,829,9 MB
+ Text Box ID 31: MBIs
+ Text Box ID 32: Microsoft Team;
+ Text Box ID 33: 142.9MB
+ Text Box ID 34: MBIs
+ Text Box ID 35: App history
+ Text Box ID 36: Microsoft Azure Storage Explo_
+ Text Box ID 37: Efficiency_
+ Text Box ID 38: 0.886
+ Text Box ID 39: 245,0 MB
+ Text Box ID 40: MBYs
+ Text Box ID 41: Startup apps
+ Text Box ID 42: WebViewz Manager
+ Text Box ID 43: 75,9MB
+ Text Box ID 44: MBYs
+ Text Box ID 45: Users
+ Text Box ID 46: Service Host: Storage Service
+ Text Box ID 47: 096
+ Text Box ID 48: 1,1 MB
+ Text Box ID 49: MBYs
+ Text Box ID 50: Details
+ Text Box ID 51: Services
+ Text Box ID 52: Google
+ Text Box ID 53: Google Search
+ Text Box ID 54: Feeling Lucky
+ Text Box ID 55: Discover
+ Text Box ID 56: the ways Chrome keeps you safe while you browse
+ Text Box ID 57: Our third decade of climate action: join us
+ Text Box ID 58: Settings
+ Text Box ID 59: Advertising
+ Text Box ID 60: Business
+ Text Box ID 61: How Search works
+ Text Box ID 62: Privacy
+ Text Box ID 63: Terms
+ Text Box ID 64: Settings
+ Text Box ID 65: 3.53 PM
+ Text Box ID 66: Search
+ Text Box ID 67: Microsoft
+ Text Box ID 68: 10/25/2024
+ Text Box ID 69: Edge
+ Icon Box ID 70: Microsoft Edge browser.
+ Icon Box ID 71: Microsoft 365.
+ Icon Box ID 72: Image
+ Icon Box ID 73: Image
+ Icon Box ID 74: Microsoft Edge browser.
+ Icon Box ID 75: Microsoft Edge browser.
+ Icon Box ID 76: Teams.
+ Icon Box ID 77: Uncomm&ent Selection
+ Icon Box ID 78: Microsoft OneNote.
+ Icon Box ID 79: Find
+ Icon Box ID 80: Microsoft Outlook.
+ Icon Box ID 81: Image
+ Icon Box ID 82: Maximize
+ Icon Box ID 83: Close
+ Icon Box ID 84: Dictate
+ Icon Box ID 85: Line Spacing
+ Icon Box ID 86: Five-point star
+ Icon Box ID 87: a search function.
+ Icon Box ID 88: Increase
+ Icon Box ID 89: More options
+ Icon Box ID 90: the Windows operating system.
+ Icon Box ID 91: Hyperlink
+ Icon Box ID 92: App launcher or menu.
+ Icon Box ID 93: Health monitoring
+ Icon Box ID 94: Microsoft Outlook.
+ Icon Box ID 95: minimizing a window.
+ Icon Box ID 96: uBlock Origin (Ctrl+T)
+ Icon Box ID 97: Undo
+ Icon Box ID 98: Pentagon
+ Icon Box ID 99: Settings
+ Icon Box ID 100: 1.0%
+ Icon Box ID 101: Back
+ Icon Box ID 102: Rectangle
+ Icon Box ID 103: Redo
+ Icon Box ID 104: opening a folder.
+ Icon Box ID 105: Justified
+ Icon Box ID 106: Label
+ Icon Box ID 107: Maximize window
+ Icon Box ID 108: Close
+ Icon Box ID 109: Close
+ Icon Box ID 110: Google Chrome web browser.
+ Icon Box ID 111: a loading or progress bar.
+ Icon Box ID 112: M0,0L9,0 4.5,5z
+ Icon Box ID 113: More options
+ Icon Box ID 114: a loading or progress bar.
+ Icon Box ID 115: Minimize
+ Icon Box ID 116: Undo
+ Icon Box ID 117: 0%
+ Icon Box ID 118: Draw Functions",2024-10-31 20:12:10.050642
app.py CHANGED
@@ -12,6 +12,8 @@ from utils import check_ocr_box, get_yolo_model, get_caption_model_processor, ge
  import torch
  from PIL import Image

+ # Model source: https://huggingface.co/microsoft/OmniParser
+ # gr.load("models/microsoft/OmniParser").launch()
  yolo_model = get_yolo_model(model_path='weights/icon_detect/best.pt')
  caption_model_processor = get_caption_model_processor(model_name="florence2", model_name_or_path="weights/icon_caption_florence")
  platform = 'pc'
@@ -58,8 +60,8 @@ DEVICE = torch.device('cuda')
  @spaces.GPU
  def process(
      image_input,
-     box_threshold,
-     iou_threshold
+     box_threshold=0.01,
+     iou_threshold=0.01
  ) -> Optional[Image.Image]:

      image_save_path = 'imgs/saved_image_demo.png'
imgs/saved_image_demo.png CHANGED
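
Note on reading the cached log: the `log.csv` added in this commit stores the whole "Parsed screen elements" output as one quoted, multi-line CSV field, so splitting the file on newlines will not recover rows. A minimal sketch of reading it with Python's standard `csv` module follows. The helper names (`parse_elements`, `read_log`) and the shortened `SAMPLE` text are hypothetical and not part of the repository; the regular expression assumes every element line follows the `Text Box ID N: label` / `Icon Box ID N: label` shape seen in the diff.

```python
import csv
import io
import re

# Assumed shape of one element line, e.g. "Icon Box ID 70: Microsoft Edge browser."
ELEMENT_RE = re.compile(r"^(Text|Icon) Box ID (\d+): (.*)$")


def parse_elements(field):
    """Split the multi-line field into (kind, box_id, label) tuples."""
    elements = []
    for line in field.splitlines():
        match = ELEMENT_RE.match(line.strip())
        if match:
            kind, box_id, label = match.groups()
            elements.append((kind, int(box_id), label))
    return elements


def read_log(text):
    """Parse log.csv text; the csv module handles newlines inside quoted fields."""
    reader = csv.DictReader(io.StringIO(text, newline=""))
    return [parse_elements(row["Parsed screen elements"]) for row in reader]


# Shortened, hypothetical stand-in for the real log.csv contents.
SAMPLE = (
    'Image Output,Parsed screen elements,timestamp\n'
    '"{""path"": ""image.webp""}","Text Box ID 0: Task Manager\n'
    'Text Box ID 1: storage\n'
    'Icon Box ID 2: Close",2024-10-31 20:12:10.050642\n'
)

rows = read_log(SAMPLE)
for kind, box_id, label in rows[0]:
    print(kind, box_id, label)
```

Lines that do not match the assumed pattern are skipped silently, so stricter handling may be needed if the element format changes.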