Yongxin-Guo committed on
Commit
29b83c7
1 Parent(s): e2948a7

Create README.md

Files changed (1)
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
+ ---
+ license: apache-2.0
+ language:
+ - en
+ base_model:
+ - mistralai/Mistral-7B-Instruct-v0.2
+ tags:
+ - video temporal grounding
+ - dense video caption
+ - video highlight detection
+ ---
+
+
+
+ ## Overview
+
+ In this work:
+ - We model videos as a series of events and propose a causal event modeling framework to capture the inherent structure of videos.
+ - We present TRACE, a novel task-interleaved video LLM tailored to implement the causal event modeling framework through the sequential encoding and decoding of timestamps, salient scores, and textual captions.
+
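As a rough illustration of the second point, the sketch below represents a video as an ordered list of events and flattens them into one interleaved sequence of timestamps, salient scores, and captions. The `Event` class, the `interleave` helper, the `<time>`/`<score>`/`<caption>` marker strings, and the example values are all hypothetical and chosen for readability; they are not TRACE's actual tokenization or API.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Event:
    """One video event (hypothetical container, not part of TRACE)."""
    start: float   # event start time in seconds
    end: float     # event end time in seconds
    score: float   # salient score (scale assumed for illustration)
    caption: str   # textual description of the event


def interleave(events: List[Event]) -> List[str]:
    """Flatten events, ordered by start time, into a single sequence.

    Each event contributes its timestamps, then its score, then its
    caption, so every event appears after all earlier events.
    """
    sequence: List[str] = []
    for event in sorted(events, key=lambda e: e.start):
        sequence.append(f"<time> {event.start:.1f} {event.end:.1f}")
        sequence.append(f"<score> {event.score:.1f}")
        sequence.append(f"<caption> {event.caption}")
    return sequence


# Made-up events, deliberately given out of order.
events = [
    Event(12.0, 20.5, 3.0, "add the eggs"),
    Event(0.0, 11.5, 4.0, "chop the onions"),
]
for item in interleave(events):
    print(item)
```

Ordering by start time is the design choice that makes the sequence "causal" in this sketch: anything generated for a later event can depend on everything emitted for earlier ones.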
+ ## Model Zoo
+
+ | Checkpoint | Description | URL |
+ | ----------- | ----------- | ----------- |
+ | Initialization | Weights initialized from VideoLLaMA2 | [trace-init](https://huggingface.co/Yongxin-Guo/trace-init) |
+ | Stage-1 | Checkpoint after stage-1 training | [trace-stage1](https://huggingface.co/Yongxin-Guo/trace-stage1) |
+ | Stage-2 | Checkpoint after stage-2 training | [trace](https://huggingface.co/Yongxin-Guo/trace) |
+
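The checkpoints in the table can presumably be fetched like any other Hugging Face Hub repository. The sketch below is one way to do that with `snapshot_download` from the `huggingface_hub` package. The stage keys (`init`, `stage1`, `stage2`) and the helper names are assumptions made for this example; only the repository IDs come from the table.

```python
# Repository IDs taken from the Model Zoo table; the dictionary keys are
# hypothetical short names used only in this example.
CHECKPOINTS = {
    "init": "Yongxin-Guo/trace-init",
    "stage1": "Yongxin-Guo/trace-stage1",
    "stage2": "Yongxin-Guo/trace",
}


def checkpoint_repo(stage: str) -> str:
    """Return the Hub repository ID for a training stage."""
    try:
        return CHECKPOINTS[stage]
    except KeyError:
        known = ", ".join(sorted(CHECKPOINTS))
        raise ValueError(f"unknown stage {stage!r}; expected one of: {known}")


def download_checkpoint(stage: str) -> str:
    """Download every file of the checkpoint and return the local folder path.

    Requires `pip install huggingface_hub` and network access.
    """
    # Imported lazily so the lookup helper works without the package.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=checkpoint_repo(stage))


print(checkpoint_repo("stage2"))
```

Calling `download_checkpoint("stage2")` would place the final checkpoint in the local Hub cache. Loading the weights into a model is outside this sketch and depends on the authors' code.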
+ #### Results
+
+ | YouCook2 (Zero-Shot) | CIDEr | METEOR | SODA_c | F1 |
+ | --- | --- | --- | --- | --- |
+ | TRACE | 8.1 | 2.8 | 2.2 | 22.4 |
+
+ | Charades-STA (Zero-Shot) | R@1 (IoU=0.3) | R@1 (IoU=0.5) | R@1 (IoU=0.7) | mIoU |
+ | --- | --- | --- | --- | --- |
+ | TRACE | 58.6 | 40.3 | 19.4 | 38.7 |
+
+ | QVHighlights (Zero-Shot) | mAP | Hit@1 |
+ | --- | --- | --- |
+ | TRACE | 26.8 | 42.7 |
+
+ | ActivityNet-DVC | CIDEr | METEOR | SODA_c | F1 |
+ | --- | --- | --- | --- | --- |
+ | TRACE | 25.9 | 6.0 | 6.4 | 39.3 |
+
+ | ActivityNet-MR | R@1 (IoU=0.3) | R@1 (IoU=0.5) | R@1 (IoU=0.7) | mIoU |
+ | --- | --- | --- | --- | --- |
+ | TRACE | 53.0 | 37.7 | 24.0 | 39.0 |
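For readers unfamiliar with the grounding metrics, the sketch below shows how numbers of this kind are typically computed. It assumes that the IoU-threshold columns (0.3, 0.5, 0.7) in the Charades-STA and ActivityNet-MR tables report the percentage of queries whose predicted segment overlaps the ground truth with at least that temporal IoU, and that mIoU is the mean IoU over queries. This is the common convention for moment retrieval, not something the README states, and the function names and example segments are made up.

```python
from typing import Dict, List, Sequence, Tuple

Span = Tuple[float, float]  # (start, end) in seconds


def temporal_iou(pred: Span, gt: Span) -> float:
    """Intersection over union of two time intervals."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def grounding_metrics(
    preds: List[Span],
    gts: List[Span],
    thresholds: Sequence[float] = (0.3, 0.5, 0.7),
) -> Dict[str, float]:
    """Recall at each IoU threshold and mean IoU, all as percentages."""
    if not preds or len(preds) != len(gts):
        raise ValueError("preds and gts must be non-empty and equally long")
    ious = [temporal_iou(p, g) for p, g in zip(preds, gts)]
    metrics = {
        f"R@{t}": 100.0 * sum(iou >= t for iou in ious) / len(ious)
        for t in thresholds
    }
    metrics["mIoU"] = 100.0 * sum(ious) / len(ious)
    return metrics


# Made-up predictions and ground truths for three queries.
preds = [(0.0, 10.0), (5.0, 15.0), (20.0, 30.0)]
gts = [(0.0, 10.0), (10.0, 20.0), (40.0, 50.0)]
for name, value in grounding_metrics(preds, gts).items():
    print(f"{name}: {value:.1f}")
```

With one top prediction per query, as here, recall at a threshold is simply the fraction of queries that clear it.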