Leng Sicong's picture

Leng Sicong

Sicong

·

AI & ML interests

None yet

Organizations

Sicong's activity

upvoted a paper about 16 hours ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published 4 days ago • 39

upvoted a collection 15 days ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Sep 25 • 615

upvoted a paper 19 days ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published 21 days ago • 88

upvoted a paper 20 days ago

Mitigating Object Hallucination via Concentric Causal Attention

Paper • 2410.15926 • Published 23 days ago • 14

upvoted a paper 27 days ago

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published 27 days ago • 30

upvoted a paper about 1 month ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3 • 36

upvoted 2 papers 4 months ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29 • 54

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 92

upvoted a collection 5 months ago

VideoLLaMA 2

Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability • 15 items • Updated 9 days ago • 21

upvoted a paper 5 months ago

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Paper • 2406.07476 • Published Jun 11 • 32

upvoted 2 papers 10 months ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2 • 64

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Paper • 2401.00849 • Published Jan 1 • 14

upvoted a paper 11 months ago

Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Paper • 2311.16922 • Published Nov 28, 2023 • 1