Daniil Vodolazsky

s231644

AI & ML interests

None yet

Organizations

None yet

s231644's activity

upvoted 4 papers 10 days ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published 17 days ago • 72

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Paper • 2409.00509 • Published 19 days ago • 38

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Paper • 2409.02897 • Published 15 days ago • 42

mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding

Paper • 2409.03420 • Published 15 days ago • 23

upvoted a paper 24 days ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published 28 days ago • 109

upvoted 3 papers 27 days ago

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17 • 51

Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Paper • 2408.12570 • Published 28 days ago • 29

The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design

Paper • 2408.12503 • Published 28 days ago • 20

upvoted a paper about 1 month ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12 • 55

upvoted 6 papers 2 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 149

BM25S: Orders of magnitude faster lexical search via eager sparse scoring

Paper • 2407.03618 • Published Jul 4 • 10

Vision language models are blind

Paper • 2407.06581 • Published Jul 9 • 80

Characterizing Prompt Compression Methods for Long Context Inference

Paper • 2407.08892 • Published Jul 11 • 9

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 122

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 153

upvoted 2 papers 3 months ago

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27 • 8

Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

Paper • 2406.19263 • Published Jun 27 • 9