-
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 65 -
Beyond Document Page Classification: Design, Datasets, and Challenges
Paper • 2308.12896 • Published • 1 -
microsoft/OmniParser
Image-Text-to-Text • Updated • 5.96k • 1.13k
krishna praveen
krishnapraveen
AI & ML interests
None yet
Organizations
None yet
Collections
4
-
Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Paper • 2407.15762 • Published • 9 -
HuggingFaceTB/SmolLM-135M
Text Generation • Updated • 38.6k • 171 -
MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model
Paper • 2408.10198 • Published • 32 -
fishaudio/fish-speech-1.4
Text-to-Speech • Updated • 10.7k • 414
models
None public yet
datasets
None public yet