Collections
Discover the best community collections!
Collections including paper arxiv:2109.10282
-
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Paper • 2305.02549 • Published • 6 -
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Paper • 2203.08411 • Published • 1 -
More efficient manual review of automatically transcribed tabular data
Paper • 2306.16126 • Published • 1 -
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents
Paper • 2004.12629 • Published • 2
-
Can large language models explore in-context?
Paper • 2403.15371 • Published • 32 -
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Paper • 2403.19655 • Published • 18 -
WavLLM: Towards Robust and Adaptive Speech Large Language Model
Paper • 2404.00656 • Published • 10 -
Enabling Memory Safety of C Programs using LLMs
Paper • 2404.01096 • Published • 1
-
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
Paper • 2107.00652 • Published • 2 -
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Paper • 2403.09622 • Published • 16 -
Veagle: Advancements in Multimodal Representation Learning
Paper • 2403.08773 • Published • 7 -
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Paper • 2304.14178 • Published • 2
-
Disentangling Writer and Character Styles for Handwriting Generation
Paper • 2303.14736 • Published • 2 -
A Transformer Architecture for Online Gesture Recognition of Mathematical Expressions
Paper • 2211.02643 • Published • 2 -
A tailored Handwritten-Text-Recognition System for Medieval Latin
Paper • 2308.09368 • Published • 2 -
Scalable handwritten text recognition system for lexicographic sources of under-resourced languages and alphabets
Paper • 2303.16256 • Published • 2
-
Data Incubation -- Synthesizing Missing Data for Handwriting Recognition
Paper • 2110.07040 • Published • 2 -
A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks
Paper • 1811.00056 • Published • 2 -
Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks
Paper • 2311.17128 • Published • 2 -
Data Generation for Post-OCR correction of Cyrillic handwriting
Paper • 2311.15896 • Published • 3