Collections

Discover the best community collections!

Collections including paper arxiv:2112.03857

Papers - University - University of Washington

The Curious Case of Neural Text Degeneration

Paper • 1904.09751 • Published Apr 22, 2019 • 3
Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Paper • 2404.01197 • Published Apr 1 • 30
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions

Paper • 1905.10044 • Published May 24, 2019 • 1
PIQA: Reasoning about Physical Commonsense in Natural Language

Paper • 1911.11641 • Published Nov 26, 2019 • 2

Papers - Microsoft

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22 • 32
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

Paper • 2403.19655 • Published Mar 28 • 18
WavLLM: Towards Robust and Adaptive Speech Large Language Model

Paper • 2404.00656 • Published Mar 31 • 10
Enabling Memory Safety of C Programs using LLMs

Paper • 2404.01096 • Published Apr 1 • 1

Papers - Image - Object Detection

End-to-End Object Detection with Transformers

Paper • 2005.12872 • Published May 26, 2020 • 5
COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27
Grounded Language-Image Pre-training

Paper • 2112.03857 • Published Dec 7, 2021 • 3
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 84

Papers - Image - Swin

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Paper • 2103.14030 • Published Mar 25, 2021 • 4
A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images

Paper • 2104.12137 • Published Apr 25, 2021 • 2
Self-Supervised Learning with Swin Transformers

Paper • 2105.04553 • Published May 10, 2021 • 2
Evaluating Transformer-based Semantic Segmentation Networks for Pathological Image Segmentation

Paper • 2108.11993 • Published Aug 26, 2021 • 2

about 11 hours ago

FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation

Paper • 2403.06775 • Published Mar 11 • 3
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Paper • 2010.11929 • Published Oct 22, 2020 • 6
Data Incubation -- Synthesizing Missing Data for Handwriting Recognition

Paper • 2110.07040 • Published Oct 13, 2021 • 2
A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks

Paper • 1811.00056 • Published Oct 31, 2018 • 2

Papers - Image - Bounding Box

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 180
Unifying Vision, Text, and Layout for Universal Document Processing

Paper • 2212.02623 • Published Dec 5, 2022 • 10
Grounded Language-Image Pre-training

Paper • 2112.03857 • Published Dec 7, 2021 • 3
ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model

Paper • 2404.07773 • Published Apr 11 • 1
