Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis • Paper • 2401.09048 • Published Jan 17
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model • Paper • 2401.09417 • Published Jan 17
General Object Foundation Model for Images and Videos at Scale • Paper • 2312.09158 • Published Dec 14, 2023
FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection • Paper • 2312.09252 • Published Dec 14, 2023
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects • Paper • 2312.08344 • Published Dec 13, 2023