setareh z's picture

2 24 2

setareh z

setareh1

·

AI & ML interests

None yet

Organizations

setareh1's activity

upvoted 2 papers 5 months ago

NPGA: Neural Parametric Gaussian Avatars

Paper • 2405.19331 • Published May 29 • 10

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

Paper • 2405.19893 • Published May 30 • 29

upvoted a paper 7 months ago

Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video

Paper • 2404.09833 • Published Apr 15 • 29

upvoted 6 papers 8 months ago

MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Paper • 2403.11207 • Published Mar 17 • 14

3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos

Paper • 2403.01444 • Published Mar 3 • 4

TripoSR: Fast 3D Object Reconstruction from a Single Image

Paper • 2403.02151 • Published Mar 4 • 12

OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Paper • 2403.01779 • Published Mar 4 • 28

StableDrag: Stable Dragging for Point-based Image Editing

Paper • 2403.04437 • Published Mar 7 • 25

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

Paper • 2403.05034 • Published Mar 8 • 20

upvoted 2 papers 9 months ago

Graph Mamba: Towards Learning on Graphs with State Space Models

Paper • 2402.08678 • Published Feb 13 • 13

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

Paper • 2401.17053 • Published Jan 30 • 30

upvoted 7 papers about 1 year ago

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Paper • 2310.14566 • Published Oct 23, 2023 • 25

3D-GPT: Procedural 3D Modeling with Large Language Models

Paper • 2310.12945 • Published Oct 19, 2023 • 57

Video Language Planning

Paper • 2310.10625 • Published Oct 16, 2023 • 9

Table-GPT: Table-tuned GPT for Diverse Table Tasks

Paper • 2310.09263 • Published Oct 13, 2023 • 39

Drag View: Generalizable Novel View Synthesis with Unposed Imagery

Paper • 2310.03704 • Published Oct 5, 2023 • 7

DragNUWA: Fine-grained Control in Video Generation by Integrating Text, Image, and Trajectory

Paper • 2308.08089 • Published Aug 16, 2023 • 21

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

Paper • 2308.06721 • Published Aug 13, 2023 • 29

upvoted 2 papers over 1 year ago

MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Paper • 2307.16449 • Published Jul 31, 2023 • 15

Interpolating between Images with Diffusion Models

Paper • 2307.12560 • Published Jul 24, 2023 • 19