mikey

RickyPossum

https://90s.zone

AI & ML interests

facefusion, wav2lip, comfy, tts webui, xrrs, topaz, comfy ui

Organizations

None yet

RickyPossum's activity

upvoted 2 papers 2 months ago

Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation

Paper • 2409.03718 • Published Sep 5 • 25

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

upvoted 3 collections 6 months ago

Diffusion model Spaces

315 items • Updated 17 days ago • 31

LLM Spaces

176 items • Updated 11 days ago • 13

Audio Spaces

103 items • Updated 6 days ago • 11

upvoted 7 papers 7 months ago

Taming Latent Diffusion Model for Neural Radiance Field Inpainting

Paper • 2404.09995 • Published Apr 15 • 6

Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization

Paper • 2404.09956 • Published Apr 15 • 11

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15 • 82

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

Paper • 2404.09990 • Published Apr 15 • 12

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14 • 43

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12 • 63

Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video

Paper • 2404.09833 • Published Apr 15 • 29

upvoted a collection 7 months ago

Paper

35 items • Updated May 20 • 2

upvoted a paper 7 months ago

AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages

Paper • 2303.12582 • Published Mar 22, 2023 • 20