Richrich's picture

Richrich

RichardForests

·

AI & ML interests

None yet

Organizations

RichardForests's activity

upvoted a paper 2 days ago

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs

Paper • 2210.14986 • Published Oct 26, 2022 • 5

upvoted a paper 4 months ago

KAN or MLP: A Fairer Comparison

Paper • 2407.16674 • Published Jul 23 • 41

upvoted an article 4 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 90

upvoted 7 papers 5 months ago

Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models

Paper • 2406.13099 • Published Jun 18 • 4

ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning

Paper • 2406.14130 • Published Jun 20 • 10

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14 • 18

GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks

Paper • 2406.12925 • Published Jun 14 • 22

Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities

Paper • 2406.14562 • Published Jun 20 • 27

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs

Paper • 2406.14544 • Published Jun 20 • 34

Diffusion Model Alignment Using Direct Preference Optimization

Paper • 2311.12908 • Published Nov 21, 2023 • 47

upvoted 10 papers 6 months ago

DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Paper • 2405.14224 • Published May 23 • 12

Dense Connector for MLLMs

Paper • 2405.13800 • Published May 22 • 21

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23 • 16

AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability

Paper • 2405.14129 • Published May 23 • 12

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Paper • 2405.08707 • Published May 14 • 27

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16 • 126

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 87

Toon3D: Seeing Cartoons from a New Perspective

Paper • 2405.10320 • Published May 16 • 19

Observational Scaling Laws and the Predictability of Language Model Performance

Paper • 2405.10938 • Published May 17 • 11

Layer-Condensed KV Cache for Efficient Inference of Large Language Models

Paper • 2405.10637 • Published May 17 • 19