zeng xian's picture

zeng xian

themez

·

themez

AI & ML interests

None yet

Organizations

themez's activity

upvoted 2 papers 3 months ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13 • 65

Generative Photomontage

Paper • 2408.07116 • Published Aug 13 • 19

upvoted a paper 5 months ago

How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Paper • 2406.11813 • Published Jun 17 • 30

upvoted 3 papers 7 months ago

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12 • 63

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 26

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

Paper • 2401.13919 • Published Jan 25 • 26

upvoted a collection 9 months ago

Sora参考论文

OpenAI "Video generation models as world simulators"技术报告后面的参考论文，总共32篇。OpenAI的ImageGPT和Dalle3这两篇缺失，链接已补充到note中。 • 32 items • Updated Feb 18 • 54

upvoted a paper 10 months ago

Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation

Paper • 2401.15688 • Published Jan 28 • 11

upvoted 3 papers 11 months ago

Point Transformer V3: Simpler, Faster, Stronger

Paper • 2312.10035 • Published Dec 15, 2023 • 17

DREAM: Diffusion Rectification and Estimation-Adaptive Models

Paper • 2312.00210 • Published Nov 30, 2023 • 14

FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting

Paper • 2312.00451 • Published Dec 1, 2023 • 9

upvoted 4 papers 12 months ago

Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models

Paper • 2311.12092 • Published Nov 20, 2023 • 21

Make Pixels Dance: High-Dynamic Video Generation

Paper • 2311.10982 • Published Nov 18, 2023 • 68

Story-to-Motion: Synthesizing Infinite and Controllable Character Animation from Long Text

Paper • 2311.07446 • Published Nov 13, 2023 • 28

GOAT: GO to Any Thing

Paper • 2311.06430 • Published Nov 10, 2023 • 14

upvoted 5 papers about 1 year ago

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 77

CodePlan: Repository-level Coding using LLMs and Planning

Paper • 2309.12499 • Published Sep 21, 2023 • 73

A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models

Paper • 2309.11674 • Published Sep 20, 2023 • 31

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 82

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 22