Junyang Lin's picture

Junyang Lin

JustinLin610

·

https://justinlin610.github.io

AI & ML interests

Pretraining, NLP, CV, etc.

Recent Activity

liked a Space 3 days ago

Qwen/Qwen2.5-Turbo-1M-Demo

updated a model 3 days ago

Qwen/Qwen2.5-Coder-14B-Instruct-AWQ

updated a model 3 days ago

Qwen/Qwen2.5-Coder-14B-Instruct-GPTQ-Int8

Organizations

JustinLin610's activity

upvoted 3 papers 12 days ago

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Paper • 2411.04928 • Published 14 days ago • 47

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published 14 days ago • 63

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 14 days ago • 108

upvoted 2 collections 2 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 218

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 369

upvoted a collection 5 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 347

upvoted 2 papers 8 months ago

Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings

Paper • 2403.07750 • Published Mar 12 • 21

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Paper • 2403.07508 • Published Mar 12 • 75

upvoted a paper 9 months ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 603

upvoted 2 collections 9 months ago

Qwen-1.5-Exl2

18 items • Updated 9 days ago • 2

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Sep 18 • 206

upvoted a paper 10 months ago

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

Paper • 2311.03099 • Published Nov 6, 2023 • 28

upvoted 3 papers about 1 year ago

Qwen Technical Report

Paper • 2309.16609 • Published Sep 28, 2023 • 34

Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Paper • 2308.01825 • Published Aug 3, 2023 • 21

YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 65

upvoted 3 papers over 1 year ago

Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese

Paper • 2211.01335 • Published Nov 2, 2022 • 1

Self-consistency for open-ended generations

Paper • 2307.06857 • Published Jul 11, 2023 • 9

One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning

Paper • 2306.07967 • Published Jun 13, 2023 • 24