-
RakutenAI-7B: Extending Large Language Models for Japanese
Paper • 2403.15484 • Published • 12 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 54 -
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Paper • 2404.04167 • Published • 12 -
abhinand/malayalam-llama-7b-instruct-v0.1
Text Generation • Updated • 935 • 11
Collections
Discover the best community collections!
Collections including paper arxiv:2401.01055
-
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 54 -
Improving Text Embeddings with Large Language Models
Paper • 2401.00368 • Published • 79 -
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Paper • 2403.13447 • Published • 18 -
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Paper • 2403.05530 • Published • 60
-
A Simple Framework to Accelerate Multilingual Language Model for Monolingual Text Generation
Paper • 2401.10660 • Published • 2 -
PersianMind: A Cross-Lingual Persian-English Large Language Model
Paper • 2401.06466 • Published • 3 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 54 -
MaLA-500: Massive Language Adaptation of Large Language Models
Paper • 2401.13303 • Published • 11
-
The Impact of Reasoning Step Length on Large Language Models
Paper • 2401.04925 • Published • 15 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 54 -
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper • 2401.05033 • Published • 15 -
Towards Conversational Diagnostic AI
Paper • 2401.05654 • Published • 15
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper • 2401.02038 • Published • 61 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 181 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 54 -
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Paper • 2401.01325 • Published • 26
-
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 54 -
GlórIA -- A Generative and Open Large Language Model for Portuguese
Paper • 2402.12969 • Published -
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Paper • 2404.00399 • Published • 41