Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2311.06668

Papers - University - Stanford University

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Paper • 2403.18421 • Published Mar 27 • 22
Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27 • 24
stanford-crfm/BioMedLM

Text Generation • Updated Mar 28 • 2.42k • 394
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 47

Papers - ICL - In-Context Learning

Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models

Paper • 2311.00871 • Published Nov 1, 2023 • 2
Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22 • 32
Data Distributional Properties Drive Emergent In-Context Learning in Transformers

Paper • 2205.05055 • Published Apr 22, 2022 • 2
Long-context LLMs Struggle with Long In-context Learning

Paper • 2404.02060 • Published Apr 2 • 35

Papers - Text - Classification

LLM-Assisted Content Analysis: Using Large Language Models to Support Deductive Coding

Paper • 2306.14924 • Published Jun 23, 2023 • 2
When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes

Paper • 2404.12365 • Published Apr 18 • 1
In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering

Paper • 2311.06668 • Published Nov 11, 2023 • 5

Papers - Fine-tuning - LoRA

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Paper • 2310.20587 • Published Oct 31, 2023 • 16
MedAlpaca -- An Open-Source Collection of Medical Conversational AI Models and Training Data

Paper • 2304.08247 • Published Apr 14, 2023 • 2
S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Paper • 2311.03285 • Published Nov 6, 2023 • 28
WavLLM: Towards Robust and Adaptive Speech Large Language Model

Paper • 2404.00656 • Published Mar 31 • 10

Papers - Fine-tuning

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Paper • 2310.20587 • Published Oct 31, 2023 • 16
SELF: Language-Driven Self-Evolution for Large Language Model

Paper • 2310.00533 • Published Oct 1, 2023 • 2
QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 45
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

Paper • 2309.14717 • Published Sep 26, 2023 • 44

Papers - Text - Research

An Interdisciplinary Comparison of Sequence Modeling Methods for Next-Element Prediction

Paper • 1811.00062 • Published Oct 31, 2018 • 2
mT5: A massively multilingual pre-trained text-to-text transformer

Paper • 2010.11934 • Published Oct 22, 2020 • 4
Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance

Paper • 2310.10021 • Published Oct 16, 2023 • 2
Gemma: Open Models Based on Gemini Research and Technology

Paper • 2403.08295 • Published Mar 13 • 47

Foundation Models and Tools

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 76
bigcode/starcoder2-15b

Text Generation • Updated Jun 5 • 22.9k • • 568
Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 121
mixedbread-ai/mxbai-rerank-large-v1

Text Classification • Updated Jul 22 • 24.4k • 105

Previous
1
2
3
4
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs