Synthetic Data - a imjliao Collection

imjliao 's Collections

Agent

Prompt

Entity

Information Retrieval

QA

Document Information Extraction

MLLM

AIF

Models

Synthetic Data

updated Oct 23, 2023

Data enrichment methods for pre-training and fine-tuning

Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 77
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Paper • 2309.00267 • Published Sep 1, 2023 • 47
Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 40
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor

Paper • 2212.09689 • Published Dec 19, 2022 • 1
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models

Paper • 2310.13127 • Published Oct 19, 2023 • 11
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models

Paper • 2310.13671 • Published Oct 20, 2023 • 18