bert_topic_model / README.md
beccabai's picture
Update README.md
c674353 verified
|
raw
history blame contribute delete
No virus
454 Bytes
metadata
language:
  - en

This repo contains the BERT-Topic classifier of the work Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.

For topic and label: 'activity': 0, 'education': 1, 'entertainment': 2, 'finance': 3, 'health': 4, 'business and industrial ': 5, 'infrastructure': 6, 'literature and art': 7, 'nature': 8, 'others': 9, 'law and government': 10, 'networking': 11, 'technology': 12