TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods • Paper • 2407.21630 • Published Jul 31 • 8
Consent in Crisis: The Rapid Decline of the AI Data Commons • Paper • 2407.14933 • Published Jul 20 • 12
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation • Paper • 2407.13481 • Published Jul 18 • 9
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI • Paper • 2310.16787 • Published Oct 25, 2023 • 5
Probing neural language models for understanding of words of estimative probability • Paper • 2211.03358 • Published Nov 7, 2022 • 1
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars • Paper • 2406.11035 • Published Jun 16 • 1
Zero-shot text classification models • Collection • Collection of the best zero-shot text classification models. Fine-tune them with few examples using LiqFit - https://github.com/Knowledgator/LiqFit. • 9 items • Updated Sep 10 • 9
SuperMC • Collection • Various multiple-choice datasets, for preference learning, focused on reasoning • 19 items • Updated Jan 25 • 1
tasksource: Structured Dataset Preprocessing Annotations for Frictionless Extreme Multi-Task Learning and Evaluation • Paper • 2301.05948 • Published Jan 14, 2023 • 3