- Large Language Models as Optimizers
  Paper • 2309.03409 • Published • 75
- Challenges and Applications of Large Language Models
  Paper • 2307.10169 • Published • 47
- Efficiently Modeling Long Sequences with Structured State Spaces
  Paper • 2111.00396 • Published • 1
- DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning
  Paper • 2006.08381 • Published
Florian von Stosch
flauflauf
AI & ML interests
None yet
Organizations
None yet
Collections
3
- Retentive Network: A Successor to Transformer for Large Language Models
  Paper • 2307.08621 • Published • 170
- Sparks of Artificial General Intelligence: Early experiments with GPT-4
  Paper • 2303.12712 • Published • 2
- GPT-4 Technical Report
  Paper • 2303.08774 • Published • 5
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
  Paper • 2201.11903 • Published • 9
models
None public yet
datasets
None public yet