matlok's Collections
Papers - Microsoft
Can large language models explore in-context?
Paper
•
2403.15371
•
Published
•
32
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Paper
•
2403.19655
•
Published
•
18
WavLLM: Towards Robust and Adaptive Speech Large Language Model
Paper
•
2404.00656
•
Published
•
10
Enabling Memory Safety of C Programs using LLMs
Paper
•
2404.01096
•
Published
•
1
LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models
Paper
•
2404.01617
•
Published
•
6
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
Paper
•
2204.08387
•
Published
•
2
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding
Paper
•
2012.14740
•
Published
•
1
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Paper
•
1912.13318
•
Published
•
2
PIQA: Reasoning about Physical Commonsense in Natural Language
Paper
•
1911.11641
•
Published
•
2
Are NLP Models really able to Solve Simple Math Word Problems?
Paper
•
2103.07191
•
Published
•
1
Learning From Mistakes Makes LLM Better Reasoner
Paper
•
2310.20689
•
Published
•
28
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Paper
•
2306.02707
•
Published
•
46
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
Paper
•
2109.10282
•
Published
•
6
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis
Paper
•
2404.03204
•
Published
•
7
LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Paper
•
2404.03118
•
Published
•
23
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper
•
2404.03715
•
Published
•
60
Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Paper
•
2404.06209
•
Published
•
4
Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models
Paper
•
2404.03622
•
Published
•
4
Rho-1: Not All Tokens Are What You Need
Paper
•
2404.07965
•
Published
•
84
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models
Paper
•
2404.07738
•
Published
•
2
GLIGEN: Open-Set Grounded Text-to-Image Generation
Paper
•
2301.07093
•
Published
•
3
Grounded Language-Image Pre-training
Paper
•
2112.03857
•
Published
•
3
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper
•
2404.14219
•
Published
•
251
Multi-Head Mixture-of-Experts
Paper
•
2404.15045
•
Published
•
59
Deep Residual Learning for Image Recognition
Paper
•
1512.03385
•
Published
•
6
You Only Cache Once: Decoder-Decoder Architectures for Language Models
Paper
•
2405.05254
•
Published
•
9
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
Paper
•
2406.08407
•
Published
•
24
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper
•
2311.06242
•
Published
•
84
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Paper
•
2309.03883
•
Published
•
33
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper
•
2407.09025
•
Published
•
128
Paper
•
2410.05258
•
Published
•
165