Inference - a Stalin16 Collection

Stalin16 's Collections

Gen AI Diffusion

Inference

updated about 14 hours ago

The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines

Paper • 2408.01050 • Published Aug 2 • 8
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6 • 33
Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4 • 72
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance

Paper • 2409.04593 • Published Sep 6 • 22
From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents

Paper • 2409.03512 • Published Sep 5 • 26
Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for Political Text

Paper • 2409.02078 • Published Sep 3 • 8
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29 • 52
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder

Paper • 2409.08248 • Published Sep 12 • 13
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering

Paper • 2409.06595 • Published Sep 10 • 37
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18 • 36
Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey

Paper • 2409.11564 • Published Sep 17 • 19
Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study

Paper • 2409.17580 • Published Sep 26 • 7
Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30 • 53
Illustrious: an Open Advanced Illustration Model

Paper • 2409.19946 • Published Sep 30 • 13
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models

Paper • 2409.18943 • Published Sep 27 • 26
SLM: Bridge the thin gap between speech and text foundation models

Paper • 2310.00230 • Published Sep 30, 2023
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation

Paper • 2410.01731 • Published Oct 2 • 15
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 9 days ago • 100
Agent-as-a-Judge: Evaluate Agents with Agents

Paper • 2410.10934 • Published Oct 14 • 10
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Paper • 2410.09732 • Published Oct 13 • 54
Analyzing The Language of Visual Tokens

Paper • 2411.05001 • Published 9 days ago • 19
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Paper • 2410.10783 • Published Oct 14 • 25
Intriguing Properties of Large Language and Vision Models

Paper • 2410.04751 • Published Oct 7 • 16
Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition

Paper • 2410.05603 • Published Oct 8 • 11
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published 10 days ago • 55
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published 12 days ago • 44
Survey of Cultural Awareness in Language Models: Text and Beyond

Paper • 2411.00860 • Published 17 days ago • 23
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models

Paper • 2411.00492 • Published 15 days ago • 5
Personalization of Large Language Models: A Survey

Paper • 2411.00027 • Published 18 days ago • 31
Survey of User Interface Design and Interaction Techniques in Generative AI Applications

Paper • 2410.22370 • Published 18 days ago • 11
Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks

Paper • 2410.24032 • Published 16 days ago • 8
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

Paper • 2410.21465 • Published 19 days ago • 10
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

Paper • 2410.21169 • Published 19 days ago • 29
Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance

Paper • 2410.18889 • Published 23 days ago • 15
Counting Ability of Large Language Models and Impact of Tokenization

Paper • 2410.19730 • Published 22 days ago • 10
Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published 26 days ago • 54
Looking Inward: Language Models Can Learn About Themselves by Introspection

Paper • 2410.13787 • Published 30 days ago • 5
JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published about 1 month ago • 42
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Paper • 2410.12705 • Published about 1 month ago • 29
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

Paper • 2410.13639 • Published 30 days ago • 16
Remember, Retrieve and Generate: Understanding Infinite Visual Concepts as Your Personalized Assistant

Paper • 2410.13360 • Published about 1 month ago • 8
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published about 1 month ago • 30
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Paper • 2410.12628 • Published about 1 month ago • 26
Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?

Paper • 2411.05000 • Published 9 days ago • 20
Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published 3 days ago • 19