Papers
arxiv:2402.16819

Nemotron-4 15B Technical Report

Published on Feb 26
ยท Submitted by akhaliq on Feb 27
#2 Paper of the day
Authors:
,
,
,
,
,
,
,

Abstract

We introduce Nemotron-4 15B, a 15-billion-parameter large multilingual language model trained on 8 trillion text tokens. Nemotron-4 15B demonstrates strong performance when assessed on English, multilingual, and coding tasks: it outperforms all existing similarly-sized open models on 4 out of 7 downstream evaluation areas and achieves competitive performance to the leading open models in the remaining ones. Specifically, Nemotron-4 15B exhibits the best multilingual capabilities of all similarly-sized models, even outperforming models over four times larger and those explicitly specialized for multilingual tasks.

Community

where is the model?

Is this available only thru NVDA Nemo?

IS this an open source model?

Nemotron-4 15B: Exploring the Power of a Cutting-Edge Multilingual Model

Links ๐Ÿ”—:

๐Ÿ‘‰ Subscribe: https://www.youtube.com/@Arxflix
๐Ÿ‘‰ Twitter: https://x.com/arxflix
๐Ÿ‘‰ LMNT (Partner): https://lmnt.com/

By Arxflix
9t4iCUHx_400x400-1.jpg

Sign up or log in to comment

Models citing this paper 3

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2402.16819 in a dataset README.md to link it from this page.

Spaces citing this paper 1

Collections including this paper 7