arxiv:2406.12564

Low-Resource Machine Translation through the Lens of Personalized Federated Learning

Published on Jun 18

· Submitted by

VityaVitalich on Jun 24

Upvote

Authors:

Viktor Moskvoretskii ,

Irina Nikishina

Abstract

We present a new approach based on the Personalized Federated Learning algorithm MeritFed that can be applied to Natural Language Tasks with heterogeneous data. We evaluate it on the Low-Resource Machine Translation task, using the dataset from the Large-Scale Multilingual Machine Translation Shared Task (Small Track #2) and the subset of Sami languages from the multilingual benchmark for Finno-Ugric languages. In addition to its effectiveness, MeritFed is also highly interpretable, as it can be applied to track the impact of each language used for training. Our analysis reveals that target dataset size affects weight distribution across auxiliary languages, that unrelated languages do not interfere with the training, and auxiliary optimizer parameters have minimal impact. Our approach is easy to apply with a few lines of code, and we provide scripts for reproducing the experiments at https://github.com/VityaVitalich/MeritFed

View arXiv page View PDF Add to collection

Community

VityaVitalich

Paper author Paper submitter Jun 24

Federated Learning helps benefit from heterogeneous data by dynamically weighting data sources! We tested it on Machine Translation to Low-Resource languages and found it to be superior. Moreover it is highly interpretable with just tracking the weights.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2406.12564 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2406.12564 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2406.12564 in a Space README.md to link it from this page.