marathi-llm/MahaMarathi-7B-v24.01-Base

MahaMarathi 7B, is a domain adapted, continually pre-trained, and instruction fine-tuned native Marathi large language model (LLM) with 7 billion parameters based on Llama2+Mistral, and trained on a large corpus of Marathi text.
महामराठी हे 7अब्ज पॅरामीटर्स वापरून पूर्व-प्रशिक्षित आणि सुचनांच्या आधारे फाईन ट्यून केलेले मूळ मराठी महाभाषा समीकरण संच आहे ज्याला लार्ज लँग्वेज मॉडेल (एल. एल. एम.) असेही म्हणतात. 83 दशलक्षाहून अधिक स्थानिक भाषिकांसाठी तयार केलेले हे कृत्रिम बुद्धिमत्तेवर (ए. आय.) आधारित तंत्रज्ञान आहे जे मराठीतील कठीण संभाषणे आणि सूचना सहज हाताळण्यास सक्षम आहे.
This released model is a base model and not meant to be used as is. It is recommended to first finetune it on downstream tasks that you are interested in.

# Usage
import torch
from transformers import LlamaTokenizer, LlamaForCausalLM

tokenizer = LlamaTokenizer.from_pretrained('marathi-llm/MahaMarathi-7B-v24.01-Base')
model = LlamaForCausalLM.from_pretrained('marathi-llm/MahaMarathi-7B-v24.01-Base', torch_dtype=torch.bfloat16)

prompt = "मी एक ए. आय. द्वारा तयार केलेले महाभाषा समीकरण संच आहे."
inputs = tokenizer(prompt, return_tensors="pt")

# Generate
generate_ids = model.generate(inputs.input_ids, max_length=30)
tokenizer.batch_decode(generate_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False)[0]

Example 1	Example 2

marathi-llm
/

MahaMarathi-7B-v24.01-Base

You need to agree to share your contact information to access this model

Spaces using marathi-llm/MahaMarathi-7B-v24.01-Base 2