license: apache-2.0
base_model: mistralai/Mistral-Nemo-Base-2407
tags:
- general-purpose
- text-generation
Astra-v1-12B
Astra-v1-12B is a fine-tuned version of the base model Mistral-Nemo-Base-2407, developed for general-purpose natural language processing tasks. It was fine-tuned to replicate the quality and style of Claude 3's Sonnet and Opus models.
Model Details
Model Description
Astra-v1-12B is a general-purpose transformer-based language model fine-tuned for instruction-following tasks. The fine-tuning was designed to match the high-quality generation seen in Claude 3's Sonnet and Opus models, optimized for tasks such as text generation, summarization, question answering, and more.
- Developed by: P0x0
- Finetuned from: Mistral-Nemo-Base-2407
- License: Apache 2.0
Model Sources
- Repository: https://huggingface.co/P0x0/astra-v1-12b
Uses
Direct Use
Astra-v1-12B can be used directly for a wide range of NLP tasks, including:
- Text generation
- Summarization
- Question answering
- Dialogue systems
Downstream Use
This model can be further fine-tuned for specific tasks such as:
- Creative writing
- Instruction-based text completion
- Automated support systems
Out-of-Scope Use
Astra-v1-12B is not intended for real-time decision-making in critical applications or generating harmful or biased content.
Bias, Risks, and Limitations
As with any large language model, Astra-v1-12B may carry inherent biases from the datasets used in fine-tuning. It is important to monitor and review the outputs when using the model in sensitive applications.
How to Get Started with the Model
Here is a Python code snippet to get started with Astra-v1-12B:
from transformers import AutoModelForCausalLM, AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("P0x0/astra-v1-12b")
model = AutoModelForCausalLM.from_pretrained("P0x0/astra-v1-12b")
input_text = "Explain the theory of relativity in simple terms."
inputs = tokenizer(input_text, return_tensors="pt")
outputs = model.generate(**inputs)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))