ymcki
/

gemma-2-2b-jpn-it-abliterated-17-ORPO

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

gemma-2-2b-jpn-it-abliterated-17-ORPO / README.md

ymcki's picture

Upload README.md

531b2e2 verified 3 days ago

|

3.54 kB

	---
	base_model: google/gemma-2-2b-jpn-it
	language:
	- multilingual
	datasets:
	- mlabonne/orpo-dpo-mix-40k
	library_name: transformers
	license: gemma
	license_link: https://ai.google.dev/gemma/terms
	pipeline_tag: text-generation
	tags:
	- nlp
	- code
	quantized_by: ymcki
	widget:
	- messages:
	- role: user
	content: Can you provide ways to eat combinations of bananas and dragonfruits?
	---

	Original model: https://huggingface.co/google/gemma-2-2b-jpn-it

	## Prompt format

	```
	<start_of_turn>user
	{prompt}<end_of_turn>
	<start_of_turn>model
	<end_of_turn>
	<start_of_turn>model

	```

	Note that this model does not support a System prompt.

	This is abliterated model of [google/gemma-2-2b-jpn-it](https://huggingface.co/google/gemma-2-2b-jpn-it) using the
	[method](https://medium.com/@mlabonne/uncensor-any-llm-with-abliteration-d30148b7d43e)
	described by mlabonne.

	Layer 17 of the original model was chosen for abliteration.
	I also created another layer 18 abliterated model for comparison.

	ORPO fine tuning was performed for four epoches.

	\| Epoch \| loss \| eval_loss \|
	\| ----- \| ---- \| --------- \|
	\| 1 \| 10.51274610161781342 \| 11.023366928100586 \|
	\| 2 \| 10.09700682163238566 \| 10.434176445007324 \|
	\| 3 \| 10.35771694183349566 \| 10.179500579833984 \|
	\| 4 \| 10.82988178133964582 \| 10.084120750427246 \|

	The fine tuned model is uploaded here to be evaluated by the Open LLM Leaderboard to see if the brain damaged
	suffered by the non-ORPO model can be healed.

	## Benchmark (100.0*raw scores only)

	Click on the model name go to the raw score json generated by Open LLM Leaderboard.

	\| Model \| Average \| IFEval \| BHH \| Math Lv5 \| GPQA \| MUSR \| MMLU-PRO \|
	\| ----- \| ------- \| ------ \| ----\|--------- \| ---- \| ---- \| -------- \|
	\| [gemma-2-2b-jpn-it](https://huggingface.co/datasets/open-llm-leaderboard/results/blob/main/google/gemma-2-2b-jpn-it/results_2024-10-15T15-21-39.173019.json) \| 30.82 \| 54.11 \| 41.43 \| 0.0 \| 27.52 \| 37.17 \| 24.67 \|
	\| gemma-2-2b-jpn-it-abliterated-17-ORPO \| TBD \| TBD \| TBD \| TBD \| TBD \| TBD \| TBD \|
	\| [gemma-2-2b-jpn-it-abliterated-17](https://huggingface.co/datasets/open-llm-leaderboard/results/raw/main/ymcki/gemma-2-2b-jpn-it-abliterated-17/results_2024-10-17T11-26-10.721815.json) \| 16.74 \| 0.0 \| 29.13 \| 0.0 \| 25.92 \| 33.73 \| 11.68 \|
	\| [gemma-2-2b-jpn-it-abliterated-18](https://huggingface.co/datasets/open-llm-leaderboard/results/raw/main/ymcki/gemma-2-2b-jpn-it-abliterated-18/results_2024-10-16T07-58-03.781979.json) \| 16.74 \| 0.0 \| 29.13 \| 0.0 \| 25.92 \| 33.73 \| 11.68 \|

	Indeed, it is quite dumbed down relative to the original. Interestingly, both abliteration models have the same Open LLM results.

	## How to run this model

	```py
	from transformers import AutoTokenizer, AutoModelForCausalLM
	import transformers
	import torch

	model_id = "gemma-2-2b-jpn-it-abliterated-17-ORPO"
	dtype = torch.bfloat16

	tokenizer = AutoTokenizer.from_pretrained(model_id)
	model = AutoModelForCausalLM.from_pretrained(
	model_id,
	device_map="cuda",
	torch_dtype=dtype,)

	chat = [
	{ "role": "user", "content": "Write a hello world program" },
	]
	prompt = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
	```

	## Downloading using huggingface-cli

	First, make sure you have hugginface-cli installed:

	```
	pip install -U "huggingface_hub[cli]"
	```

	Then, you can target the specific file you want:

	```
	huggingface-cli download ymcki/gemma-2-2b-jpn-it-abliterated-17-ORPO --include "*" --local-dir ./
	```

	## Credits

	Thank you mlabonne for describing his abliteration method.