Zoran (zokica)
AI & ML interests: None yet
Organizations: None yet

zokica's activity
Gemma 2's Flash attention 2 implementation is strange... (61) · #23 opened 2 months ago by GPT007
Problem with LoRA finetuning: out of memory (3) · #13 opened about 2 months ago by zokica
OOM when finetuning with LoRA (5) · #1 opened about 2 months ago by zokica
PEFT out of memory · #2 opened about 2 months ago by zokica
Model repeating information and "spitting out" random characters (8) · #14 opened 3 months ago by brazilianslib
Gemma2FlashAttention2 missing sliding_window variable (2) · #8 opened 3 months ago by emozilla
Why batch size > 1 does not increase model speed · #41 opened 3 months ago by zokica
Why UMT5? (6) · #1 opened 6 months ago by pszemraj
Something broken on last update (7) · #85 opened 5 months ago by Nayjest
Can't get it to generate the EOS token, and beam search is not supported (2) · #3 opened 8 months ago by miguelcarv
How to fine-tune this? + Training code (43) · #19 opened 9 months ago by cekal
Added token (1) · #5 opened 6 months ago by zokica
Generation after finetuning does not end at EOS token (1) · #123 opened 6 months ago by zokica
Attention mask for generation function in the future? (21) · #7 opened about 1 year ago by rchan26
The model is extremely slow in 4-bit; is my code for loading OK? · #7 opened about 1 year ago by zokica
guanaco-65b (6) · #1 opened over 1 year ago by bodaay
Speed on CPU (13) · #8 opened over 1 year ago by zokica
Will you make a 3B model as well? (4) · #7 opened over 1 year ago by zokica
How do you run this? (3) · #2 opened over 1 year ago by zokica
How to run this? (3) · #13 opened over 1 year ago by zokica
Does not work at all; I tried to calculate CoLA (11) · #2 opened over 1 year ago by zokica
This works, but training does not work at all (6) · #4 opened over 1 year ago by zokica
How can I use this model on CPU? (6) · #5 opened over 1 year ago by zokica
Tokenizer does not work · #1 opened over 1 year ago by zokica
Benchmark (1) · #4 opened over 1 year ago by zokica
Finetuning the model · #2 opened over 1 year ago by zokica