wing lian PRO

winglian

AI & ML interests

None yet

Organizations

winglian's activity

New activity in microsoft/Phi-3.5-mini-instruct 28 days ago

trust_remote_code=True

1
#9 opened 28 days ago by winglian
New activity in NousResearch/Hermes-2-Pro-Llama-3-8B 5 months ago

add axolotl tag

#1 opened 5 months ago by winglian
New activity in mattshumer/Llama-3-8B-16K 5 months ago

add axolotl tag

#3 opened 5 months ago by winglian
New activity in cognitivecomputations/dolphin-2.9-llama3-8b 5 months ago

add axolotl tag

#12 opened 5 months ago by winglian
New activity in PrunaAI/dbrx-base-bnb-4bit 6 months ago

reduce verbosity of logging

#1 opened 6 months ago by winglian
New activity in LnL-AI/dbrx-base-converted-v2 6 months ago

reduce logging verbosity

1
#3 opened 6 months ago by winglian

dbrx-base

1
#2 opened 6 months ago by winglian
New activity in ai21labs/Jamba-v0.1 6 months ago

finetuning issues

2
#9 opened 6 months ago by winglian
New activity in cerebras/SlimPajama-627B 9 months ago

Trouble with streaming

6
#5 opened about 1 year ago by andersonbcdefg
New activity in Open-Orca/SlimOrca-Dedup 11 months ago

minhash deduping

1
#2 opened 11 months ago by winglian
New activity in crumb/c4-benchfilter-nano 11 months ago
New activity in stabilityai/stablelm-3b-4e1t 12 months ago

fix get_input_embdeddings

1
#3 opened 12 months ago by winglian
New activity in microsoft/phi-1_5 12 months ago
New activity in PygmalionAI/pygmalion-2-13b about 1 year ago

add axolotl badge to readme

1
#1 opened about 1 year ago by winglian
New activity in PygmalionAI/pygmalion-2-7b about 1 year ago

add axolotl badge to readme

#2 opened about 1 year ago by winglian
New activity in PygmalionAI/mythalion-13b about 1 year ago

Add axolotl badge to readme

#1 opened about 1 year ago by winglian
New activity in microsoft/phi-1_5 about 1 year ago

add _no_split_modules property

#17 opened about 1 year ago by winglian
New activity in garage-bAInd/Platypus2-13B about 1 year ago

Dataset

3
#1 opened about 1 year ago by winglian
New activity in eugenepentland/oo-packing-checkpoint-15000 about 1 year ago

Upload 3 files

#1 opened about 1 year ago by winglian
New activity in winglian/t5-large-flan-cot about 1 year ago
New activity in openaccess-ai-collective/openllama-7b-4k over 1 year ago

What does the 4k stand for?

2
#1 opened over 1 year ago by flashvenom
New activity in mosaicml/mpt-7b over 1 year ago

upstream-replit-updates

4
#43 opened over 1 year ago by winglian
New activity in openaccess-ai-collective/jeopardy-bot over 1 year ago

Token length 3908?

1
#1 opened over 1 year ago by Yhyu13
New activity in openaccess-ai-collective/StableLManticore-7B over 1 year ago

Is this based on LLaMA?

1
#1 opened over 1 year ago by Yhyu13

What model is this?

1
#1 opened over 1 year ago by Yhyu13
New activity in openaccess-ai-collective/manticore-13b over 1 year ago

epoch 3 final? or 4 coming?

1
#6 opened over 1 year ago by faisalhr1997
New activity in BlinkDL/rwkv-4-pileplus over 1 year ago

14B

#1 opened over 1 year ago by winglian
New activity in openaccess-ai-collective/mpt-7b-wizardlm over 1 year ago

fine-tuning script notebook

4
#1 opened over 1 year ago by g30rv17ys
New activity in openaccess-ai-collective/manticore-13b over 1 year ago

Some suggestions for optimization

10
#3 opened over 1 year ago by polymer

specific instruct prompt to use

3
#2 opened over 1 year ago by digitous
New activity in openaccess-ai-collective/wizard-mega-13b over 1 year ago

Prompt format contradiction

2
#5 opened over 1 year ago by 2EyeGuy
New activity in openaccess-ai-collective/manticore-13b over 1 year ago

Difference with Wizzard Mega

1
#1 opened over 1 year ago by frandmb
New activity in openaccess-ai-collective/wizard-mega-13b over 1 year ago

Fine-tune specific details

6
#2 opened over 1 year ago by polymer
New activity in P1ayer-1/chatgpt-conversations-chatlogs.net over 1 year ago

v1 vs v2

1
#1 opened over 1 year ago by winglian
New activity in openaccess-ai-collective/ggml-ui over 1 year ago
New activity in theblackcat102/reward-deberta-v3-large-aspect over 1 year ago

training code

#1 opened over 1 year ago by winglian
New activity in TehVenom/MPT-7b-Chat-Instruct-LongCTX-Merge over 1 year ago

weighted average?

1
#4 opened over 1 year ago by winglian