Sam Paech PRO

sam-paech

https://eqbench.com

AI & ML interests

Emotional intelligence, alignment, benchmarking

Recent Activity

New activity about 12 hours ago

sam-paech/Darkest-muse-v1

New activity 1 day ago

HuggingFaceTB/SmolLM2-1.7B-Instruct

New activity 2 days ago

sam-paech/Darkest-muse-v1

Articles

MMLU-Pro-NoMath

sam-paech's activity

New activity in sam-paech/Darkest-muse-v1 about 12 hours ago

Love this model but I wish the context was higher

#3 opened 2 days ago by

New activity in HuggingFaceTB/SmolLM2-1.7B-Instruct 1 day ago

GSM8K results replication

#9 opened 18 days ago by

New activity in sam-paech/Darkest-muse-v1 2 days ago

This is very strong in slop and purple prose, I feel.

#2 opened 2 days ago by

UniversalLove333

New activity in sam-paech/Darkest-muse-v1 11 days ago

system prompt gemma2?

#1 opened 13 days ago by

New activity in sam-paech/Quill-v1 24 days ago

This model is exactly what I've been looking for!

#1 opened 24 days ago by

New activity in mradermacher/model_requests 29 days ago

sam-paech/Delirium-v1

#391 opened 29 days ago by

New activity in sam-paech/gutenberg3-generalfiction-scifi-fantasy-romance-adventure-dpo 29 days ago

[bot] Conversion to Parquet

#1 opened about 1 month ago by

parquet-converter

Librarian Bot: Add language metadata for dataset

#2 opened 29 days ago by

New activity in sam-paech/Delirium-v1 29 days ago

tokenizer.model

#1 opened 29 days ago by

New activity in TheDrummer/Moistral-11B-v3 about 2 months ago

WAR ON MINISTRATIONS

#1 opened 7 months ago by

New activity in ajibawa-2023/General-Stories-Collection about 2 months ago

Which models were used to generate this dataset?

#2 opened about 2 months ago by

New activity in SkunkworksAI/reasoning-0.01 2 months ago

Which model was used?

#5 opened 2 months ago by

New activity in google/gemma-2-27b-it 4 months ago

Hallucinations, misspellings etc. Something seems broken?

#10 opened 5 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 5 months ago

Running MMLU-Pro with Eleuther LM-Eval

#814 opened 5 months ago by

📑 Raw results link goes to old leaderboard results dataset

#808 opened 5 months ago by

New activity in senseable/WestLake-7B-v2 7 months ago

EQ-Bench score

#8 opened 10 months ago by

New activity in HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 7 months ago

Prompt format

#4 opened 7 months ago by

New activity in CausalLM/34b-beta 8 months ago

Regarding Concerns about MMLU Scores

#5 opened 8 months ago by deleted

New activity in froggeric/WestLake-10.7B-v2 8 months ago

Details of all the merge attempts until this one

#1 opened 8 months ago by
