Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
19
2
14
Sam Paech
PRO
sam-paech
Follow
Lilly-Hanna's profile picture
Prasad23's profile picture
21world's profile picture
40 followers
·
4 following
https://eqbench.com
sam_paech
sam-paech
AI & ML interests
Emotional intelligence, alignment, benchmarking
Recent Activity
New activity
about 12 hours ago
sam-paech/Darkest-muse-v1
New activity
1 day ago
HuggingFaceTB/SmolLM2-1.7B-Instruct
New activity
2 days ago
sam-paech/Darkest-muse-v1
View all activity
Articles
MMLU-Pro-NoMath
Jul 11
•
3
Organizations
sam-paech
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
sam-paech/Darkest-muse-v1
about 12 hours ago
Love this model but I wish the context was higher
3
#3 opened 2 days ago by
HannaLovvold
New activity in
HuggingFaceTB/SmolLM2-1.7B-Instruct
1 day ago
GSM8K results replication
2
#9 opened 18 days ago by
sam-paech
New activity in
sam-paech/Darkest-muse-v1
2 days ago
This is very strong in slop and purple pose, I feel.
1
#2 opened 2 days ago by
UniversalLove333
New activity in
sam-paech/Darkest-muse-v1
11 days ago
system prompt gemma2?
1
#1 opened 13 days ago by
rwfrs
New activity in
sam-paech/Quill-v1
24 days ago
This model is exactly what I've been looking for!
1
#1 opened 24 days ago by
PinkMoth
New activity in
mradermacher/model_requests
29 days ago
sam-paech/Delirium-v1
5
#391 opened 29 days ago by
sam-paech
New activity in
sam-paech/gutenberg3-generalfiction-scifi-fantasy-romance-adventure-dpo
29 days ago
[bot] Conversion to Parquet
#1 opened about 1 month ago by
parquet-converter
Librarian Bot: Add language metadata for dataset
#2 opened 29 days ago by
librarian-bot
New activity in
sam-paech/Delirium-v1
29 days ago
tokenizer.model
4
#1 opened 29 days ago by
mradermacher
New activity in
TheDrummer/Moistral-11B-v3
about 2 months ago
WAR ON MINISTRATIONS
11
#1 opened 7 months ago by
TheDrummer
New activity in
ajibawa-2023/General-Stories-Collection
about 2 months ago
Which models were used to generate this dataset?
1
#2 opened about 2 months ago by
sam-paech
New activity in
SkunkworksAI/reasoning-0.01
2 months ago
Which model was used?
1
#5 opened 2 months ago by
sam-paech
New activity in
google/gemma-2-27b-it
4 months ago
Hallucinations, misspellings etc. Something seems broken?
21
#10 opened 5 months ago by
sam-paech
New activity in
open-llm-leaderboard/open_llm_leaderboard
5 months ago
Running MMLU-Pro with Eleuther LM-Eval
2
#814 opened 5 months ago by
sam-paech
📑 Raw results link goes to old leaderboard results dataset
2
#808 opened 5 months ago by
sam-paech
New activity in
senseable/WestLake-7B-v2
7 months ago
EQ-Bench score
12
#8 opened 10 months ago by
sam-paech
New activity in
HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1
7 months ago
Prompt format
6
#4 opened 7 months ago by
sam-paech
New activity in
CausalLM/34b-beta
8 months ago
Regarding Concerns about MMLU Scores
71
#5 opened 8 months ago by
deleted
New activity in
froggeric/WestLake-10.7B-v2
8 months ago
Details of all the merge attempts until this one
6
#1 opened 8 months ago by
froggeric
Details of all the merge attempts until this one
6
#1 opened 8 months ago by
froggeric
Load more