Gökdeniz Gülmez (Isaak Carter Augustus)'s picture

Gökdeniz Gülmez (Isaak Carter Augustus)

Goekdeniz-Guelmez

·

AI & ML interests

Transformers / NLP / Multimodal / Realtime M2M / J.O.S.I.E.v4o

Recent Activity

liked a dataset 3 days ago

OpenCoder-LLM/fineweb-code-corpus

liked a model 3 days ago

mistralai/Mistral-Large-Instruct-2411

liked a model 3 days ago

Qwen/Qwen2.5-Coder-7B-Instruct

Organizations

Goekdeniz-Guelmez's activity

upvoted 2 papers 14 days ago

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Paper • 2411.03823 • Published 15 days ago • 43

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published 16 days ago • 60

upvoted a collection 17 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 4 hours ago • 172

upvoted a paper 21 days ago

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25 • 74

upvoted an article 25 days ago

Article

Visually Multilingual: Introducing mcdse-2b

By

•

25 days ago

• 37

upvoted 2 collections about 1 month ago

Papers - MoE

45 items • Updated Aug 23 • 3

Josiefied and Abliterated

Abliterated, and further fine-tuned to be the most uncensored models available. • 11 items • Updated 4 days ago • 3

upvoted 3 collections about 2 months ago

🍷 FineWeb datasets

5 items • Updated Jun 26 • 20

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 7 days ago • 271

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 20 items • Updated about 13 hours ago • 39

upvoted a paper about 2 months ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16 • 38

upvoted 2 collections 2 months ago

Mamba

Mamba is a new LLM architecture that integrates the Structured State Space sequence model to manage lengthy data sequences. • 11 items • Updated Oct 12 • 1

Transformers compatible Mamba

This release includes the `mamba` repositories compatible with the `transformers` library • 5 items • Updated Mar 6 • 36

upvoted 2 articles 2 months ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13

• 369

Article

Recreating o1 at Home with Role-Play LLMs

By

•

Sep 20

• 20

upvoted 3 collections 2 months ago

Qwen2.5

The Qwen 2.5 models are a series of AI models trained on 18 trillion tokens, supporting 29 languages and offering advanced features such as instructio • 33 items • Updated Oct 12 • 4

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 370

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 218

upvoted an article 3 months ago

Article

What is Retrieval-based Voice Conversion WebUI?

By

•

Aug 18

• 9

upvoted an article 4 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29

• 244