Shahrukh Khan

shahrukhx01

https://github.com/shahrukhx01

AI & ML interests

NLP

Recent Activity

liked a model 3 days ago

Nexusflow/Athene-V2-Chat

shahrukhx01's activity

upvoted a collection 4 days ago

UltraVox Audio Language Model Release 🔊

3 items • Updated 6 days ago • 15

upvoted a collection 7 days ago

Marqo-Ecommerce-Embeddings

State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP. • 10 items • Updated 7 days ago • 16

upvoted 2 collections 21 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 3 hours ago • 172

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 15 days ago • 95

upvoted a collection 24 days ago

CompassJudger

4 items • Updated Oct 16 • 8

upvoted a collection 25 days ago

glm-4-voice

3 items • Updated 27 days ago • 2

upvoted a collection 29 days ago

Granite 3.0 Language Models

A series of language models trained by IBM and licensed under the Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 17 days ago • 89

upvoted 4 collections about 1 month ago

LayerSkip

Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated about 3 hours ago • 43

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3 • 76

MS MARCO Mined Triplets

These datasets contain MS MARCO Triplets gathered by mining hard negatives using various models. Each dataset has various subsets. • 14 items • Updated May 21 • 10

Parallel Sentences Datasets

These datasets all have "english" and "non_english" columns. They can be used to make embedding models multilingual. • 14 items • Updated Oct 9 • 12

upvoted an article about 1 month ago

Article

Faster Assisted Generation with Dynamic Speculation

Oct 8 • 34

upvoted 8 collections about 2 months ago

em🍞ing series

crispy sentence embedding family • 5 items • Updated Oct 14 • 22

Nomic Embed

Open Source Long Context Text Embedders • 8 items • Updated Feb 14 • 16

ProLong

ProLong is a family of long-context models produced by continued training and supervised fine-tuning of Llama-3-8B, with a maximum context window of 512K • 7 items • Updated about 1 month ago • 4

NuNerZero - Zero Shot NER

The best compact Zero-Shot NER models with MIT license • 4 items • Updated Jul 3 • 19

NuExtract

4 items • Updated Oct 17 • 9

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 7 days ago • 271

Tower

Model weights and SFT data for Tower. • 11 items • Updated 6 days ago • 26

EuroLLM

2 items • Updated Aug 7 • 15