KrisKale45 (Joshua Chris)

upvoted an article 19 days ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

By

•

21 days ago

• 30

upvoted an article about 2 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 164

upvoted a paper about 2 months ago

Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models

Paper • 2409.12139 • Published Sep 18 • 11

upvoted a paper 2 months ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29 • 47

upvoted an article 2 months ago

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

By

•

Aug 26

• 35

upvoted 2 papers 3 months ago

FocusLLM: Scaling LLM's Context by Parallel Decoding

Paper • 2408.11745 • Published Aug 21 • 23

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19 • 51

upvoted 2 articles 3 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19

• 73

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 102

upvoted an article 4 months ago

Article

Revolutionizing Video Transcription: Unveiling Gemma-2b-it and Langchain in the Era of Transformers

By

•

Mar 12

• 3

upvoted a collection 4 months ago

NuExtract

Collection

4 items • Updated 24 days ago • 9

upvoted a paper 4 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 128

upvoted 2 articles 5 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 177

Article

Uncensor any LLM with abliteration

By

•

Jun 13

• 362

upvoted an article 7 months ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Apr 22

• 78

upvoted a paper 7 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 251

upvoted a paper 9 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126

upvoted a paper 11 months ago

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 30

upvoted a paper about 1 year ago

Aligning Large Multimodal Models with Factually Augmented RLHF

Paper • 2309.14525 • Published Sep 25, 2023 • 29

upvoted a paper over 1 year ago

Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts

Paper • 2307.07218 • Published Jul 14, 2023 • 26

Joshua Chris

AI & ML interests

Organizations

KrisKale45's activity

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

Llama can now see and run on your device - welcome Llama 3.2

Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

FocusLLM: Scaling LLM's Context by Parallel Decoding

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Welcome FalconMamba: The first strong attention-free 7B model

Revolutionizing Video Transcription: Unveiling Gemma-2b-it and Langchain in the Era of Transformers

NuExtract

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Uncensor any LLM with abliteration

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Aligning Large Multimodal Models with Factually Augmented RLHF

Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts