7 2 61

Csaba Kecskemeti PRO

csabakecskemeti

https://devquasar.com/

csabakecskemeti

AI & ML interests

None yet

Recent Activity

liked a model 6 minutes ago

sarpba/whisper-base-hungarian_v1

updated a model 14 minutes ago

DevQuasar/INSAIT-Institute.BgGPT-Gemma-2-2.6B-IT-v1.0-GGUF

updated a model 15 minutes ago

DevQuasar/INSAIT-Institute.BgGPT-Gemma-2-27B-IT-v1.0-GGUF

Organizations

Posts 2

Post

1187

Some time ago, I built a predictive LLM router that routes chat requests between small and large LLM models based on prompt classification. It dynamically selects the most suitable model depending on the complexity of the user input, ensuring optimal performance while maintaining conversation context. I also fine-tuned a RoBERTa model to use with the package, but you can plug and play any classifier of your choice.

Project's homepage:
https://devquasar.com/llm-predictive-router/
Pypi:
https://pypi.org/project/llm-predictive-router/
Model:
DevQuasar/roberta-prompt_classifier-v0.1
Training data:
DevQuasar/llm_router_dataset-synth
Git:
https://github.com/csabakecskemeti/llm_predictive_router_package

Feel free to check it out, and/or contribute.

Post

1483

I've built a small open utility pip package called LLM-Forwarder that allows you to inject context, such as adding a private RAG, into existing chat applications by forwarding the app through the LLM-Forwarder. In the forwarder server, you can configure custom code to re-process chat messages and alter the user prompt, for example, by adding extra context.

https://pypi.org/project/llm-forwarder/
More details
https://devquasar.com/llmforwarder/

models 1

csabakecskemeti/bert-base-case-yelp5-tuned-experiment

Text Classification • Updated Apr 5 • 12

datasets

None public yet