state of the kimiko chat format

#1
by SlimeQ - opened

I've been building a chatbot which I've standardized on an extended version of what I understand to be the Kimiko chat format. My chat format builds upon your <<AIBOT>> syntax and adds a perspective tag. So I can do things like:

<<slimeq/CHAT>> (a publicly visible text message)
<<slimeq/SPEAKING>> (an internal, narrative message spoken by a character in my story)
<<slimeq/THOUGHT>> (an internal, narrative message thought by a character in my story, which nobody else will see)

To accomplish this I've been building a dataset and training a lora. My first attempt was a fine tune of the original nous-hermes kimiko, and then I moved on to MythoMax and then Tiefighter, both of which immediately understood the Kimiko chat format and were able to adapt effectively.

Now though I'm attempting to move to a mistral base and I've found that this chat format doesn't really work at all. I'm wondering what your plans are @nRuaif , are you abandoning this chat format or planning to train it into a mistral model? It seems like a chatML conversion would be relatively easy, just curious if you've run into this problem yet

I was experimenting with stuff back then with custom format but I get many complains about it so I switched to that Vicuna 1.1.
I will think about ChatML, not really my favorite format but seems like everyone is using it.

Sign up or log in to comment