Releases · oKatanaaa/lima-gui
v0.6.1
Breaking changes
- Added `tiktoken` as a dependency. OpenAI tokenizers are now supported.
- To launch the app, run `limagui` instead of `python -m lima_gui.app`.
Fixes
- Fixes #4. Since LLM providers love to mess with licensing (sometimes breaking access to tokenizers), `cl100k_base` from `tiktoken` is now used as the default underlying tokenizer.
- Added `loguru` to the dependency list.
v0.6.0
In this update I lay the foundation for better compatibility with the existing LLM finetuning stack by changing the data schema to be more compliant with the OpenAI API. You can now easily export data (new Export as OpenAI dataset option) and use it as is with existing training pipelines.
It also includes various QoL improvements.
Breaking changes
- The underlying data format has been massively overhauled, meaning that data collected with an older version of LIMA-GUI won't load in the newer version. To update your data, use the `python -m lima_gui.update_data` script. It takes a path to a target input file (or folder with multiple files) and a path to a target output file (or folder). Note that the script can't handle function calling data; if needed, I can update the script (just open a corresponding issue).
- When using the completion API, the chat is formatted in ChatML. That means you can use completion mode to generate (and steer) partial answers of ChatML-compliant models, such as `cognitivecomputations/dolphin-2.6-mistral-7b-dpo`.
- The `transformers` library is removed from dependencies; `tokenizers` is used instead. Sorry for that stupid mistake.
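For reference, ChatML wraps each message in `<|im_start|>`/`<|im_end|>` markers. A minimal sketch of the formatting (the helper name and sample messages are illustrative, not LIMA-GUI's actual code):

```python
def to_chatml(messages, add_generation_prompt=True):
    """Render OpenAI-style messages as a ChatML prompt string."""
    out = ""
    for m in messages:
        out += f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
    if add_generation_prompt:
        # Leave the assistant turn open so a completion model continues it,
        # which is what enables steering partial answers in completion mode.
        out += "<|im_start|>assistant\n"
    return out

prompt = to_chatml([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hi!"},
])
print(prompt)
```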
Major changes
- You can now export your dataset in an OpenAI finetuning API compliant format (a `jsonl` file with a lot of `{"messages": [...]}` entries). Click File -> Export as OpenAI dataset.
- Ctrl + S now saves into the last opened file and no longer opens a file selection window.
- LIMA-GUI will track changes and:
  - ask you to save the data if you haven't done so and are trying to close the program;
  - ask you to save the data if you haven't done so and are trying to open another file.
- All prints are replaced with the `loguru` library. All calls are logged; as of now, the `DEBUG` level is set by default.
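The exported format can also be produced (or post-processed) by hand. A minimal sketch of writing and reading such a `jsonl` file, one conversation per line (the file name and sample conversation are made up):

```python
import json

# One conversation per line, each in OpenAI finetuning chat format.
conversations = [
    {"messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is 2 + 2?"},
        {"role": "assistant", "content": "2 + 2 = 4."},
    ]},
]

with open("dataset.jsonl", "w", encoding="utf-8") as f:
    for conv in conversations:
        f.write(json.dumps(conv, ensure_ascii=False) + "\n")

# Reading it back: one JSON object per line.
with open("dataset.jsonl", encoding="utf-8") as f:
    loaded = [json.loads(line) for line in f]
print(len(loaded), "conversations")
```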
Fixes
- LIMA-GUI now works with the latest version of the `openai` library.
v0.5.1
Features
- Function calling API. `assistant` messages may now contain function calls. This is useful for gathering data for LLMs with agency (e.g. making your own code interpreter or an LLM that supports plugins). You can also use the OpenAI API to generate function calls automatically.
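In the OpenAI chat format, such an `assistant` message carries a `function_call` whose arguments are serialized as a JSON string. A minimal sketch (the function name and arguments are made up for illustration):

```python
import json

# An assistant message requesting a function call (OpenAI chat format).
message = {
    "role": "assistant",
    "content": None,  # no text content when a function is being called
    "function_call": {
        "name": "run_python",  # hypothetical function name
        "arguments": json.dumps({"code": "print(2 + 2)"}),
    },
}

# Arguments arrive as a JSON string and must be parsed before use.
args = json.loads(message["function_call"]["arguments"])
print(args["code"])
```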
Bugfixes
- The `on_content_changed` callback was called twice when setting data in a message item.
v0.4.2
First usable version. I am making this release in preparation for future (possibly breaking) changes to make sure there is a working version available.
Functionality list:
- allows gathering multi-turn conversational data (for ChatGPT-like chatbots);
- fully OpenAI API compliant data format;
- OpenAI API integration for data gathering assistance;
- allows tagging conversations (coding, QA, contextual QA, etc.);
- allows assigning a language to a dialog (currently ru/en only);
- token counting in each dialog (uses HF tokenizers).
Notes:
- currently no performance optimizations are in place;
- may crash sometimes (I am working on this, but make sure to save your data regularly).