A simple memory-aided language assistant.
Chat with any local language model, and let smala help you by storing essential information across conversations.
smala is a tool for interacting with LLMs without starting from scratch in every conversation. When you chat with a language model through smala, you can make the tool "remember" certain information ("memories"), either by giving an explicit instruction (the /remember command) or by letting smala summarize the conversation for you. These memories are then available in later chats through smala, whichever language model you choose to use.
smala works with any language model available through Ollama, and lets you build, automatically or manually, your own archive of memories for the language model to draw on while you chat.
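To make the memory mechanism concrete, here is a minimal sketch of the injection pattern, assuming the official ollama Python package; the file name, prompt wording, and function names are illustrative assumptions, not smala's actual implementation:

```python
import json

import ollama  # official Python client for a locally running Ollama server

MEMORY_FILE = "memories.json"  # hypothetical storage location


def load_memories() -> list[str]:
    """Return previously stored memories, or an empty list if none exist yet."""
    try:
        with open(MEMORY_FILE) as f:
            return json.load(f)
    except FileNotFoundError:
        return []


def chat(user_prompt: str, model: str = "llama3.2") -> str:
    """Send a prompt to a local model, prepending stored memories as context."""
    memories = load_memories()
    system_prompt = "You are a helpful assistant. Facts worth remembering about the user:\n" + "\n".join(
        f"- {m}" for m in memories
    )
    response = ollama.chat(
        model=model,
        messages=[
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_prompt},
        ],
    )
    return response["message"]["content"]
```

Because the memories live outside any single model, swapping models in a setup like this loses nothing: the same archive is prepended regardless of which model answers.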
The smala tool is designed around the following principles:
- Privacy: Works with local models, enabling you to freely use personal and sensitive information without worrying about data collection.
- Flexibility: Use any open-weights language model that is available through Ollama, and swap between them without losing any information.
- Control: Customize the instructions given to the model on how to extract and use memories. Easily add and remove memories.
You need to have Ollama installed and to download the language model(s) you want to use (Ollama provides a large selection of models).
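For example, a model can be downloaded with Ollama's pull command (llama3.2 is just one option from the Ollama model library):

```sh
ollama pull llama3.2
```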
Clone repository:
git clone https://github.com/ejhusom/smala
Create and activate a virtual environment (optional):
python3 -m venv venv
source venv/bin/activate
Install requirements:
pip3 install -r requirements.txt
Update config/settings.yaml with your desired setup, most importantly which language model you want to use (make sure you have it installed through Ollama).
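The available keys are defined in the default config file shipped with the repository; as a purely hypothetical illustration, selecting a model might look something like this:

```yaml
# Hypothetical example; see config/settings.yaml in the repository for the actual schema.
model: llama3.2
```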
Run smala:
python3 src/smala.py
The following prompt will appear, and you can start chatting:
Welcome to the Local LLM Assistant!
Type '"""' to start multi-line input mode.
Type '/remember' as part of a prompt to store a summary of it, or type only '/remember' to save a summary of the last prompt.
Type '/exit' to quit. On exit, you will be asked whether a summary of the conversation should be saved as a 'memory'.
>>>
smala lets you chat with an LLM, but unlike most chat tools, it will summarize your conversations (on exit, with your confirmation) and save those summaries as memories. You can also tell smala explicitly to remember certain pieces of information by using the /remember command.
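As a rough sketch of what such a command can do under the hood (illustrative assumptions throughout; this is not smala's actual code), storing a memory amounts to asking the model for a summary of the relevant text and appending the result to the memory store:

```python
import json

import ollama

MEMORY_FILE = "memories.json"  # hypothetical storage location


def remember(text: str, model: str = "llama3.2") -> None:
    """Summarize a piece of text with the local model and persist it as a memory."""
    summary = ollama.chat(
        model=model,
        messages=[
            {
                "role": "system",
                "content": "Condense the following into one sentence, keeping only facts worth remembering.",
            },
            {"role": "user", "content": text},
        ],
    )["message"]["content"]

    try:
        with open(MEMORY_FILE) as f:
            memories = json.load(f)
    except FileNotFoundError:
        memories = []

    memories.append(summary)
    with open(MEMORY_FILE, "w") as f:
        json.dump(memories, f, indent=2)
```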
Automatic creation of memories during a conversation is a planned feature, as indicated by the faded box in the diagram above.
The sequence diagram below illustrates how smala works.
The inspiration for smala came from the Memory feature of ChatGPT, where the chatbot remembers important pieces of information across your conversations. I wanted to build something with similar functionality that is independent of which service or language model you use.
That flexibility, together with privacy and control, forms the founding principles of smala: you own your data, and you can use any available language model you like. smala is designed to be transparent by making it easy to customize all instructions given to the language model about how it extracts and uses memories.