
feat: use token limits in core memory instead of character limits #2081

Open

cpacker wants to merge 4 commits into main
Conversation

@cpacker (Collaborator) commented Nov 21, 2024

Is 2000 tokens a good starting point? (~8k chars, assuming the usual rough estimate of ~4 characters per token)

Pending design decision (see the sketch after this list):

  • Do we make Block.tokenizer_model nullable (Optional[str]), or a str with a default imported from constants.py?
    • If the field is nullable, the Pydantic validator that raises an error when the token count exceeds the limit can simply fall back to constants.DEFAULT_TOKENIZER_MODEL whenever self.tokenizer_model is None.
    • Alternatively (if we set a default), self.tokenizer_model will be constants.DEFAULT_TOKENIZER_MODEL from the start.
    • The key difference seems to be that if we make the field nullable and infer the default lazily, we can change the default for all existing blocks in new package versions.
  • Or do we put tokenizer_model inside of metadata_?
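
A minimal sketch of the nullable-field option, assuming a Pydantic v2 model and tiktoken for counting. The constant names, the 2000-token default, and the `"gpt-4"` fallback are placeholders for illustration, not the actual letta code:

```python
from typing import Optional

import tiktoken
from pydantic import BaseModel, model_validator

# Assumed stand-ins for values that would live in constants.py.
DEFAULT_TOKENIZER_MODEL = "gpt-4"
DEFAULT_CORE_MEMORY_TOKEN_LIMIT = 2000  # the 2000-token starting point proposed above


class Block(BaseModel):
    value: str = ""
    limit: int = DEFAULT_CORE_MEMORY_TOKEN_LIMIT
    # Nullable: the default tokenizer is resolved lazily at validation time,
    # so changing DEFAULT_TOKENIZER_MODEL in a new package version also
    # changes behavior for existing blocks that stored None.
    tokenizer_model: Optional[str] = None

    @model_validator(mode="after")
    def check_token_limit(self) -> "Block":
        model = self.tokenizer_model or DEFAULT_TOKENIZER_MODEL
        encoding = tiktoken.encoding_for_model(model)
        n_tokens = len(encoding.encode(self.value))
        if n_tokens > self.limit:
            raise ValueError(
                f"value is {n_tokens} tokens, over the {self.limit}-token limit"
            )
        return self
```

Under this option, the `metadata_` alternative would only change where the field is stored; the lazy fallback in the validator would work the same way.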

TODOs for a later PR (see the sync sketch after this list):

  • Initialize tokenizer_model in the Memory class from agent.llm_config.model.
  • Make sure that when agent.llm_config.model is changed, memory.tokenizer_model changes with it.
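
A hypothetical sketch of that sync behavior; the Agent/LLMConfig/Memory shapes and the set_model helper here are assumptions, not the actual letta classes:

```python
from dataclasses import dataclass, field


@dataclass
class LLMConfig:
    model: str = "gpt-4"


@dataclass
class Memory:
    tokenizer_model: str = "gpt-4"


@dataclass
class Agent:
    llm_config: LLMConfig = field(default_factory=LLMConfig)
    memory: Memory = field(default_factory=Memory)

    def set_model(self, model: str) -> None:
        # Route all model changes through one setter so the memory's
        # tokenizer always follows the agent's LLM.
        self.llm_config.model = model
        self.memory.tokenizer_model = model


agent = Agent()
agent.set_model("gpt-4o")
assert agent.memory.tokenizer_model == agent.llm_config.model
```

Funneling updates through a single setter (rather than assigning llm_config.model directly) is one way to guarantee the two fields can never drift apart.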
