Budget for Summ AI #3065

PeterNerlich · 2024-09-18T11:13:20Z

Short description

This PR adds separate budget accounting for Summ AI. Until now the usage of Summ AI was not metered at all by us.

Proposed changes

Add summ_ai_budget_used to the Region model, as well as various auxiliary properties and budget related changes analogous to machine translations
Update summ_ai_budget_used of the region accordingly after a batch of translations was performed
This required a change to how the asynchronous translation flow works:
Instead of putting all tasks into the PatientTaskQueue from the start, more are added every time the queue gets low enough
- This is because individual tasks don't represent entire content translation objects, but individual snippets (e.g. a single paragraph), since Summ AI doesn't handle HTML content (well enough for our purposes)
- We want to avoid sending Summ AI requests that we get billed for and then finding out that the region didn't have the translation budget for it
Introduce the BudgetEstimate class to keep track of "allocated" budget, via an instance shared with each TranslationHelper – the helper class representing a single content translation object responsible for splitting it up into single translatable units and splicing it back together after translation
Pass TranslationHelper instead of TextField instances around to batch translation tasks per content translation
Only add all translation tasks necessary for a content translation if it still fits into the budget according to BudgetEstimate
This on-demand expansion of the queue is facilitated by means of a management task, during its run any workers trying to fetch a task from an empty queue won't quit because it is likely that more tasks will be available a moment later, and the worker pool shouldn't shrink from such race conditions
The PatientTaskQueue is initialized empty and worker() is run directly to fill it before the workers are initialized

Side effects

Machine translations with DeepL or Google Cloud Translate are using up a regions budget whether they are successful or not. Translations with Summ AI are metered in the same way, but whether there were retries because of API rate limiting as well as whether translations were not attempted at all because the budget was already used up is not taken into account. However, I think this should not pose too much of a problem.
This is a particularly confusing section of our code base, and I play not a small role in this. I hope I added enough and helpful enough comments, please tell me if you think this could be improved!
This PR has become so large that we didn't include tests. The existing tests still succeed and guarantee the translation functionality, but the budget metering is not covered.

Resolved issues

This fulfills the first half of #2173

Pull Request Review Guidelines

codeclimate · 2024-09-18T11:13:40Z

Code Climate has analyzed commit 292c552 and detected 0 issues on this pull request.

The test coverage on the diff in this pull request is 89.9% (50% is the threshold).

This pull request will bring the total coverage in the repository to 82.8% (0.0% change).

View more on Code Climate.

charludo

I am sorry, but this needs a major architectural overhaul. Apart from the suggestions below, the main reason I am saying this is the following:

when SummAiApiClient.translate_queryset is called,
a lambda function returning a closure calling SummAiApiClient.check_usage (which overrides MachineTranslationApiClient.check_usage) is used in the creation of a BudgetHelper object,
which is used for the instantiation of a TranslationHelper object,
which has TranslationHelper.check_usage method,
which calls its embedded BudgetHelper.check_usage method,
which calls BudgetHelper._check_usage,
which is the aforementioned closure.

Now back up the chain:

BudgetHelper.check_usage is only called by BudgetHelper.allocate,
which is only called by TranslationHelper.allocate_budget,
which is only called by SummAiApiClient.translate_text_fields,
which is only called by SummAiApiClient.translate_queryset.

Frankly, I question the need for the BudgetEstimate class, as well as the helper methods on TranslationHelper.

However, it is difficult to give more directed feedback, simply because of the amount of obfuscation going on due to these enormous call chains.

I have not yet had a chance to examine the async functions.

charludo · 2024-10-23T13:20:12Z

integreat_cms/core/utils/word_count.py

+    if isinstance(translation, AbstractContentTranslation):
+        attributes = [
+            getattr(translation, attr, None)
+            for attr in ["title", "content", "meta_description"]
+        ]
+
+        content_to_translate = [
+            unescape(strip_tags(attr)) for attr in attributes if attr
+        ]
+        content_to_translate_str = " ".join(content_to_translate)
+    else:
+        content_to_translate_str = translation


This errs on nitpick territory, but: I really dislike the isinstance pattern. What do you think about refactoring this so both this function and check_usage always get a string as the input?

This would require adding a translateable_content (or similar) property to the AbstractContentTranslation model. For cases where the input is string, nothing needs to be changed; whenever a ACT is passed, pass ACT.translateable_content()` instead?

This would also reduce maintanence overhead if we decide to make more fields trasnslatable in the future, since only this new method would need to be tweaked, not multiple functions for multiple translation providers.

This is not 100% necessary, just thought I would put it out there!

charludo · 2024-10-23T13:22:22Z

integreat_cms/locale/de/LC_MESSAGES/django.po

+
+#: cms/models/regions/region.py
+msgid "Credits renewal date for simplified language translation"
+msgstr "Credits Zurücksetzungsdatum für Übersetzung in Einfache Sprache"


Suggested change

msgstr "Credits Zurücksetzungsdatum für Übersetzung in Einfache Sprache"

msgstr "Zurücksetzungsdatum der Credits für Übersetzung in Einfache Sprache"

I know, this is copied from the existing translation, I just noticed that this is probably a more understandable translation. Also applies to the other instance of this.

charludo · 2024-10-23T13:24:53Z

integreat_cms/locale/de/LC_MESSAGES/django.po

@@ -7841,6 +7865,14 @@ msgstr "Bereits verbraucht"
 msgid "Remaining words"
 msgstr "Verbleibende Wörter"

+#: cms/templates/regions/region_form.html
+msgid "Currently HIX is globally deactivated"


Suggested change

msgid "Currently HIX is globally deactivated"

msgid "HIX is currently deactivated globally"

Has nothing to do with this PR, really, just noticed it in the diff. THis is a very German ordering of the words, the suggestion is the far more "natural" sentence in English 😄

charludo · 2024-10-23T13:30:09Z

integreat_cms/summ_ai_api/summ_ai_api_client.py

+    def check_usage(
+        self,
+        region: Region,
+        source_translation: str | AbstractContentTranslation,
+        allocated_budget: int = 0,
+    ) -> tuple[bool, int]:
+        """
+        This function checks if the attempted translation would exceed the region's word limit
+
+        :param region: region for which to check usage
+        :param source_translation: single content object
+        :param allocated_budget: how many additional words should be considered already spent
+        :return: translation would exceed limit, word count of attempted translation
+        """
+        words = word_count(source_translation)
+
+        region.refresh_from_db()
+        # Allow up to SUMM_AI_SOFT_MARGIN more words than the actual limit
+        word_count_leeway = max(
+            1, words + allocated_budget - settings.SUMM_AI_SOFT_MARGIN
+        )
+        translation_exceeds_limit = region.summ_ai_budget_remaining < word_count_leeway
+
+        return (translation_exceeds_limit, words)
+


This is almost 1:1 from the MachineTranslationApiClient. Suggestion to avoid this code duplication:

add margin_field and budget_field as class variables of MachineTranslationApiClient

use these fields in the check_usage method of that class. Then you just need to overwrite the fields in SummAiApiClient.

charludo · 2024-10-23T13:33:36Z

integreat_cms/summ_ai_api/utils.py

Could you either open an issue to refactor this file, or split it into multiple files in this PR? 🙈 660 lines is way too much for a utils.py 😆

charludo · 2024-10-23T13:34:11Z

integreat_cms/summ_ai_api/utils.py

+    @property
+    def valid(self) -> bool:
+        """
+        Wether or not the translation was successful


Suggested change

Wether or not the translation was successful

Whether or not the translation was successful

charludo · 2024-10-23T13:37:25Z

integreat_cms/summ_ai_api/utils.py

+    @property
+    def word_count(self) -> int:
+        """
+        How many words need to be translated..


Suggested change

How many words need to be translated..

How many words need to be translated.

JoeyStk · 2024-12-07T11:54:53Z

As this needs some major refactoring and time is sparse at the moment, it'll take some time until we will get back to this

Budget for Summ AI

292c552

PeterNerlich requested review from david-venhoff and MizukiTemma September 18, 2024 11:13

JoeyStk requested a review from jarlhengstmengel October 2, 2024 14:28

JoeyStk added the prio: high Needs to be resolved ASAP. label Oct 2, 2024

charludo self-requested a review October 8, 2024 06:18

charludo self-assigned this Oct 8, 2024

charludo requested changes Oct 23, 2024

View reviewed changes

JoeyStk marked this pull request as draft December 7, 2024 11:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Budget for Summ AI #3065

Budget for Summ AI #3065

PeterNerlich commented Sep 18, 2024

codeclimate bot commented Sep 18, 2024

charludo left a comment •

edited

Loading

charludo Oct 23, 2024

charludo Oct 23, 2024

charludo Oct 23, 2024

charludo Oct 23, 2024

charludo Oct 23, 2024 •

edited

Loading

charludo Oct 23, 2024

charludo Oct 23, 2024

JoeyStk commented Dec 7, 2024

	msgstr "Credits Zurücksetzungsdatum für Übersetzung in Einfache Sprache"
	msgstr "Zurücksetzungsdatum der Credits für Übersetzung in Einfache Sprache"

	msgid "Currently HIX is globally deactivated"
	msgid "HIX is currently deactivated globally"

	Wether or not the translation was successful
	Whether or not the translation was successful

	How many words need to be translated..
	How many words need to be translated.

Budget for Summ AI #3065

Are you sure you want to change the base?

Budget for Summ AI #3065

Conversation

PeterNerlich commented Sep 18, 2024

Short description

Proposed changes

Side effects

Resolved issues

codeclimate bot commented Sep 18, 2024

charludo left a comment • edited Loading

Choose a reason for hiding this comment

charludo Oct 23, 2024

Choose a reason for hiding this comment

charludo Oct 23, 2024

Choose a reason for hiding this comment

charludo Oct 23, 2024

Choose a reason for hiding this comment

charludo Oct 23, 2024

Choose a reason for hiding this comment

charludo Oct 23, 2024 • edited Loading

Choose a reason for hiding this comment

charludo Oct 23, 2024

Choose a reason for hiding this comment

charludo Oct 23, 2024

Choose a reason for hiding this comment

JoeyStk commented Dec 7, 2024

charludo left a comment •

edited

Loading

charludo Oct 23, 2024 •

edited

Loading