[v2] convert category from length descriptor to modality in task metadata #1767

KennethEnevoldsen · 2025-01-11T20:05:33Z

Not sure if this is a good idea. Currently it is already somewhat vaguely defined.

I believe the original intention is to tell us something about the length (s2p: sentence to paragraph), but we know have the descriptive statistics which is a much better source.

However in MIEB it is used as "t2i", text to image.

@Muennighoff would love to know what you think:

here is a sampel from the desc. statistics:

...
        "average_document_length": 20.28592186371801,
        "max_document_length": 214210,
        "unique_documents": 1005474,
        "min_query_length": 2,
        "average_query_length": 38.259317745096176,
...

@isaac-chung you have also been involved greatly in both parts.

(an alternative is to convert the annotation in mieb into "s2i" meaning sentence to image)

The text was updated successfully, but these errors were encountered:

Muennighoff · 2025-01-11T21:32:03Z

Converting it to modality makes sense to me! s2p, p2p are much less specific than the actual lengths!

isaac-chung · 2025-01-11T23:38:23Z

Yes! This would align MTEB and MIEB in a much better way. The change I see from this is:

Update "s2p" and "p2s" -> "t2t"

Samoed · 2025-01-12T05:39:58Z

I think s2p is still relevant because some models don’t use prompts for passages, and it can be helpful to differentiate between s2s and s2p.

isaac-chung · 2025-01-12T05:52:13Z

Hmm true. Could you share an example of a model using a task's category to determine whether to use prompts or not? Might help us find a better way forward.
What alternative do you propose? I see an option with t2t being a parent category, and s2p being a child category, e.g. {"t2t": "s2p"}

Samoed · 2025-01-12T06:29:04Z

For now in NV-Embed this used, but in simple way

mteb/mteb/models/nvidia_models.py

Lines 51 to 53 in c3b46b7

    
           instruction = "" 
        
           if prompt_type == PromptType.query: 
        
               instruction = self.get_instruction(task_name, prompt_type)

and I created also for jasper a bit more complicated

mteb/mteb/models/jasper_models.py

Lines 47 to 48 in c3b46b7

    
           if prompt_type == PromptType.passage and task.metadata.type == "s2p": 
        
               instruction = None

(will add as new sentence instruct wrapper in #1768)

KennethEnevoldsen · 2025-01-12T20:40:01Z

I can't see that is it used in nv-embed? Am I missing something?

I think s2p is still relevant because some models don’t use prompts for passages, and it can be helpful to differentiate between s2s and s2p.

I reviewed #1768 and I am not quite sure why s2s or s2p is required here. Read the model card for jasper but couldn't find any case.

I might be missing something, but queries and passages can the disambiguate by the prompt. p in s2p as I understand stands for paragraph not passage.

isaac-chung · 2025-01-13T00:40:59Z

Agree with Kenneth above, and p does mean paragraph in the paper. Based on the discussion in #1768 and here, I feel the best course of action for us now is to:

continue relying on PromptType to determine prompts (and not category)
Move forward with the change proposed in this comment

KennethEnevoldsen mentioned this issue Jan 13, 2025

Merge v2.0.0: Overview issue #1791

Open

21 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[v2] convert category from length descriptor to modality in task metadata #1767

[v2] convert category from length descriptor to modality in task metadata #1767

KennethEnevoldsen commented Jan 11, 2025 •

edited

Loading

Muennighoff commented Jan 11, 2025 •

edited

Loading

isaac-chung commented Jan 11, 2025

Samoed commented Jan 12, 2025

isaac-chung commented Jan 12, 2025

Samoed commented Jan 12, 2025 •

edited

Loading

KennethEnevoldsen commented Jan 12, 2025

isaac-chung commented Jan 13, 2025

[v2] convert category from length descriptor to modality in task metadata #1767

[v2] convert category from length descriptor to modality in task metadata #1767

Comments

KennethEnevoldsen commented Jan 11, 2025 • edited Loading

Muennighoff commented Jan 11, 2025 • edited Loading

isaac-chung commented Jan 11, 2025

Samoed commented Jan 12, 2025

isaac-chung commented Jan 12, 2025

Samoed commented Jan 12, 2025 • edited Loading

KennethEnevoldsen commented Jan 12, 2025

isaac-chung commented Jan 13, 2025

KennethEnevoldsen commented Jan 11, 2025 •

edited

Loading

Muennighoff commented Jan 11, 2025 •

edited

Loading

Samoed commented Jan 12, 2025 •

edited

Loading