Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Add BERTScore feature descriptor and feature #1335

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

projectultra
Copy link

PR for the BERTScore feature from #1317 . The main changes include adding the BERTScore descriptor and feature, updated registry, added tests.

New Feature Implementation:

  • src/evidently/features/BERTScore_feature.py: Introduced the BERTScoreFeature class that generates BERT-based similarity scores between two text columns. This class uses a pretrained BERT model (bert-base-uncased) to compute the cosine similarity between token embeddings, then calculates the precision, recall, then calculates the F1 score (BERTScore).

image
image

Testing:

Obtained the expected values using Repo of the paper referenced in the paper BERTScore: Evaluating Text Generation with BERT. The results are available here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant