Implement BLEU score evaluation for NLP tests #6537

Merged

merged 13 commits on Jun 27, 2025

Conversation

peterwald
Member

@peterwald peterwald commented Jun 23, 2025

Add a new library for algorithmic NLP scoring evaluators, named Microsoft.Extensions.AI.Evaluation.NLP.

The first such evaluator to be implemented is BLEU. For this implementation we default to four uniform weights for the n-gram comparisons (i.e., up to 4-grams), and we use smoothing method 4 from Chen and Cherry (2014) for sentence-level BLEU scores. These are the same defaults chosen by the Azure Python evaluation SDK.
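
To make those defaults concrete, here is a minimal, self-contained C# sketch of sentence-level BLEU with uniform weights over 1- to 4-grams (clipped n-gram precision plus a brevity penalty). It is illustrative only: the names are hypothetical, it is not this library's public API, and it omits the Chen and Cherry (2014) method-4 smoothing, so any n-gram order with zero matches simply zeroes the score.

```csharp
// Illustrative sketch of sentence-level BLEU with uniform weights over 1..4-grams.
// Not the library's API; smoothing (Chen & Cherry 2014, method 4) is omitted here.
using System;
using System.Collections.Generic;
using System.Linq;

static class BleuSketch
{
    static Dictionary<string, int> NGramCounts(IReadOnlyList<string> tokens, int n)
    {
        var counts = new Dictionary<string, int>();
        for (int i = 0; i + n <= tokens.Count; i++)
        {
            // Join the n tokens with an unlikely separator to form the n-gram key.
            string gram = string.Join("\u0001", tokens.Skip(i).Take(n));
            counts[gram] = counts.TryGetValue(gram, out int c) ? c + 1 : 1;
        }
        return counts;
    }

    public static double SentenceBleu(IReadOnlyList<string> reference, IReadOnlyList<string> hypothesis, int maxN = 4)
    {
        double logPrecisionSum = 0.0;
        for (int n = 1; n <= maxN; n++)
        {
            var hypCounts = NGramCounts(hypothesis, n);
            var refCounts = NGramCounts(reference, n);

            // Clipped (modified) n-gram precision: each hypothesis n-gram is credited
            // at most as many times as it occurs in the reference.
            int matches = hypCounts.Sum(kv => Math.Min(kv.Value, refCounts.GetValueOrDefault(kv.Key)));
            int total = Math.Max(1, hypothesis.Count - n + 1);
            if (matches == 0)
            {
                return 0.0; // without smoothing, a zero-match order zeroes the whole score
            }

            // Uniform weight 1/maxN on the log precision of each order.
            logPrecisionSum += (1.0 / maxN) * Math.Log((double)matches / total);
        }

        // Brevity penalty: penalize hypotheses shorter than the reference.
        double bp = hypothesis.Count >= reference.Count
            ? 1.0
            : Math.Exp(1.0 - (double)reference.Count / hypothesis.Count);

        return bp * Math.Exp(logPrecisionSum);
    }
}
```

With identical token sequences the sketch returns 1.0, and partial overlaps fall between 0 and 1; smoothing matters mainly for short sentences, where higher-order n-grams often have no matches at all.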

Also included is a simple word tokenizer based on the tokenizers used by other BLEU implementations, such as Moses, SacreBLEU, and NLTK.
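
As a rough idea of that style of tokenization (again a sketch under assumed behavior, not the tokenizer shipped in this change), splitting punctuation and symbols into their own tokens and then splitting on whitespace covers the common cases:

```csharp
// Rough sketch of a Moses/NLTK-style word tokenizer: separate punctuation and
// symbol characters into their own tokens, then split on whitespace.
// Illustrative only; the library's tokenizer handles more cases.
using System;
using System.Text.RegularExpressions;

static class TokenizerSketch
{
    public static string[] Tokenize(string text)
    {
        // Surround punctuation/symbol characters with spaces so each becomes its own token.
        string spaced = Regex.Replace(text, @"([\p{P}\p{S}])", " $1 ");
        return spaced.Split(new[] { ' ', '\t', '\r', '\n' }, StringSplitOptions.RemoveEmptyEntries);
    }
}
```

For instance, `Tokenize("Hello, world!")` yields `["Hello", ",", "world", "!"]`.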


@peterwald peterwald requested a review from a team as a code owner June 23, 2025 21:55
@github-actions github-actions bot added the area-ai-eval Microsoft.Extensions.AI.Evaluation and related label Jun 23, 2025
This was referenced Jul 28, 2025
@github-actions github-actions bot locked and limited conversation to collaborators Jul 30, 2025