Microsoft.Extensions.AI.Evaluation.NLP Namespace
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Classes
| BLEUEvaluator | An IEvaluator that evaluates the quality of a response produced by an AI model by comparing it to a reference response using the BLEU (Bilingual Evaluation Understudy) algorithm. BLEU is often used to evaluate the quality of machine translation or text generation tasks. |
| BLEUEvaluatorContext | Contextual information that the BLEUEvaluator uses to compute the BLEU score for a response. |
| F1Evaluator | An IEvaluator that evaluates the quality of a response produced by an AI model by comparing it to a reference response using the F1 scoring algorithm. The F1 score is the harmonic mean of precision and recall, computed over the words shared between the generated response and the reference response. |
| F1EvaluatorContext | Contextual information that the F1Evaluator uses to compute the F1 score for a response. |
| GLEUEvaluator | An IEvaluator that evaluates the quality of a response produced by an AI model by comparing it to one or more reference responses using the GLEU (Google-BLEU) algorithm, which measures similarity by n-gram overlap. |
| GLEUEvaluatorContext | Contextual information that the GLEUEvaluator uses to compute the GLEU score for a response. |
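
Remarks

To make the word-overlap and n-gram-overlap ideas behind these scores concrete, the following is a minimal, self-contained sketch. It does not call this namespace and is not the library's implementation. The helper names (Tokenize, CountNGrams, Overlap, F1, Gleu), the lowercase whitespace tokenizer, the single reference string, and the maximum n-gram length of 4 are all assumptions chosen for illustration. The GLEU function follows the commonly cited sentence-level definition (the minimum of n-gram precision and recall); the evaluators here may tokenize and score differently.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical tokenizer: lowercase, split on whitespace.
// The library's own tokenization may differ.
static string[] Tokenize(string text) =>
    text.ToLowerInvariant().Split(
        new[] { ' ', '\t', '\r', '\n' },
        StringSplitOptions.RemoveEmptyEntries);

// Counts every n-gram of length 1..maxN, keyed by its space-joined tokens.
static Dictionary<string, int> CountNGrams(string[] tokens, int maxN)
{
    var counts = new Dictionary<string, int>();
    for (int n = 1; n <= maxN; n++)
    {
        for (int i = 0; i + n <= tokens.Length; i++)
        {
            string gram = string.Join(" ", tokens, i, n);
            counts[gram] = counts.GetValueOrDefault(gram) + 1;
        }
    }
    return counts;
}

// Clipped overlap: each n-gram is credited at most as often as it
// appears in the other text.
static int Overlap(Dictionary<string, int> a, Dictionary<string, int> b) =>
    a.Sum(kv => Math.Min(kv.Value, b.GetValueOrDefault(kv.Key)));

// Word-level F1: harmonic mean of precision and recall over shared words.
static double F1(string response, string reference)
{
    var resp = CountNGrams(Tokenize(response), 1);
    var gold = CountNGrams(Tokenize(reference), 1);
    int shared = Overlap(resp, gold);
    if (shared == 0) return 0.0;
    double precision = (double)shared / resp.Values.Sum();
    double recall = (double)shared / gold.Values.Sum();
    return 2 * precision * recall / (precision + recall);
}

// Sentence-level GLEU against one reference: min(precision, recall)
// over all n-grams of length 1..maxN.
static double Gleu(string response, string reference, int maxN = 4)
{
    var resp = CountNGrams(Tokenize(response), maxN);
    var gold = CountNGrams(Tokenize(reference), maxN);
    int larger = Math.Max(resp.Values.Sum(), gold.Values.Sum());
    return larger == 0 ? 0.0 : (double)Overlap(resp, gold) / larger;
}

double f1 = F1("the cat sat on the mat", "the cat is on the mat");
double gleu = Gleu("the cat sat on the mat", "the cat is on the mat");
Console.WriteLine($"F1 = {f1}, GLEU = {gleu}");
```

For the sample pair above, five of the six words in each sentence are shared, so precision and recall are both 5/6 and the F1 score is 5/6. Nine of the eighteen n-grams of length 1 to 4 match, so the GLEU sketch returns 0.5. Both scores fall between 0.0 (no overlap) and 1.0 (identical token sequences).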