Draq – Athina

Does Response Answer Query

This evaluator checks if the response answer's the query sufficiently.

Required Args

Default Engine: gpt-4

🚫

Eval Result

Result: Fail
Explanation: The query is asking which spaceship landed on the moon first, but the response only mentions the name of the astronaut, and does not say anything about the name of the spaceship.

from athina.loaders import RagLoader
 
# Load the data from CSV, JSON, Athina or Dictionary
dataset = RagLoader().load_json(json_file)

from athina.evals import DoesResponseAnswerQuery
 
DoesResponseAnswerQuery().run_batch(data=dataset)

from athina.evals import DoesResponseAnswerQuery
 
DoesResponseAnswerQuery().run(
    query=query,
    response=response
)