Community
Q&A
Rovo
Questions
Evaluation Feature Failing for Valid Agent Responses

Evaluation Feature Failing for Valid Agent Responses

Hi everyone,
I’ve recently started experimenting with the newly launched Evaluation feature by Atlassian and have encountered an issue.
I created an agent and tested it through normal conversation. In this case, the agent responds correctly and behaves as expected for the given inputs.
However, when I use the same inputs via a CSV file in the Evaluation feature, all the test cases are marked as failed, even though the responses appear to be valid.
Has anyone faced a similar issue or knows what might be causing this behavior? Any guidance or suggestions would be greatly appreciated.
Thanks in advance! Screenshot 2026-06-25 112243.png

1 answer

0 votes

Hi @Jagruti Shinde - welcome to the Community,

can you share the setup of the CSV and the Evaluation itself? There are a few settings that influence how the evaluations show up.

"Failed" usually means that Rovo deviated from the expected response. But as that evaluation happens via LLM, they are not always accurate.

Evaluations best work with classification Agents that give a clear Yes or No answer (or whatever other classification you use).

You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.

Forums

Q&A

Community resources

Support

Top groups

Community resources

Support

Learn

Community resources

Support

Events

Community resources

Support

Evaluation Feature Failing for Valid Agent Responses

1 answer

Suggest an answer

Was this helpful?

Thanks!

TAGS

Atlassian Community Events