You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I found one research suggesting that 0-5 Likert scale for LLM-as-a-Judge framework is the one that is aligned with humans most. We can simply change our scale range based on that.
I found one research suggesting that 0-5 Likert scale for LLM-as-a-Judge framework is the one that is aligned with humans most. We can simply change our scale range based on that.
https://arxiv.org/pdf/2601.03444