Sorry to bother again.
Does this file provides a metric to calculate the overall halluc metric: https://github.com/patrick-tssn/VideoHallucer/blob/main/evaluations/evaluation_halluc.py?
In the line 37 and line 38, which line should be used? From the paper, section 3.3 says We then calculate the overall accuracy by considering both the basic and hallucinated questions as a paired set, marking it as a hit only if both questions are answered correctly., thus should we use re.search(y_pattern, basic_pred, re.IGNORECASE) and re.search(n_pattern, halluc_pred, re.IGNORECASE) to calculate the overall metric? (if I understand correctly)
Sorry to bother again.
Does this file provides a metric to calculate the overall halluc metric: https://github.com/patrick-tssn/VideoHallucer/blob/main/evaluations/evaluation_halluc.py?
In the line 37 and line 38, which line should be used? From the paper, section 3.3 says
We then calculate the overall accuracy by considering both the basic and hallucinated questions as a paired set, marking it as a hit only if both questions are answered correctly., thus should we usere.search(y_pattern, basic_pred, re.IGNORECASE)andre.search(n_pattern, halluc_pred, re.IGNORECASE)to calculate the overall metric? (if I understand correctly)