Is there a specific reason why we only mask the ground truth and not the output in the evaluation?
Is there a specific reason why we only mask the ground truth and not the output in the evaluation?