Statistics vs. semantics: Project similarity bias and variance neglect in forecast metric evaluation