What we are arguing is the Chinese room problem coupled with Goodhearts law.
The people that need to interpret the metrics might be a professional that understand their meaning, or they might be an AI following arbitrary instructions. The first group will use the metrics to create a better team, and the second will make people angry.