Long eval clean up times
This page describes two methods to use together to reduce long clean-up times when you run W&B Weave evaluations with la …
What is pairwise evaluation and how do I do it?
When you score models in a Weave evaluation, absolute value metrics (for example, 9/10 for Model A and 8/10 for Model B) …