Although the validation set contains representative recordings for two of the final four recording sites of the test set, there are large differences in cmap and rmap values. Our model for example achieves a cmap value of 0.1480 (rmap: 0.2220) on the validation set in contrast to a cmap score of 0.000212 (rmap: 0.025) at the official evaluation of the test set. Does anyone have an explanation for these discrepancies?
We submitted two birdclef runs. How can we see whether this was successful? The number of remaining submissions is still 10. We neither received a confirmation message nor a failure message.