Evaluation & Analysis
Table of contents
Analysis and Visualization
The above scripts will result in logs, prediction results, and retrieved data (if any). The paths of the saved files can be found in the script, and you can customize the paths if you’d like.
To analyze the performance and the behavior of the results on multiple task, please refer to the scripts/visualize_scores.py
. We also use scripts/task_corr.ipynb
to draw heatmaps for understanding the task correlation.