- Log information such as "% solved PO", tactic choices and ratio, successful and failed proofs, reward etc. to separate files. - Make quantitative information such as "% solved PO" easily plottable (e.g. .csv file). - Implement plotting for "% solved PO" throughout training (and evaluation) process.