Terence Tao: Quantitative AI progress needs accurate and transparent evaluation