prediction for basic attention token