Basic Attention Token prediction Reddit