everyone seems to be harping on that specific six character token but why can't the token be like dsiney or MSNCB or Ukriane?
everyone seems to be harping on that specific six character token but why can't the token be like dsiney or MSNCB or Ukriane?
It can. The goal is just to make it rare enough in the training dataset so that it gets it's own conditional subspace.