> Claude knows - at least mostly - when its hallucinating.
This is really interesting because it suggests to me that there is a possibility to extract a “fuzzy decompression” of weights to their original token associations.
> Claude knows - at least mostly - when its hallucinating.
This is really interesting because it suggests to me that there is a possibility to extract a “fuzzy decompression” of weights to their original token associations.