Hacker News

nighthawk454 3 days ago

Yeah, but it's "quantization-aware" during training too, which is presumably what allows the quantization at inference to work.
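
For anyone unfamiliar with the idea, here is a rough sketch of what "quantization-aware" training usually means: the forward pass sees weights rounded to the integer grid, while the gradient is applied to a full-precision copy as if the rounding were not there (the straight-through estimator). This is a toy one-weight example under my own assumptions, not the method of the model being discussed; `fake_quantize`, `train_qat`, and the `scale`, `lr`, and `steps` values are all made up for illustration.

```python
def fake_quantize(w, scale, bits=8):
    """Round w to the nearest point on a signed integer grid, then map back to float."""
    qmax = 2 ** (bits - 1) - 1   # 127 for 8 bits
    qmin = -(2 ** (bits - 1))    # -128 for 8 bits
    q = max(qmin, min(qmax, round(w / scale)))
    return q * scale


def train_qat(xs, ys, scale=0.05, lr=0.01, steps=200):
    """Fit y = w * x while the forward pass only ever sees the quantized weight."""
    w = 0.0  # full-precision weight, kept only during training
    for _ in range(steps):
        wq = fake_quantize(w, scale)
        # Gradient of the mean squared error with respect to the quantized weight.
        grad = sum(2 * (wq * x - y) * x for x, y in zip(xs, ys)) / len(xs)
        # Straight-through estimator: treat d(wq)/d(w) as 1 and update w directly.
        w -= lr * grad
    # At inference only the quantized value would be used.
    return w, fake_quantize(w, scale)


if __name__ == "__main__":
    xs = [1.0, 2.0, 3.0]
    ys = [0.73 * x for x in xs]
    w, wq = train_qat(xs, ys)
    print("full-precision weight:", w)
    print("quantized weight used at inference:", wq)
```

The point is that the loss is computed with the rounded weight throughout, so training settles on a value that still works after rounding, instead of finding out about the rounding error only at inference time.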