With good quantization you can get 36GB down to 8GB. To get 36B down to 8B you need good pruning.