how's it handle longer context or does it start hallucinating after like 2 sentences? curious what the ceiling is before the 9M params
how's it handle longer context or does it start hallucinating after like 2 sentences? curious what the ceiling is before the 9M params