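This is probably because your provider is splitting the document into chunks and generating embeddings over them to save money, then running a vector search and passing only the best-matching chunks to the model instead of fitting the whole document in context. Anything outside the retrieved chunks is never seen by the model.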
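To make the difference concrete, here is a rough sketch of what such a retrieval step might look like. It is not any particular provider's implementation: the `embed` function is a toy bag-of-words stand-in for a real embedding model, and the names `split_chunks`, `retrieve`, and the sample document are all made up for illustration.

```python
import math
import re
from collections import Counter


def split_chunks(text):
    """Split a document into sentence-sized chunks (real systems often use token windows)."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def embed(text):
    """Toy embedding: word counts. An assumption standing in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(document, query, k=1):
    """Return the k chunks most similar to the query; only these reach the model."""
    query_vec = embed(query)
    chunks = split_chunks(document)
    ranked = sorted(chunks, key=lambda c: cosine(embed(c), query_vec), reverse=True)
    return ranked[:k]


document = (
    "The invoice total is 400 dollars and is due in March. "
    "Shipping is handled by a separate carrier based in Ohio. "
    "Refunds are only issued within thirty days of purchase."
)

# With k=1, the model's context holds one sentence, not the whole document.
context = retrieve(document, "when are refunds issued", k=1)
print(context)
```

With `k=1` the only text handed to the model is the refunds sentence, so a follow-up question about the invoice or shipping would be answered without that information ever being in context.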