founder of mixpeek here, we fine-tune late interaction models on pdfs based on domain https://mixpeek.com/extractors

Do you offer local or on-premise models? There are certain PDF's we cannot send to an API.