Might be interested in orthogonal reading - "The Textual Warehouse" (ISBN-10:‎ 163462954X) by data warehouse pioneer Bill Inmon. He is and always has been ahead of his time with his thinking!

This does indeed look really interesting. We have deterministic validations (and some deterministic excel transformations) but using more deterministic transformations for text based on traditional NLP would be a nice complement.