Do they remove personal information(names/dates/SSNs etc) before using data for training?

If not, you should mask your personal info before you sent it to Anthropic (or OpenAI, Google).

Use this maybe - https://github.com/deepanwadhwa/zink#shielding-llm-and-api-c...