Data schemas and questions asked about data are just as much a company's IP as the data itself. It frustrates me that startups suddenly draw a line here for their own convenience when tuning generative AI. If I (as an employee) publicly posted all our database schemas and report descriptions, I would obviously be violating IP laws. Yet vendors think this "metadata" is fine to use and potentially leak across users.
We don't yet have enterprise-grade data permissioning or compliance certificates like Soc2. Those will come in time.