Leading a new team to unify a lot of our data sources at my company, and searching for vendors/tools from BI to data exploration (through means of a data catalogue) to data ingestion.
A hard requirement is we need to be own our data because data leaks for us are incredibly detrimental to our customers (in ways where even email leaks can allow our custs to be targeted by phishing, scams for years to come).
But looking around, so often the industry leaders like atlan or rudderstack or whatever don't offer a method to self host. To be clear, we do not care about paying, we have a high budget.
We just need to take ownership of our data because a breach can kill our company, but not necessarily true for a dedicated data platform (see snowflake).
I work for ClickHouse and available for any queries or help you need.
[0]: https://clickhouse.com/ [1]: https://clickhouse.com/docs/en/about-us/adopters