Can we get any of Iceberg/Delta/Hudi that isn't terribly complex to setup? Like configurable completely from Standard SQL via the CREATE EXTERNAL TABLE syntax.
I work on BigQuery. All of these are great points: just wanted to point out that BigQuery can federate into external data sources as well: e.g. files on cloud storage and BigTable. Relevant feature is BigLake: https://cloud.google.com/bigquery/docs/biglake-intro
Are there any performance benefits of BigLake over external tables stored in Parquet governed by Hive? Or is the main benefit the governance flexibility?
Currently the main benefit of BigLake over the current external tables is governance: you get row and column level security over cloud storage data. The governance is uniformly enforced across BigQuery and also the BigQuery storage API. The storage API can be used by any engine and we have pre-built open source connectors available for Spark, Presto/Trino, Dataflow and Tensorflow.
We're constantly working on improving BigQuery performance over open file formats on cloud storage. Some of these features will be specific to BigLake. Please stay tuned.
Former A9 employee here. A9 was not an acquisition - it was bootstrapped as a Bay Area subsidiary. Initially A9 focused on web search, then pivoted to doing product search for Amazon retail.
When we started using EMR, we'd never exhaust the number of available instances (in both spot and on-demand). Now, we frequently do. And, for the last 3 weeks, it's been close to impossible to launch a huge cluster.
They should really dispatched to fix the Google Drive Client program for Mac. It consistently maxes out a CPU even when there is no sync going on. I can consistently reproduce this issue on all Macs I own.
edit: Downvotes ahoy! I'm just relaying the information I found when searching. On the Anroid-DLS wiki, it gives a very generic "based on webkit" answer, so that doesn't say much one way or t'other.
It's WebKit, but so is Chrome. "Safari" is merely in the user-agent string for mobile website compatibility. Nothing about it is really Safari related.
Chrome will be coming to Android soon as per the notes made by the Chrome team in the last month.
"Half of my readers can see why" is an attempt at self-deprecation, since technology people respect the attitude in that (semi-joking) rant/post but a lot of people don't.
Franco has been described as having "an unusually high metabolism for productivity...a superhuman ability to focus". Dissatisfied with his career's direction, Franco reenrolled at UCLA in the fall of 2006 as an English major with a creative writing concentration. Having received permission to take as many as 62 course credits per quarter compared to the normal limit of 19 while continuing to act, he received his undergraduate degree in June 2008 with a GPA over 3.5.
He moved to New York to simultaneously attend graduate school at Columbia University's MFA writing program, New York University's Tisch School of the Arts for filmmaking,and Brooklyn College for fiction writing, while occasionally commuting to North Carolina's Warren Wilson College for poetry.[1] He received his MFA from Columbia in 2010. Franco is a Ph.D. student in English at Yale University and will also attend the Rhode Island School of Design.
Franco is obviously extremely productive and very bright, but I can't help but focus on the fact that he has a full+ time personal assistant. She's up 18 hours a day with him handing the "details" of his life, so he's free to focus on whatever he wants to focus on.
Sometimes I wonder what kind of difference that would make in my life, particularly when I was in grad school.
And then I tell myself to shut up and get back to work.
So... why not make it happen? Perhaps not all of it can be done, but Tim Ferriss talks about having an outsourced personal assistant.
If you are the type who could work all those extra hours, try outsourcing as many activities as possible. Start with housekeeping services, as an example, and perhaps a personal cook.
BigQuery also supports in-place querying of datasets on GCS (or S3/Azure using Omni) via external/BigLake tables. https://cloud.google.com/bigquery/docs/query-cloud-storage-u...