
It's an SDK built around a unified batch + stream processing model. The idea is that you code up your pipeline once in Dataflow and then have your choice of where to run it: Spark, Flink, etc.
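To make the "write once, pick where it runs" idea concrete, here is a toy sketch in plain Python. It is not the Dataflow SDK API; `Pipeline`, `BatchRunner`, and `StreamRunner` are hypothetical names invented for illustration, and real runners such as Spark or Flink do far more (distribution, windowing, fault tolerance).

```python
from typing import Callable, Iterable, Iterator, List


class Pipeline:
    """Hypothetical pipeline: an ordered list of per-element transforms.

    It only describes the computation; it knows nothing about how
    or where the data will be fed through it.
    """

    def __init__(self) -> None:
        self.steps: List[Callable[[Iterable], Iterator]] = []

    def map(self, fn: Callable) -> "Pipeline":
        # Record a transform that applies fn to every element.
        self.steps.append(lambda items: (fn(x) for x in items))
        return self

    def filter(self, pred: Callable) -> "Pipeline":
        # Record a transform that keeps only elements matching pred.
        self.steps.append(lambda items: (x for x in items if pred(x)))
        return self

    def apply(self, items: Iterable) -> Iterator:
        # Chain the recorded transforms over the given elements.
        for step in self.steps:
            items = step(items)
        return iter(items)


class BatchRunner:
    """Hypothetical runner: processes a bounded collection in one go."""

    def run(self, pipeline: Pipeline, data: list) -> list:
        return list(pipeline.apply(data))


class StreamRunner:
    """Hypothetical runner: processes elements one at a time as they arrive."""

    def run(self, pipeline: Pipeline, source: Iterator) -> Iterator:
        for element in source:
            yield from pipeline.apply([element])


# The same pipeline definition is handed to either runner unchanged.
pipeline = Pipeline().map(str.lower).filter(lambda w: len(w) > 3)

batch_result = BatchRunner().run(pipeline, ["Spark", "Flink", "Go"])
stream_result = list(StreamRunner().run(pipeline, iter(["Spark", "Flink", "Go"])))
```

The point of the split is that the pipeline object carries only the logic, so swapping the execution engine means swapping the runner, not rewriting the pipeline.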

Google Cloud Dataflow is a fully managed service that executes Dataflow pipelines and adds some nice extras on top, like fault tolerance and auto-optimization.

(Disclosure: I work on BigQuery, not Dataflow)


