ETL solutions
Underlying Technology | Use Case | |
Dataflow | Apache Beam | Batch and Streaming Data at Scale Log ingestion/transformation AI/ML |
Data Fusion | CDAP DataProc | Streaming Data at Scale from various sources into a DWH Regular ingestion into DWH |
DataProc | GCE | Spark/Hadoop Cluster Distributed processing of large datasets |
Dataprep | Apache Beam | Data visualization, to explore, clean and prep for analysis and ML. |
Not being a data scientist and not having enough knowledge on the individual products, use cases or technology makes this one hard to get my head around.