GCP Cloud Architect Study Guide – Data Services

ETL solutions

Service      | Underlying Technology | Use Case
Dataflow     | Apache Beam           | Batch and streaming data at scale; log ingestion/transformation; AI/ML
Data Fusion  | CDAP, Dataproc        | Streaming data at scale from various sources into a data warehouse (DWH); regular ingestion into a DWH
Dataproc     | GCE                   | Spark/Hadoop cluster; distributed processing of large datasets
Dataprep     | Apache Beam           | Data visualization, to explore, clean and prep data for analysis and ML
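
To make the "ETL" in the heading a bit more concrete, here is a minimal sketch of the extract, transform, load pattern in plain Python, using the log ingestion/transformation use case as the example. It does not use any GCP API: the log format, the sample lines, and the function names (extract, transform, load, run_pipeline) are all assumptions made up for illustration, and an in-memory CSV buffer stands in for a data warehouse table. The services in the table run this same shape of work at scale.

```python
import csv
import io

# Hypothetical raw log lines; the format is an assumption for illustration.
RAW_LOGS = [
    "2024-01-01T10:00:00 INFO user=alice action=login",
    "2024-01-01T10:00:05 ERROR user=bob action=upload",
    "not a valid log line",
    "2024-01-01T10:01:00 INFO user=alice action=logout",
]

FIELDS = ["timestamp", "level", "user", "action"]


def extract(lines):
    """Extract: yield raw records one at a time, as a source reader would."""
    for line in lines:
        yield line


def transform(records):
    """Transform: parse each record into a dict and drop malformed ones."""
    for record in records:
        parts = record.split()
        if len(parts) != 4:
            continue  # skip lines that do not match the assumed format
        timestamp, level, user, action = parts
        yield {
            "timestamp": timestamp,
            "level": level,
            "user": user.partition("=")[2],      # text after "user="
            "action": action.partition("=")[2],  # text after "action="
        }


def load(rows):
    """Load: write rows as CSV into a buffer standing in for a DWH table."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    for row in rows:
        writer.writerow(row)
    return buffer.getvalue()


def run_pipeline(lines):
    """Chain the three stages; generators keep it one record at a time."""
    return load(transform(extract(lines)))


if __name__ == "__main__":
    print(run_pipeline(RAW_LOGS))
```

Because each stage is a generator, records flow through one at a time, which is roughly the idea behind treating batch and streaming input with the same pipeline code.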

I am not a data scientist, and I don't yet know enough about the individual products, their use cases, or the underlying technology, which makes this topic hard to get my head around.
