How I Passed GCP Data Engineer Certification Exam in Year 2021.

Preparation Roadmap

Useful Videos

  1. What is BigTable?: https://www.youtube.com/watch?v=Lq9uDOM4whI
  2. What is Spanner?: https://www.youtube.com/watch?v=bUSU1e9j8wc
  3. What is Streaming Analytics?:https://www.youtube.com/watch?v=JRHy8fUQLYQ

Reading Articles/Documents

  1. BigTable Replication:(eventually consistent by default) you need to understand replication use case and how they are impacted by eventual consistency. Document Link that can be referred.
  2. Spanner Replication: (strongly consistent) you need to understand how strong consistency are implemented with Spanner replicas. Document Link that can be referred.
  3. Cloud Storage-data lake: you need to understand that how different GCP products can be used as part of an overall solution. Document Link that can be referred.
  4. Cloud Data-Lifecycle Platform: you need to understand the role of the different products within the data lifecycle. Document Link that can be referred.

Labs for Practice

  1. Do the Bigtable: Qwik Start — Command Line lab
  2. Do the Cloud Spanner: Qwik Start -lab
  3. Loading Taxi Data into Google Cloud SQL

Useful Videos

  1. How to Schedule Snapshot for Automatic Backup?: https://www.youtube.com/watch?v=Lq9uDOM4whI
  2. How to control access for Cloud Storage?: https://www.youtube.com/watch?v=DYGFcIB23oA&list=PLIivdWyY5sqJcBvDh5dfPoblLGhG1R1-O&index=5
  3. What are best practices for Block Storage?: https://youtu.be/QNpOm_nuZVU

Reading Articles/Documents

  1. Cloud Storage features: you need to get familiar with all listed features of Cloud Storage. Document Link that can be referred here.
  2. Google Transfer Appliance: you need to understand use case for Transfer Appliance and the basic procedure to use it. Document Link that can be referred here.
  3. Transferring large volume of data to Google Cloud: you need to understand especially the different options available from Google, and the use case for each of them e.g. gsutil, Storage Transfer Service, transfer Appliance etc. Document Link that can be referred here.

Labs for Practice

  1. Do the ‘Google Cloud Storage — Bucket lock’ lab.
  2. ‘Clean Up Unused and Orphaned Persistent Disks’ Lab.
  3. ‘Cloud CDN’ Lab.
  4. ‘Optimizing Cost with Google Cloud Storage’ Labs.

Useful Videos

  1. What is Map Reduce?: https://www.youtube.com/watch?v=43fqzaSH0CQ
  2. What is a columnar database?: https://www.youtube.com/watch?v=8KGVFB3kVHQ
  3. Creating a large Dataproc cluster with preemptible VMs: https://www.youtube.com/watch?v=iyqtKV0fnFc
  4. Memory store in a minute: https://www.youtube.com/watch?v=ra3Vow3-HHg
  5. High Availability and DR in Cloud SQL: https://www.youtube.com/watch?v=ghdleRvGExg

Reading Articles/Documents

  1. What are Cloud Storage features: you need to get familiar with all the features listed here for Cloud Storage. Document Link that can be referred here.
  2. Transferring large datasets to Google Cloud: you need to get familiar especially the different options available from Google, and the use case for each of them e.g. gsutil, Storage Transfer Service, transfer Appliance etc. Document Link that can be referred here.
  3. What is Google Transfer Appliance: you need to get familiar with the use case for Transfer Appliance and the basic procedure. Document Link that can be referred here.

Labs for Practice

  1. Do the ‘Datastore:Qwik Start lab’ lab.
  2. Do the ‘Building an IoT Analytics Pipeline on Google Cloud’ lab.
  3. Do the ‘Do the Bigtable: Qwik Start — Hbase Shell lab’ lab.
  4. Do the ‘Creating Date-Partitioned Tables in BigQuery’ lab.

Useful Videos

  1. What is analysing Geospatial data in BigQuery GIS?: https://www.youtube.com/watch?v=BUEmvT3p_zE
  2. How to protect data with authorized views?: https://www.youtube.com/watch?v=8fNnqshJ2Nw
  3. What are user defined functions: https://www.youtube.com/watch?v=c3dtgLWRycs

Reading Articles/Documents

  1. Big Query explained: Storage Overview: you need to get familiar with the key concepts. Document link that can be referred here.
  2. What are 7 best practices for production Dataproc clusters: you need to get familiar with each of these best practice of Dataproc. Document link that can be referred here.
  3. Pub/sub and realtime analytics: you need to get familiar with how Pub/Sub complements data analytics. Document link that can be referred here.

Labs for Practice

  1. Do the ‘Explore and Create Reports with Data Studio’ lab.
  2. Do the ‘ETL Processing on Google Cloud Using Dataflow and BigQuery’ lab.

Useful Videos

  1. What is Decision Pyramid: How to choosing the Right ML Tools?: https://www.youtube.com/watch?v=pm_-pVPvZ-4
  2. What are the seven steps of machine learning?: https://www.youtube.com/watch?v=nKW8Ndu7Mjw
  3. Learn Apache Beam in 15 minutes or less: https://www.youtube.com/watch?v=go8G0tW4KgU

Reading Articles/Documents

  1. Beam Programming Guide: you need to get familiar with the key concepts. Document link that can be referred here.
  2. ML Crash Course — basic ML terminology: you need to get familiar with the key concepts. Document link that can be referred here.
  3. Pub/Sub: you need to get familiar with the key concepts. Document link that can be referred here.
  4. ML topics on DE exam — Miscellaneous notes: you need to get familiar with the key concepts. Document link that can be referred here.

Labs for Practice

  1. Do the ‘Cloud Composer: Copying BigQuery Tables Across Different Locations’ lab.
  2. Do the ‘Creating a Data Transformation Pipeline with Cloud Dataprep’ lab.

Useful Videos

  1. What is Cloud TPU?: https://www.youtube.com/watch?v=pm_-pVPvZ-4
  2. What is Tensorflow Playground ?: https://www.youtube.com/watch?v=nKW8Ndu7Mjw
  3. How to integrating 4 Machine Learning APIs: https://www.youtube.com/watch?v=JlzvXRegkOU
  4. Awesome GCP — walkthrough of sample exam questions as needed: https://www.youtube.com/channel/UCIGDDqu5DzlaaC4XzXj_4-A

Reading Articles/Documents

  1. Kubeflow use cases: you need to get familiar with the basic use cases for Kubeflow. Document link that can be referred here.
  2. Best practices for ML optimization on Google Cloud: you need to understand how the different tools in Google Cloud are used in an ML project. Document link that can be referred here.
  1. Test1
  2. Test2
  3. Test3
  4. Test4
  5. Test5
  6. Test6

--

--

--

Hi, I am a Certified Google Cloud Data engineer. I use Medium platform to share my experience with other members of Medium network.

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Customizing Django’s User Model- Login Via Email, Customizing User registration Functions.

Make your apps fold; Make it work with foldable displays

Over-Provisioned and Over-Permissioned Containers & Kubernetes

Thinc. First Code Android (Part I: Setting up & First glimpse of Android Studio)

Tuesday — Week Eleven

Apache Airflow Tutorial, Part 2: Complete Guide for a Basic Production Installation Using…

The Rust Programming Language — Chapter 2

Getting started with Azure DevOps — CI/CD Pipelines

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Vibhor Gupta

Vibhor Gupta

Hi, I am a Certified Google Cloud Data engineer. I use Medium platform to share my experience with other members of Medium network.

More from Medium

How to pass the AWS Cloud Practitioner Certification?

21st Century Data Analytics with AWS

Introduction to Big Data with Spark and Hadoop — Week 4. DataFrames and SparkSQL

Calculating a Portfolio of Securities : Finding Best Rate of Return for selection of Best…