Cloud Services
Serverless Data Warehouse
★ 4.8
High-Performance Virtual Machines
★ 4.6
pip install google-cloud-bigquerypip install google-cloud-computepip install google-cloud-bigquerypip install google-cloud-computePython data engineers use the `google-cloud-bigquery` client to run analytical SQL and pull results into pandas — `client.query(sql).to_dataframe()` is the most common pattern. Engineers also use `load_table_from_dataframe()` to write pandas DataFrames back to BigQuery tables, and the BigQuery Storage API for high-throughput reads of large tables.
Python data engineers provision and manage Google Compute Engine VMs using the google-cloud-compute Python library or Terraform. GCE is used to run self-hosted data engineering tools like Apache Airflow, Spark clusters, and PostgreSQL databases on managed VMs. Engineers use Preemptible VMs for cost-efficient batch processing jobs and custom machine types to right-size compute for memory-intensive transformation workloads.
Individual Tool Pages