Home/Library/Google Cloud Cost Optimization
Cluster pillar · Google Cloud · 28 guides

The Complete Guide to Google Cloud Cost Optimization in 2026

Everything that moves a Google Cloud bill, in the order it pays to do it. From committed use discounts and BigQuery editions to GKE and storage lifecycle, written by certified FinOps practitioners who cut bills 31% on average across more than 500 environments.

Google Cloud cost optimization is the discipline of getting the same workloads to run for less on Google Cloud Platform, then keeping the savings in place as you scale. It is not a single setting. It is a sequence: see where the money goes, cut the waste, lock in the discounts on a clean baseline, and run the whole thing continuously. This guide covers that full sequence for GCP, and links to a detailed article for every lever.

Most Google Cloud bills carry the same problem every cloud bill carries. Roughly a third of the spend is waste: oversized Compute Engine VMs, idle Cloud SQL instances, untiered Cloud Storage, BigQuery queries that scan far more than they need, and committed use discounts bought on the wrong baseline. The good news is that GCP gives you unusually good native tooling to find and fix it, from the Recommender and Active Assist to a granular billing export into BigQuery. The skill is using those tools in the right order so you never lock in waste.

We are an independent, vendor neutral advisory. We sit on your side of the table against the bill, not Google's. This guide reflects how we actually run a Google Cloud engagement. If you want the fix done for you, our Google Cloud cost optimization service applies exactly this method, and you can pay a fixed fee or only from realized savings. This pillar is also part of our wider complete cloud cost optimization playbook for 2026, which connects the GCP work to AWS, Azure and OCI.

Where Google Cloud money leaks

Before you reduce a Google Cloud bill you have to know what you are paying for. In our assessments, GCP waste concentrates in five places. Compute Engine is usually first: VMs provisioned for peak and never resized, plus development instances that run nights and weekends for no reason. Cloud SQL is second: high tier instances kept on around the clock for workloads that idle most of the day. Cloud Storage is third: terabytes sitting in Standard class that have not been read in months. BigQuery is fourth: on-demand queries that scan whole tables because nobody partitioned or clustered them. And commitments are fifth: customers who bought a three year commitment on an oversized fleet, locking the waste in for the full term.

The pattern matters because it dictates order. You rightsize and clear idle spend first, because that shrinks the baseline. Only then do you buy commitments, because a commitment bought on a rightsized fleet saves money while a commitment bought on a bloated fleet just subsidizes the bloat. The rest of this guide follows that order. For a fast pass, our Google Cloud cost optimization checklist of 30 quick wins is the condensed version.

Locked proof point

Across more than 500 cloud environments since 2019 we have optimized over $420M in spend at an average 31% reduction in the monthly bill. On the performance model, if we save you nothing, you pay nothing.

Step 1 · See: get the billing picture right

You cannot reduce a Google Cloud bill you cannot read. The Cloud Billing console gives you reports broken down by project, service and SKU, and that is the place to start. But the real visibility comes from the BigQuery billing export, which streams detailed and pricing data into a dataset you can query yourself. With it you can answer questions the console will not, such as which label is driving egress, or how much a single team spent on BigQuery slots last week. Our guide to Cloud Billing reports and the BigQuery billing export walks through standing it up.

Visibility also depends on structure. Google Cloud organizes spend by billing account, then projects, then resources, and you allocate cost with labels and the resource hierarchy of folders. If projects and labels are messy, every report downstream is messy too. Start with understanding Google Cloud billing accounts and projects and then labels and folders for cost allocation. Before you commit to anything, run a structured Google Cloud cost assessment so you have a baseline to measure against.

Step 2 · Cut: rightsize and clear the waste

This is where the fastest savings live, and it happens before any discount is bought. Google Cloud's Recommender continuously analyzes usage and suggests smaller machine types for underused VMs, flags idle resources, and surfaces other savings. Our guide to rightsizing Compute Engine VMs with Recommender shows how to act on those suggestions safely, and GCP Active Assist and Recommender for cost savings covers the wider toolset including idle VM, idle disk and idle IP recommendations.

For interruptible and batch workloads, Spot VMs and preemptible instances cut compute cost by 60 to 91 percent against on-demand. Spot VMs replaced the older preemptible model and removed the 24 hour maximum runtime, so they fit far more workloads than people assume. Cleaning up storage is part of the cut too: persistent disk and storage cleanup removes unattached disks and old snapshots that quietly accrue. Managed services need attention as well, so see Cloud SQL cost optimization and Cloud Run and Cloud Functions cost optimization for the application tier.

Step 3 · Rate: sustained use and committed use discounts

Once the fleet is rightsized, you buy rate. Google Cloud has two distinct discount mechanisms and they stack differently from AWS or Azure.

Sustained use discounts (SUDs) are automatic. Run an eligible machine for a large share of the month and Google applies a discount with no commitment, up to around 30 percent for sustained use on general purpose and sole tenant resources. You do not buy them, but you should understand them because they interact with commitments. See how sustained use discounts work.

Committed use discounts (CUDs) are the bigger lever and they come in two forms that confuse almost everyone. Resource-based CUDs commit you to a quantity of vCPU and memory in a specific region for one or three years and return the deepest discounts, up to around 55 percent for most machine types and higher for some. Spend-based (flexible) CUDs commit you to a dollar amount of hourly spend on eligible services and trade some discount for flexibility, in the region of 28 percent for one year and up to about 46 percent for three. As of January 2026, Google moved spend-based CUDs fully to a direct discount model that shows the discounted price on the bill. Our explainer on committed use discounts, resource versus spend-based is the deep dive, and when it is time to commit at scale, see how to negotiate a Google Cloud commitment and GCP enterprise agreements and discount structures.

DiscountCommitmentTypical savingBest for
Sustained useNone, automaticUp to ~30%Steady VMs you already run
Resource-based CUD1 or 3 yr, region + shapeUp to ~55% and higherPredictable, steady fleets
Spend-based CUD1 or 3 yr, hourly spend~28% to ~46%Changing fleets, managed services
Spot VMsNone, interruptible60% to 91%Batch, fault-tolerant work

Figures are indicative of Google Cloud pricing as of 2026 and vary by machine type and region. Always confirm current rates against Google Cloud's pricing pages before you commit.

BigQuery and data costs

For data-heavy organizations, BigQuery is often the single largest line on the Google Cloud bill, and it has its own optimization model. The first decision is the pricing model. On-demand bills you per data scanned, at $6.25 per TiB with the first 1 TiB each month free. Editions bill for compute capacity in slots, with Standard, Enterprise and Enterprise Plus tiers and autoscaling. Heavy, steady analytics usually belongs on Editions with commitments; spiky or light usage often stays cheaper on-demand. Our comparison of BigQuery cost optimization, on-demand versus editions shows where the break-even sits.

Whichever model you use, the query and storage layer is where the day to day savings are. Partitioning and clustering tables, avoiding SELECT star, and using the right storage billing model can cut a BigQuery bill dramatically. See how to reduce BigQuery storage and query costs. For wider data platform work, our guides to reducing Dataflow and Dataproc costs and Spanner and Bigtable cost optimization cover the rest of the analytics estate, and Vertex AI cost control for ML workloads covers training and inference.

GKE and containers

Kubernetes is its own cost discipline, and on Google Cloud that means Google Kubernetes Engine. The big levers are the cluster mode, Spot pods, and bin packing nodes so you are not paying for idle capacity. Autopilot bills for the resources your pods request rather than the nodes, which removes a whole class of node-level waste but rewards accurate requests. Our guide to GKE cost optimization with Autopilot, Spot and bin packing is the GCP-specific deep dive. Because container cost spans every cloud, it also has its own pillar: the complete guide to Kubernetes cost optimization.

Storage, networking and egress

Cloud Storage costs are driven by class and lifecycle. Data that is rarely read should not sit in Standard; moving it to Nearline, Coldline or Archive on an automated lifecycle policy can cut storage cost by an order of magnitude. See Google Cloud storage classes and lifecycle management. Networking is the other quiet cost. Egress out of Google Cloud and traffic between regions both carry charges that surprise teams, and the fix is usually architectural. Our guide to GCP network egress and inter-region pricing explains how to model it, and how to use the Google Cloud pricing calculator properly helps you estimate before you build.

Step 4 · Lock and Run: keep the savings

Savings drift back the moment attention moves on. Locking them in means budgets, alerts and guardrails. Google Cloud budgets with threshold alerts catch overruns early; see how to set budgets and alerts in Google Cloud. Observability has a cost of its own, and Cloud Logging in particular can balloon, so Cloud Logging and Monitoring cost control is part of governance, not an afterthought. Finally, a Google Cloud bill is a moving target, so forecasting Google Cloud spend keeps finance and engineering aligned on where it is heading.

This is the Run phase: continuous monitoring, fresh commitments as the fleet changes, and a unit cost that keeps falling. It is also what our Google Cloud optimization service delivers on an ongoing basis, whether as a one-time engagement or fully managed.

Want this run for you on Google Cloud?

We will read your billing export, rightsize the fleet, structure your commitments, and lock in the savings. Fixed fee, or pay only from what we save you. No savings, no fee.

Book a GCP cost audit →

Every article in the Google Cloud cluster

This pillar links down to all 28 guides in the Google Cloud cost optimization cluster. Start with whichever lever is biggest on your bill.

Free guide

Want the full field manual? Download the Google Cloud Cost Optimization Field Guide, our gated playbook with the commitment math, the rightsizing workflow, and the storage lifecycle rules in one document.

The Cloud Cost Brief

Cloud pricing moves. We tell you when it matters.

New commitment instruments, FOCUS changes, hyperscaler pricing shifts, and the plays that actually move a bill. No schedule, no filler.

Subscribe · Work email only