Scaling Sharded Clusters

This guide covers scaling operations for SGShardedCluster, including horizontal scaling (adding shards or replicas) and vertical scaling (changing resources).

Scaling Overview

SGShardedCluster supports multiple scaling dimensions:

Dimension                  Component                 Configuration
Horizontal - Shards        Number of shard clusters  spec.shards.clusters
Horizontal - Replicas      Replicas per shard        spec.shards.instancesPerCluster
Horizontal - Coordinators  Coordinator instances     spec.coordinator.instances
Vertical                   CPU/memory per instance   spec.coordinator.sgInstanceProfile, spec.shards.sgInstanceProfile
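
For orientation, the sketch below shows where each knob lives in a single manifest; the values and the profile name are illustrative.

apiVersion: stackgres.io/v1alpha1
kind: SGShardedCluster
metadata:
  name: my-sharded-cluster
spec:
  coordinator:
    instances: 2                    # horizontal: coordinator instances
    sgInstanceProfile: size-medium  # vertical: CPU/memory (illustrative profile name)
  shards:
    clusters: 3                     # horizontal: number of shard clusters
    instancesPerCluster: 2          # horizontal: replicas per shard
    sgInstanceProfile: size-medium  # vertical: CPU/memory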

Adding Shards

To add more shard clusters, increase the clusters value:

apiVersion: stackgres.io/v1alpha1
kind: SGShardedCluster
metadata:
  name: my-sharded-cluster
spec:
  shards:
    clusters: 5  # Increased from 3 to 5
    instancesPerCluster: 2
    pods:
      persistentVolume:
        size: 50Gi

Apply the change:

kubectl apply -f sgshardedcluster.yaml

Or patch directly:

kubectl patch sgshardedcluster my-sharded-cluster --type merge \
  -p '{"spec":{"shards":{"clusters":5}}}'

What Happens When Adding Shards

  1. New shard clusters are created with the specified configuration
  2. Each new shard gets the configured number of replicas
  3. For Citus: New shards are registered with the coordinator
  4. Data is not automatically rebalanced to new shards
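
Each shard is backed by its own SGCluster, so the new shard clusters and their pods should appear under the sharded cluster's label selector:

# New shard clusters show up as additional SGClusters
kubectl get sgcluster -l stackgres.io/shardedcluster-name=my-sharded-cluster

# Watch the new pods until they are ready
kubectl get pods -l stackgres.io/shardedcluster-name=my-sharded-cluster -w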

Rebalancing Data (Citus)

After adding shards, use SGShardedDbOps to rebalance data:

apiVersion: stackgres.io/v1
kind: SGShardedDbOps
metadata:
  name: rebalance-after-scale
spec:
  sgShardedCluster: my-sharded-cluster
  op: resharding
  resharding:
    citus:
      threshold: 0.1  # Rebalance if utilization differs by 10%
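
A minimal way to start the operation and follow it as it runs (the file name is illustrative):

kubectl apply -f rebalance-after-scale.yaml

# Follow the operation's status while the rebalance runs
kubectl get sgshardeddbops rebalance-after-scale -w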

Adding Replicas

To increase replicas per shard for better read scalability:

spec:
  shards:
    clusters: 3
    instancesPerCluster: 3  # Increased from 2 to 3

Or patch:

kubectl patch sgshardedcluster my-sharded-cluster --type merge \
  -p '{"spec":{"shards":{"instancesPerCluster":3}}}'

Replica Considerations

  • New replicas are created from the primary via streaming replication
  • Initial sync may take time depending on data size
  • Consider replication mode (sync vs async) for consistency requirements
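
Once new replicas report in, you can confirm they are streaming from the shard primary, and whether any are synchronous, with a standard pg_stat_replication query; the pod and container names below are illustrative of a StackGres pod layout:

# Run against a shard primary (adjust the pod name for your cluster)
kubectl exec -it my-sharded-cluster-shard0-0 -c postgres-util -- \
  psql -c "SELECT application_name, state, sync_state FROM pg_stat_replication;"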

Scaling Coordinators

Scale coordinator instances for high availability:

spec:
  coordinator:
    instances: 3  # Increased from 2 to 3
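
As with shards and replicas, the same change can be made with a patch:

kubectl patch sgshardedcluster my-sharded-cluster --type merge \
  -p '{"spec":{"coordinator":{"instances":3}}}'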

Coordinator Scaling Notes

  • Minimum recommended: 2 instances for HA
  • Coordinators handle metadata and query routing
  • The coordinator primary handles read/write queries; additional coordinator instances are streaming replicas that can serve reads

Vertical Scaling

Using Instance Profiles

First, create an SGInstanceProfile with desired resources:

apiVersion: stackgres.io/v1
kind: SGInstanceProfile
metadata:
  name: large-profile
spec:
  cpu: "4"
  memory: "16Gi"

Then reference it in the sharded cluster:

spec:
  coordinator:
    sgInstanceProfile: large-profile
  shards:
    sgInstanceProfile: large-profile
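
The references can also be switched with a patch; either way, the new resources only take effect after a restart (see Applying Vertical Scaling below):

kubectl patch sgshardedcluster my-sharded-cluster --type merge \
  -p '{"spec":{"coordinator":{"sgInstanceProfile":"large-profile"},"shards":{"sgInstanceProfile":"large-profile"}}}'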

Different Profiles for Coordinators and Shards

spec:
  coordinator:
    sgInstanceProfile: coordinator-profile  # Smaller, query routing
  shards:
    sgInstanceProfile: shard-profile        # Larger, data storage
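
A sketch of the two profiles referenced above; the sizes are illustrative and should be tuned to your workload:

apiVersion: stackgres.io/v1
kind: SGInstanceProfile
metadata:
  name: coordinator-profile
spec:
  cpu: "2"
  memory: "8Gi"
---
apiVersion: stackgres.io/v1
kind: SGInstanceProfile
metadata:
  name: shard-profile
spec:
  cpu: "8"
  memory: "32Gi"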

Applying Vertical Scaling

Vertical scaling requires a restart. Use SGShardedDbOps to perform a controlled rolling restart:

apiVersion: stackgres.io/v1
kind: SGShardedDbOps
metadata:
  name: apply-new-profile
spec:
  sgShardedCluster: my-sharded-cluster
  op: restart
  restart:
    method: ReducedImpact
    onlyPendingRestart: true
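
Apply it and watch the pods cycle through the restart (the file name is illustrative):

kubectl apply -f apply-new-profile.yaml

# Pods are replaced as the rolling restart proceeds
kubectl get pods -l stackgres.io/shardedcluster-name=my-sharded-cluster -w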

Autoscaling

SGShardedCluster supports automatic scaling based on metrics.

Horizontal Autoscaling (KEDA)

Enable connection-based horizontal scaling:

spec:
  coordinator:
    autoscaling:
      mode: horizontal
      horizontal:
        minInstances: 2
        maxInstances: 5
        # Scale based on active connections
        cooldownPeriod: 300
        pollingInterval: 30
  shards:
    autoscaling:
      mode: horizontal
      horizontal:
        minInstances: 1
        maxInstances: 3
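
KEDA drives horizontal scaling through ScaledObject resources, so once autoscaling is enabled you can inspect what was generated for the cluster; this assumes the operator creates ScaledObjects per scaled component, and the exact names may vary by version:

# List the ScaledObjects backing the autoscaling configuration
kubectl get scaledobjects.keda.sh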

Vertical Autoscaling (VPA)

Enable CPU/memory recommendations:

spec:
  coordinator:
    autoscaling:
      mode: vertical
      vertical:
        # VPA will recommend resource adjustments
  shards:
    autoscaling:
      mode: vertical

Scale-Down Operations

Reducing Shards

Reducing the number of shards requires data migration:

  1. For Citus: Drain shards before removal (a verification query follows this list):
apiVersion: stackgres.io/v1
kind: SGShardedDbOps
metadata:
  name: drain-shards
spec:
  sgShardedCluster: my-sharded-cluster
  op: resharding
  resharding:
    citus:
      drainOnly: true
  2. After draining, reduce the cluster count:
kubectl patch sgshardedcluster my-sharded-cluster --type merge \
  -p '{"spec":{"shards":{"clusters":3}}}'

Reducing Replicas

Reducing replicas is straightforward:

kubectl patch sgshardedcluster my-sharded-cluster --type merge \
  -p '{"spec":{"shards":{"instancesPerCluster":1}}}'

Monitoring Scaling Operations

Check Cluster Status

# View overall status
kubectl get sgshardedcluster my-sharded-cluster

# Check individual shard clusters
kubectl get sgcluster -l stackgres.io/shardedcluster-name=my-sharded-cluster

# View pods
kubectl get pods -l stackgres.io/shardedcluster-name=my-sharded-cluster

Check DbOps Progress

kubectl get sgshardeddbops rebalance-after-scale -o yaml
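
For scripting, you can block until the operation finishes, assuming the resource exposes a Completed condition in its status as SGDbOps-style resources do:

kubectl wait sgshardeddbops/rebalance-after-scale --for=condition=Completed --timeout=30m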

Best Practices

  1. Plan capacity ahead: Scale before reaching limits
  2. Test in staging: Validate scaling operations in non-production first
  3. Monitor during scaling: Watch metrics during scale operations
  4. Use ReducedImpact: For vertical scaling, use reduced impact restarts
  5. Backup before major changes: Create a backup before significant scaling (see the example below)
  6. Rebalance after adding shards: Data doesn’t automatically redistribute
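
For the backup recommendation above (item 5), a sharded-cluster backup can be requested declaratively; this sketch assumes your StackGres version provides the SGShardedBackup CRD and that backup storage is already configured for the cluster:

apiVersion: stackgres.io/v1
kind: SGShardedBackup        # assumes this CRD is available; apiVersion may differ by release
metadata:
  name: pre-scaling-backup
spec:
  sgShardedCluster: my-sharded-cluster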