This page defines the capacity-level configuration options that must be evaluated for our Microsoft Fabric platform, with a focus on:

  • production stability
  • workload isolation
  • controlled scaling
  • governance of shared resources
  • independence of the Data Platform Core  workspace

Our target operating model uses Fabric primarily as a data storage and exposure platform based on Lakehouse and Warehouse, serving both BI consumption and external data exposure.

Fabric capacity configuration is a platform governance topic, not only an infrastructure topic.

The primary control for protecting Data Platform Core  is capacity isolation.

Shared capacity between Data Platform Core  and Domain production creates shared operational risk.

A dedicated capacity for Data Platform Core  is the recommended baseline for this architecture.


Version

Date

Description

Contributor

V0.1

 

Initial document

COLOMBANI Théo

V0.2

 

Updated with schema proposal and checklist

COLOMBANI Théo



Key messages

What we should implement

Recommended target model

Capacity design principles

Proposed capacity design



Decision guide

Decision areaUse this approach whenRecommended decision
Dedicated capacity for Data Platform Corebronze and silver are production-critical; downstream BI or external exposure depends on them; Domain workloads are more variable than Core workloads; Core continuity is a priorityassign Data Platform Core to a dedicated capacity
Separate Domain production capacitymultiple Domain workspaces coexist; Domain workloads may create contention; business-facing usage is less predictable; Domain growth should not affect Core operationsassign Domain production to a separate capacity
Separate non-production capacitydevelopment and testing are active; experimentation may generate compute spikes; production stability must be protected from non-production activitykeep non-production on a separate capacity
Architecture review requiredCore and Domain are still planned on the same capacity; workspace reassignment is not tightly governed; production and non-production still share capacity; capacity sizing issues become recurrentescalate to architecture review

Checklist

Recommended configuration matrix

SettingData Platform CoreDomain ProductionNon-ProductionRecommendation
Dedicated capacityYesPreferredSeparateMandatory for Data Platform Core
Shared with Data Platform CoreNANoNoNot allowed
Workspace reassignment rightsVery restrictedRestrictedControlledGovern centrally
MonitoringMandatoryMandatoryRecommendedStandard operating baseline
DR assessmentMandatoryCase by caseNot priorityExplicit decision required
Spark governanceCase by caseCase by caseFlexibleOnly where relevant
Scaling review cadenceRegularRegularPeriodicMetrics-driven



Recommended target model

Do not place Data Platform Core production and Domain production on the same capacity if Data Platform Core must remain operational independently.


CapacityScopeUsed for
Capacity A — Data Platform CoreCentral platform production capacityCore production ingestion, preparation, and exposure foundations
Capacity B — Domain ProductionDomain production capacities
  • Domain gold workspaces
  • business-facing data products
  • BI-oriented workloads
  • potentially more variable usage patterns
Capacity C — Non-ProductionShared or segmented non-production capacities
  • Development
  • testing
  • experimentation
  • validation before production promotion



Detailed design sections

Workspace-to-capacity assignment

Key message
Workspace assignment is the primary mechanism used to guarantee production isolation and operational independence. Fabric capacity settings let admins manage assigned workspaces, and workspace reassignment directly affects how workloads share compute risk. (Microsoft Learn)

AreaSummary
What it isThe assignment of workspaces to specific Fabric capacities. (Microsoft Learn)
Why it mattersThis is the most important configuration decision because it determines which workloads share the same compute risk domain. Capacity planning guidance frames capacity allocation as a governance decision, not only a technical sizing choice. (Microsoft Learn)
What to watchIf Core, Domain, and non-production workloads share the same capacity, they also share the same contention and throttling risk envelope. Capacity growth guidance explicitly recommends governance patterns adapted to centralized and decentralized models. (Microsoft Learn)
Recommended postureKeep Data Platform Core on a dedicated capacity, separate Domain production where possible, and isolate non-production from all production capacities. (Microsoft Learn)

Recommendation block


Capacity administration and reassignment governance

Key message
A dedicated production capacity loses most of its value if workspace assignment is not tightly governed. Fabric allows reassignment through admin and workspace-level paths, so governance rules must be explicit. (Microsoft Learn)
AreaSummary
What it isThe set of permissions allowing administrators to manage a capacity and move workspaces into or out of it. (Microsoft Learn)
Why it mattersEven with a good target architecture, weak governance can reintroduce risk if workspaces are moved without control. Workspace admins can also reassign workspaces in some cases, which increases the need for formal guardrails. (Microsoft Learn)
What to watchA capacity model can look clean on paper but drift over time if reassignment rights are too broad. Governance guidance emphasizes strong controls when multiple teams share Fabric at scale. (Microsoft Learn)
Recommended postureRestrict critical-capacity administration to the central platform team and formalize approval for any reassignment that affects the Core production perimeter. (Microsoft Learn)

Recommendation block


Capacity sizing and scaling

Key message
Data Platform Core capacity sizing must prioritize service continuity over cost minimization. Microsoft recommends estimating size from workload characteristics and validating with real usage in the Capacity Metrics App. (Microsoft Learn)
AreaSummary
What it isThe sizing of Fabric capacity and the ability to adjust it as workload volume evolves. (Microsoft Learn)
Why it mattersEven a well-isolated architecture can fail if the capacity is persistently undersized. Strategic planning guidance recommends budgeting, scaling, and optimization as ongoing activities. (Microsoft Learn)
What to watchResizing should be based on observed patterns, not only on incident response. The Metrics App is the main evidence source to understand CU usage, peaks, and the item types driving load. (Microsoft Learn)
Recommended postureSize Data Platform Core with operational headroom. Review Domain production more frequently because its usage is more variable and business-facing. (Microsoft Learn)

Recommendation block


Monitoring and operational visibility

Key message
Capacity monitoring must be part of normal run operations, not only incident management. The Fabric Capacity Metrics App is designed to help admins monitor health, top consumers, compute usage, and issues such as throttling or query rejections. (Microsoft Learn)
AreaSummary
What it isMonitoring of capacity usage, saturation patterns, top consumers, and operational degradation signals. (Microsoft Learn)
Why it mattersCapacity governance is only effective if usage and saturation can be observed and acted upon. Microsoft recommends using the Metrics App to identify top consumers and optimize before throttling becomes recurrent. (Microsoft Learn)
What to watchMonitoring data should be interpreted operationally: recurring peaks, saturation patterns, top-consuming items, and correlations with refresh, ingestion, or user activity. The app also has refresh latency, so near-real-time assumptions should be avoided. (Microsoft Learn)
Recommended postureEvery production capacity should have a monitoring owner, review cadence, threshold model, and escalation path. (Microsoft Learn)

Recommendation block
For each production capacity, define:

Minimum baseline:


Disaster recovery

Key message
For Data Platform Core, disaster recovery should never be left undocumented. Fabric capacity settings include disaster recovery controls, and Microsoft’s recovery guidance makes clear that recovery planning must be explicit. (Microsoft Learn)
AreaSummary
What it isThe capacity-level disaster recovery posture associated with production data continuity. Capacity settings include disaster recovery options and related status information. (Microsoft Learn)
Why it mattersThe Data Platform Core workspace supports bronze and silver foundations, making it a central dependency for downstream exposure. In that model, DR is an architecture topic, not just an ops topic. This last point is an inference from your target design, supported by Microsoft’s capacity governance framing. (Microsoft Learn)
What to watchDR should be documented as an explicit decision: enabled or not enabled, with assumptions and limitations clearly stated. (Microsoft Learn)
Recommended postureAssess DR first for Data Platform Core, then extend case by case to Domain production depending on criticality. This prioritization is a design recommendation based on your architecture. (Microsoft Learn)

Recommendation block


Data Engineering and Spark-related settings

Key message
This is a secondary topic unless Spark becomes a major production dependency. Fabric capacity admins can manage Data Engineering and Data Science settings, including workspace-level compute, runtime defaults, and Spark properties. (Microsoft Learn)
AreaSummary
What it isCapacity-level settings related to Data Engineering and Data Science, including Spark governance options. (Microsoft Learn)
Why it mattersThese settings become relevant when Spark-based processing is materially used in the platform. Microsoft explicitly positions them as admin-governed capacity settings. (Microsoft Learn)
What to watchIf Spark usage grows without governance, compute sprawl can become harder to control. Spark planning guidance also treats development and production needs differently. (Microsoft Learn)
Recommended postureKeep Spark governance centralized and document Spark rules separately if Spark is not a central workload in the platform. The “secondary topic” positioning is a design recommendation for your model. (Microsoft Learn)

Recommendation block



Architecture conclusion

This is the most coherent model for a Fabric platform used primarily as a storage and exposure layer, where the Data Platform Core workspace must remain stable independently from Domain activity.