Version	Date	Description	Contributor
V0.1	15 Apr 2026	Initial document	COLOMBANI Théo
V0.2	20 Apr 2026	Updated with schema proposal and checklist	COLOMBANI Théo


Fabric Capacity Configuration for Our Data Platform

1. Objective

This page defines the capacity-level configuration options that must be assessed for our Microsoft Fabric platform.

The objective is to ensure:

production stability
workload isolation
controlled scalability
governance of shared compute resources
operational independence of the Data Platform Core workspace

In our architecture, Microsoft Fabric is primarily used as a storage and exposure platform, based on Lakehouse and Warehouse, for both BI consumption and external data exposure.

2. Platform Context

Our target operating model is structured as follows:

Data Platform Core workspace

bronze layer
silver layer
core production data preparation and controlled exposure foundation

Domain workspaces

gold layer
business-oriented and BI-ready data products
domain-level exposure for reporting and consumption

Key requirement

The Data Platform Core production workspace must remain operational independently from Domain workspaces, including in situations where Domain workloads generate higher or less predictable compute consumption.

3. Design Principle

Recommendation

Capacity design must be driven by isolation first, then by optimization.

Rationale

In our context, the main purpose of capacity governance is not only to size compute correctly. It is primarily to:

protect critical Data Platform Core workloads
separate critical and non-critical workloads
reduce cross-workspace contention
create predictable operating conditions
support controlled platform growth

Decision statement

For our platform, capacity is an architecture boundary, not only a billing or administration object.

4. Recommended Target Model

Target architecture

Capacity A - Data Platform Core

Used only for:

Data Platform Core bronze
Data Platform Core silver
Key messages

Capacity design must be driven by isolation first, then by optimization.
The Data Platform Core workspace must remain operational independently from Domain workspaces, including when Domain workloads generate higher or less predictable compute consumption.
Capacity is an architecture boundary, not only a billing or administration object.
The Data Platform Core workspace should not share the same production risk envelope as Domain workloads.
Non-production must be isolated from production capacities.
Capacity governance must cover both technical setup and operating model.

What we should implement

Recommended target model

one dedicated Fabric capacity for Data Platform Core
one separate Fabric capacity for Domain production
one separate non-production capacity
centralized control of workspace assignment
formal governance for capacity admin and reassignment rights
standardized monitoring and review cadence
explicit disaster recovery assessment for Data Platform Core

Capacity design principles

isolate critical and variable workloads
avoid shared production capacity between Core and Domain if operational independence is required
keep non-production outside production capacities
define ownership and review rules for every production capacity

Proposed capacity design

[Figure: proposed capacity design diagram]

Decision guide

Decision area	Use this approach when	Recommended decision
Dedicated capacity for Data Platform Core	bronze and silver are production-critical; downstream BI or external exposure depends on them; Domain workloads are more variable than Core workloads; Core continuity is a priority	assign Data Platform Core to a dedicated capacity
Separate Domain production capacity	multiple Domain workspaces coexist; Domain workloads may create contention; business-facing usage is less predictable; Domain growth should not affect Core operations	assign Domain production to a separate capacity
Separate non-production capacity	development and testing are active; experimentation may generate compute spikes; production stability must be protected from non-production activity	keep non-production on a separate capacity
Architecture review required	Core and Domain are still planned on the same capacity; workspace reassignment is not tightly governed; production and non-production still share capacity; capacity sizing issues become recurrent	escalate to architecture review

Checklist

Has a dedicated capacity been confirmed for Data Platform Core?
Has Domain production been isolated from Data Platform Core?
Has non-production been separated from production capacities?
Have capacity admin roles been limited to the central platform team?
Have workspace reassignment rights been formally governed?
Has a monitoring owner been assigned for each production capacity?
Has disaster recovery been explicitly assessed for Data Platform Core?
Have Spark-related settings been reviewed, if applicable?
Has the target capacity model been approved as part of platform governance?

Recommended configuration matrix

Setting	Data Platform Core	Domain Production	Non-Production	Recommendation
Dedicated capacity	Yes	Preferred	Separate	Mandatory for Data Platform Core
Shared with Data Platform Core	NA	No	No	Not allowed
Workspace reassignment rights	Very restricted	Restricted	Controlled	Govern centrally
Monitoring	Mandatory	Mandatory	Recommended	Standard operating baseline
DR assessment	Mandatory	Case by case	Not priority	Explicit decision required
Spark governance	Case by case	Case by case	Flexible	Only where relevant
Scaling review cadence	Regular	Regular	Periodic	Metrics-driven

Recommended target model

Recommendation

Do not place Data Platform Core production and Domain production on the same capacity if Data Platform Core must remain operational independently.

Capacity	Scope	Used for
Capacity A - Data Platform Core	Central platform production capacity	Core production ingestion, preparation, and exposure foundations
Capacity B - Domain Production	Domain production capacities	Domain gold workspaces; business-facing data products; BI-oriented workloads; potentially more variable usage patterns
Capacity C - Non-Production	Shared or segmented non-production capacities	Development; testing; experimentation; validation before production promotion
Why this matters

A shared capacity creates a shared risk envelope. Even if workspaces are logically separated, they still depend on the same underlying capacity behavior.

5. Capacity-Level Settings to Document

Detailed design sections

5.1 Workspace-to-capacity assignment

Key message
Workspace assignment is the primary mechanism used to guarantee production isolation and operational independence. Fabric capacity settings let admins manage assigned workspaces, and workspace reassignment directly affects how workloads share compute risk. (Microsoft Learn)

Area	Summary
What it is	The assignment of workspaces to specific Fabric capacities. (Microsoft Learn)
Why it matters	This is the most important configuration decision because it determines which workloads share the same compute risk domain. Capacity planning guidance frames capacity allocation as a governance decision, not only a technical sizing choice. (Microsoft Learn)
What to watch	If Core, Domain, and non-production workloads share the same capacity, they also share the same contention and throttling risk envelope. Capacity growth guidance explicitly recommends governance patterns adapted to centralized and decentralized models. (Microsoft Learn)
Recommended posture	Keep Data Platform Core on a dedicated capacity, separate Domain production where possible, and isolate non-production from all production capacities. (Microsoft Learn)

Recommendation block

Assign Data Platform Core production to a dedicated capacity.
Assign Domain production to a separate capacity whenever possible.
Keep DEV / QA / other non-production workloads off all production capacities.
Avoid mixing critical platform workloads with variable business workloads.
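
The assignment rules above can be checked mechanically against a workspace inventory. The following is a minimal sketch only: the workspace names, capacity names, and tier labels are hypothetical, and the inventory is a hand-written dictionary rather than anything read from Fabric.

```python
from collections import defaultdict

# Hypothetical inventory: workspace name -> (tier, capacity name).
# Tiers follow this page: "core", "domain", "nonprod".
ASSIGNMENTS = {
    "dp-core-prod": ("core", "capacity-a"),
    "sales-gold-prod": ("domain", "capacity-b"),
    "finance-gold-prod": ("domain", "capacity-b"),
    "sales-gold-dev": ("nonprod", "capacity-c"),
}


def isolation_violations(assignments):
    """Return one message per capacity that mixes tiers meant to stay isolated."""
    tiers_by_capacity = defaultdict(set)
    for _workspace, (tier, capacity) in assignments.items():
        tiers_by_capacity[capacity].add(tier)

    violations = []
    for capacity, tiers in sorted(tiers_by_capacity.items()):
        if "core" in tiers and len(tiers) > 1:
            # Core must not share a capacity with any other tier.
            others = sorted(tiers - {"core"})
            violations.append(f"{capacity}: Core shares capacity with {others}")
        elif "nonprod" in tiers and "domain" in tiers:
            # Non-production must stay off production capacities.
            violations.append(f"{capacity}: non-production shares capacity with Domain production")
    return violations


print(isolation_violations(ASSIGNMENTS))
```

With the sample inventory the list is empty. Moving the hypothetical sales-gold-dev workspace onto capacity-a would produce one violation for that capacity.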

Confluence panel text

Recommendation
Workspace assignment is the primary mechanism used to guarantee production isolation and operational independence.


5.2 Capacity administration and reassignment governance

Key message
A dedicated production capacity loses most of its value if workspace assignment is not tightly governed. Fabric allows reassignment through admin and workspace-level paths, so governance rules must be explicit. (Microsoft Learn)

Area	Summary
What it is	The set of permissions allowing administrators to manage a capacity and move workspaces into or out of it. (Microsoft Learn)
Why it matters	Even with a good target architecture, weak governance can reintroduce risk if workspaces are moved without control. Workspace admins can also reassign workspaces in some cases, which increases the need for formal guardrails. (Microsoft Learn)
What to watch	A capacity model can look clean on paper but drift over time if reassignment rights are too broad. Governance guidance emphasizes strong controls when multiple teams share Fabric at scale. (Microsoft Learn)
Recommended posture	Restrict critical-capacity administration to the central platform team and formalize approval for any reassignment that affects the Core production perimeter. (Microsoft Learn)

Recommendation block

Restrict capacity admin rights to the central Data Platform or IT team.
Restrict workspace reassignment rights on critical capacities.
Require formal approval for any workspace added to the Data Platform Core production capacity.
Prevent self-service reassignment into critical production capacity.
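
As an illustration of the approval rule, the sketch below models a reassignment request and rejects moves into a critical capacity unless a central approval is recorded. The capacity name, team name, and request fields are all hypothetical. This describes a governance check, not a Fabric API.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical names: which capacities count as critical, and who may approve.
CRITICAL_CAPACITIES = {"capacity-a"}
APPROVER_TEAM = "central-platform"


@dataclass(frozen=True)
class ReassignmentRequest:
    workspace: str
    target_capacity: str
    requested_by_team: str
    approved_by_team: Optional[str] = None  # None means no formal approval recorded


def is_allowed(request: ReassignmentRequest) -> bool:
    """Allow moves into critical capacities only with central approval."""
    if request.target_capacity not in CRITICAL_CAPACITIES:
        # This sketch only guards the critical perimeter.
        return True
    return request.approved_by_team == APPROVER_TEAM


self_service = ReassignmentRequest("sales-gold-prod", "capacity-a", "sales")
approved = ReassignmentRequest("dp-core-prod", "capacity-a", "central-platform", "central-platform")
print(is_allowed(self_service), is_allowed(approved))
```

The self-service request is rejected and the centrally approved one is accepted. Rules for Domain and non-production capacities would need their own conditions.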

Confluence panel text

Warning
A dedicated production capacity loses most of its value if workspace assignment is not tightly governed.

5.3 Surge protection

What it is

A protection mechanism used to manage overload situations and reduce the impact of excessive background activity on a capacity.

Why it matters

It can help protect shared capacities, especially where Domain workspaces may generate bursty or uneven usage patterns.

Recommendation

consider enabling surge protection on shared Domain production capacities
use it as a protection layer for variable workloads
do not rely on it as the sole protection for Data Platform Core

Position

Surge protection is a supporting control, not a substitute for proper isolation.

Confluence panel text

Recommendation
Use surge protection on shared capacities.
Do not use it as a replacement for dedicated capacity when a workspace is mission-critical.


5.4 Capacity sizing and scaling

Key message
Data Platform Core capacity sizing must prioritize service continuity over cost minimization. Microsoft recommends estimating size from workload characteristics and validating with real usage in the Capacity Metrics App. (Microsoft Learn)

Area	Summary
What it is	The sizing of Fabric capacity and the ability to adjust it as workload volume evolves. (Microsoft Learn)
Why it matters	Even a well-isolated architecture can fail if the capacity is persistently undersized. Strategic planning guidance recommends budgeting, scaling, and optimization as ongoing activities. (Microsoft Learn)
What to watch	Resizing should be based on observed patterns, not only on incident response. The Metrics App is the main evidence source to understand CU usage, peaks, and the item types driving load. (Microsoft Learn)
Recommended posture	Size Data Platform Core with operational headroom. Review Domain production more frequently because its usage is more variable and business-facing. (Microsoft Learn)

Recommendation block

Size Data Platform Core for continuity first, with stability and operational headroom in mind.
Review Domain production more frequently, as its usage can be less predictable.
Use monitored usage trends to drive scaling decisions.
Avoid reactive resizing without understanding the underlying workload pattern.

Practical interpretation

Data Platform Core should be sized for continuity first
Domain capacities can be managed more elastically
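
One way to make "operational headroom" concrete is to compare an observed utilization percentile with a per-tier target. The sketch below assumes utilization samples expressed as a fraction of capacity. The 70% and 85% targets are placeholders chosen for illustration, not values taken from Microsoft guidance or from this page.

```python
import math

# Hypothetical headroom targets: maximum acceptable p95 utilization per tier.
P95_TARGET = {"core": 0.70, "domain": 0.85}


def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty list of numbers (pct as an integer 1..100)."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def needs_scaling_review(tier, utilization_samples):
    """True when observed p95 utilization exceeds the tier's headroom target."""
    return percentile(utilization_samples, 95) > P95_TARGET[tier]


# Twenty illustrative samples: mostly quiet, with two busy intervals.
samples = [0.40] * 18 + [0.75, 0.90]
print(percentile(samples, 95))
print(needs_scaling_review("core", samples), needs_scaling_review("domain", samples))
```

With these samples the p95 is 0.75, so the same usage profile triggers a review under the stricter Core target but not under the Domain target. That reflects the continuity-first stance for Core.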

Confluence panel text

Decision
Data Platform Core capacity sizing must prioritize service continuity over cost minimization.

5.5 Capacity overage

What it is

A mechanism that allows excess usage beyond the purchased capacity threshold, subject to billing and governance.

Why it matters

It can reduce the risk of operational disruption during rare peaks.

Recommendation

consider enabling overage for Data Platform Core only with explicit financial approval
define a capped and governed usage threshold
treat overage as a resilience mechanism, not a normal operating model

Position

Overage is a safety net, not a sizing strategy.

Confluence panel text

Warning
Do not use overage to compensate for structural under-sizing.

5.6 Monitoring and operational visibility


Key message
Capacity monitoring must be part of normal run operations, not only incident management. The Fabric Capacity Metrics App is designed to help admins monitor health, top consumers, compute usage, and issues such as throttling or query rejections. (Microsoft Learn)

Area	Summary
What it is	Monitoring of capacity usage, saturation patterns, top consumers, and operational degradation signals. (Microsoft Learn)
Why it matters	Capacity governance is only effective if usage and saturation can be observed and acted upon. Microsoft recommends using the Metrics App to identify top consumers and optimize before throttling becomes recurrent. (Microsoft Learn)
What to watch	Monitoring data should be interpreted operationally: recurring peaks, saturation patterns, top-consuming items, and correlations with refresh, ingestion, or user activity. The app also has refresh latency, so near-real-time assumptions should be avoided. (Microsoft Learn)
Recommended posture	Every production capacity should have a monitoring owner, review cadence, threshold model, and escalation path. (Microsoft Learn)

Recommendation block
For each production capacity, define:

monitoring owner
review cadence
alert thresholds
escalation path
expected remediation actions

Minimum baseline:

monitor recurring peaks
identify top consuming workspaces and items
review saturation or degradation patterns
correlate operational issues with refresh, ingestion, or usage spikes
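
The minimum baseline can be prototyped on exported usage data. The sketch below assumes a simplified record layout of (day, hour, item, CU seconds) invented for illustration. It is not the Capacity Metrics App schema, and the figures are made up.

```python
from collections import Counter

# Hypothetical usage records: (day, hour, item name, CU seconds).
RECORDS = [
    ("Mon", 6, "silver_ingest", 900),
    ("Mon", 9, "sales_report", 300),
    ("Tue", 6, "silver_ingest", 950),
    ("Tue", 14, "adhoc_query", 200),
    ("Wed", 6, "silver_ingest", 920),
]


def top_consumers(records, n=3):
    """Items ranked by total CU seconds, highest first."""
    totals = Counter()
    for _day, _hour, item, cu in records:
        totals[item] += cu
    return totals.most_common(n)


def recurring_peak_hours(records, threshold, min_days=2):
    """Hours whose total usage exceeds threshold on at least min_days distinct days."""
    per_day_hour = Counter()
    for day, hour, _item, cu in records:
        per_day_hour[(day, hour)] += cu
    days_over = Counter(hour for (_day, hour), cu in per_day_hour.items() if cu > threshold)
    return sorted(hour for hour, days in days_over.items() if days >= min_days)


print(top_consumers(RECORDS, 1))
print(recurring_peak_hours(RECORDS, threshold=800))
```

On the sample data, silver_ingest is the top consumer and 06:00 is the only recurring peak hour. That is the kind of correlation with ingestion schedules the baseline asks for.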

Confluence panel text

Recommendation
Capacity monitoring must be part of normal run operations, not only incident management.

5.7 Disaster recovery

Key message
For Data Platform Core, disaster recovery should never be left undocumented. Fabric capacity settings include disaster recovery controls, and Microsoft’s recovery guidance makes clear that recovery planning must be explicit. (Microsoft Learn)

Area	Summary
What it is	The capacity-level disaster recovery posture associated with production data continuity. Capacity settings include disaster recovery options and related status information. (Microsoft Learn)
Why it matters	The Data Platform Core workspace supports bronze and silver foundations, making it a central dependency for downstream exposure. In that model, DR is an architecture topic, not just an ops topic. This last point is an inference from our target design, supported by Microsoft’s capacity governance framing. (Microsoft Learn)
What to watch	DR should be documented as an explicit decision: enabled or not enabled, with assumptions and limitations clearly stated. (Microsoft Learn)
Recommended posture	Assess DR first for Data Platform Core, then extend case by case to Domain production depending on criticality. This prioritization is a design recommendation based on our architecture. (Microsoft Learn)

Recommendation block

Perform an explicit DR assessment for Data Platform Core.
Document whether DR is enabled or not.
Document recovery assumptions and limitations.
Ensure DR is an explicit architecture decision.

Position

For Data Platform Core, DR should never be left undocumented.

Confluence panel text

Decision
Disaster recovery for Data Platform Core must be assessed explicitly and recorded as an approved architecture choice.

5.8 Notifications and alerting

What it is

The definition of who is informed when capacity issues occur and how operational response is triggered.

Why it matters

Without alert ownership, capacity incidents tend to be handled too late or inconsistently.

Recommendation

Define:

alert recipients
severity levels
response expectations
operational communication path
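
The four items above can be captured as a small record per capacity, so that incomplete alerting definitions are easy to spot during review. This is a sketch under assumptions: the field names mirror the list above, and the capacity and recipient names are hypothetical.

```python
from dataclasses import dataclass, field

# Fields every production capacity is expected to define, per the list above.
REQUIRED_FIELDS = ("recipients", "severity_levels", "response_expectation", "communication_path")


@dataclass
class AlertPolicy:
    capacity: str
    recipients: list = field(default_factory=list)
    severity_levels: list = field(default_factory=list)
    response_expectation: str = ""
    communication_path: str = ""


def missing_fields(policy):
    """Names of required fields that are still empty on this policy."""
    return [name for name in REQUIRED_FIELDS if not getattr(policy, name)]


policy = AlertPolicy(
    capacity="capacity-a",  # hypothetical capacity name
    recipients=["platform-oncall"],
    severity_levels=["warning", "critical"],
)
print(missing_fields(policy))
```

Here the policy still lacks a response expectation and a communication path, so both names are reported.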

Confluence panel text

Recommendation
Every production capacity must have a clearly assigned operational owner and alerting path.

5.9 Data Engineering and Spark-related settings


Key message
This is a secondary topic unless Spark becomes a major production dependency. Fabric capacity admins can manage Data Engineering and Data Science settings, including workspace-level compute, runtime defaults, and Spark properties. (Microsoft Learn)

Area	Summary
What it is	Capacity-level settings related to Data Engineering and Data Science, including Spark governance options. (Microsoft Learn)
Why it matters	These settings become relevant when Spark-based processing is materially used in the platform. Microsoft explicitly positions them as admin-governed capacity settings. (Microsoft Learn)
What to watch	If Spark usage grows without governance, compute sprawl can become harder to control. Spark planning guidance also treats development and production needs differently. (Microsoft Learn)
Recommended posture	Keep Spark governance centralized and avoid uncontrolled compute sprawl. The “secondary topic” positioning is a design recommendation for our model. (Microsoft Learn)

Recommendation block

Keep Spark governance centralized.
Avoid uncontrolled compute sprawl.
Document Spark rules separately if Spark is not a central workload in the platform.

Position

This is a secondary topic in our model unless Spark becomes a major production dependency.

6. Recommended Configuration Matrix

Setting	Data Platform Core	Domain Production	Non-Production
Dedicated capacity	Yes	Preferred	Separate
Shared with Data Platform Core	N/A	No	No
Workspace reassignment rights	Very restricted	Restricted	Controlled
Surge protection	Optional complement	Recommended	Optional
Capacity overage	Optional, capped	Optional, capped	Usually not required
Monitoring	Mandatory	Mandatory	Recommended
DR assessment	Mandatory	Case by case	Not priority
Spark governance	Case by case	Case by case	Flexible
Scaling review cadence	Regular	Regular	Periodic
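
For review tooling, part of the matrix can be held as plain data and queried per tier. The sketch below copies three rows from the table above. The tier keys ("core", "domain", "nonprod") and the helper name are hypothetical.

```python
# Three rows of the matrix above, keyed by hypothetical tier labels.
MATRIX = {
    "Dedicated capacity": {"core": "Yes", "domain": "Preferred", "nonprod": "Separate"},
    "Monitoring": {"core": "Mandatory", "domain": "Mandatory", "nonprod": "Recommended"},
    "DR assessment": {"core": "Mandatory", "domain": "Case by case", "nonprod": "Not priority"},
}


def mandatory_settings(tier):
    """Settings marked Mandatory for the given tier, in matrix order."""
    return [setting for setting, row in MATRIX.items() if row[tier] == "Mandatory"]


for tier in ("core", "domain", "nonprod"):
    print(tier, mandatory_settings(tier))
```

For Core this returns Monitoring and DR assessment, for Domain only Monitoring, and for non-production an empty list, which matches the table.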

7. Operational Rules

Rule 1

Protect Data Platform Core by design.
Critical IT workloads must not depend on the same shared capacity behavior as variable domain workloads.

Rule 2

Use isolation before optimization.
Do not try to solve structural contention only with reactive tuning or protection features.

Rule 3

Treat overage as an exception mechanism.
It may improve resilience, but it must not become the default operating mode.

Rule 4

Make monitoring part of standard operations.
Capacity review must be proactive and periodic.

Rule 5

Separate production from experimentation.
Development and testing workloads must not compete with critical production capacity.

8. Proposed Architecture Decision

Recommended decision

The recommended target state for our platform is:

one dedicated Fabric capacity for Data Platform Core
one separate Fabric capacity for Domain production
one separate non-production capacity
centralized control of workspace assignment
standardized monitoring and alerting
optional capped overage for resilience
explicit DR assessment for Data Platform Core


Architecture conclusion

This is the most coherent model for a Fabric platform used primarily as a storage and exposure layer, where the Data Platform Core workspace must remain stable independently from Domain activity.

9. Configuration Decisions to Validate

Checklist

Has a dedicated capacity been confirmed for Data Platform Core?
Has Domain production been isolated from Data Platform Core?
Has non-production been separated from production capacities?
Have capacity admin roles been limited to the central platform team?
Have workspace reassignment rights been formally governed?
Has surge protection been evaluated for shared Domain capacities?
Has capacity overage been evaluated and financially approved where relevant?
Has a monitoring owner been assigned for each production capacity?
Have alert thresholds and escalation paths been defined?
Has disaster recovery been explicitly assessed for Data Platform Core?
Have Spark-related settings been reviewed, if applicable?
