Azure Reliability Assessment & Configuration Quick Reference | Property | Details | |---|---| | Best for | Reliability posture assessment, zone redundancy enablement, multi-region failover setup | | Primary capabilities | Reliability assessment table, Zone Redundancy Configuration, Multi-Region IaC Generation | | Supported services | Azure Functions, App Service (Container Apps planned for a future version) | | MCP tools | Azure Resource Graph queries, Azure CLI commands | When to Use This Skill Activate this skill when user wants to: - "Assess my Function app's reliability" - "Assess my Web…

, 1, name)\n| summarize regions=make_list(location), regionCount=dcount(location), apps=make_list(name) by baseName, appKind\n| where regionCount > 1\n| project baseName, appKind, regionCount, regions, apps\n\" --query \"data[]\" -o json\n```\n\n**Interpretation:**\n- Results show apps with the same base name deployed across multiple regions → ✅ Multi-region\n- No results → ❌ All apps are single-region\n\n**Important:** The `baseName` extraction uses a naming convention (e.g., `my-app-eastus`, `my-app-westus`). If apps don't follow this pattern, also check by resource tags:\n\n```bash\naz graph query -q \"\nResources\n| where type in~ ('microsoft.web/sites', 'microsoft.app/containerapps')\n| where isnotempty(tags['app-group']) or isnotempty(tags['application'])\n| extend appGroup = coalesce(tostring(tags['app-group']), tostring(tags['application']))\n| summarize regions=make_list(location), regionCount=dcount(location) by appGroup\n| where regionCount > 1\n| project appGroup, regionCount, regions\n\" --query \"data[]\" -o json\n```\n\n### Check for Azure Front Door\n\n```bash\naz graph query -q \"\nResources\n| where type =~ 'microsoft.cdn/profiles'\n| where sku.name =~ 'Standard_AzureFrontDoor' or sku.name =~ 'Premium_AzureFrontDoor'\n| project name, resourceGroup, sku=sku.name\n\" --query \"data[]\" -o json\n```\n\n### Check for Traffic Manager Profiles\n\n```bash\naz graph query -q \"\nResources\n| where type =~ 'microsoft.network/trafficmanagerprofiles'\n| extend routingMethod = tostring(properties.trafficRoutingMethod)\n| extend endpoints = array_length(properties.endpoints)\n| project name, resourceGroup, routingMethod, endpoints, status=properties.profileStatus\n\" --query \"data[]\" -o json\n```\n\n### Check Front Door Origins/Backends\n\n```bash\n# List Front Door origin groups and origins\naz afd origin-group list \\\n --profile-name \u003cfront-door-name> \\\n --resource-group \u003crg> \\\n --query \"[].{name:name, origins:length(origins)}\" -o table\n\naz afd origin list \\\n --profile-name \u003cfront-door-name> \\\n --resource-group \u003crg> \\\n --origin-group-name \u003cgroup-name> \\\n --query \"[].{name:name, hostName:hostName, priority:priority, weight:weight}\" -o table\n```\n\n## Assessment Criteria\n\n| Check | Pass | Fail |\n|---|---|---|\n| App deployed in ≥2 regions | ✅ Multi-region | ❌ Single region |\n| Global load balancer exists (Front Door or TM) | ✅ Traffic routing | ❌ No failover mechanism |\n| Health probes configured on load balancer | ✅ Auto-failover | ⚠️ Manual failover only |\n| Storage is geo-redundant (GRS/GZRS) | ✅ Data survives region failure | ❌ Data loss risk |\n\n## Multi-Region Patterns\n\n### Active-Passive (Recommended starting point)\n\n```\nUsers → Azure Front Door → Primary Region (priority 1)\n → Secondary Region (priority 2, standby)\n```\n\n- Primary serves all traffic\n- Front Door health probes detect primary failure\n- Automatic failover to secondary\n- Lower cost (secondary can be scaled down)\n\n### Active-Active\n\n```\nUsers → Azure Front Door → Region A (weight 50)\n → Region B (weight 50)\n```\n\n- Both regions serve traffic simultaneously\n- Better performance (route to nearest)\n- Higher cost (both at full capacity)\n- Requires stateless design or data sync\n\n## Remediation: Generate Multi-Region IaC\n\nWhen user wants multi-region, generate Bicep that includes:\n\n1. **Secondary region compute** — Same service type as primary\n2. **Secondary region storage** — ZRS in secondary region\n3. **Azure Front Door profile** — With:\n - Origin group containing both regions\n - Health probe (HTTP/HTTPS to health endpoint)\n - Routing rule (priority for active-passive, weighted for active-active)\n4. **DNS configuration** — Custom domain on Front Door\n\n### Bicep Skeleton for Active-Passive Front Door\n\n```bicep\nresource frontDoor 'Microsoft.Cdn/profiles@2024-02-01' = {\n name: frontDoorName\n location: 'global'\n sku: {\n name: 'Standard_AzureFrontDoor'\n }\n}\n\nresource originGroup 'Microsoft.Cdn/profiles/originGroups@2024-02-01' = {\n parent: frontDoor\n name: 'primary-group'\n properties: {\n healthProbeSettings: {\n probePath: '/api/health'\n probeProtocol: 'Https'\n probeIntervalInSeconds: 30\n }\n loadBalancingSettings: {\n sampleSize: 4\n successfulSamplesRequired: 3\n }\n }\n}\n\nresource primaryOrigin 'Microsoft.Cdn/profiles/originGroups/origins@2024-02-01' = {\n parent: originGroup\n name: 'primary'\n properties: {\n hostName: primaryAppHostName\n priority: 1\n weight: 1000\n originHostHeader: primaryAppHostName\n }\n}\n\nresource secondaryOrigin 'Microsoft.Cdn/profiles/originGroups/origins@2024-02-01' = {\n parent: originGroup\n name: 'secondary'\n properties: {\n hostName: secondaryAppHostName\n priority: 2\n weight: 1000\n originHostHeader: secondaryAppHostName\n }\n}\n```\n\n## Reporting\n\nFor the reliability checklist, mark the **Multi-Region** column per resource:\n- ✅ — resource is deployed in ≥2 regions AND fronted by Azure Front Door / Traffic Manager with health probes\n- ❌ — single region OR multi-region without an active global load balancer / health probes\n- For Front Door / Traffic Manager rows: ✅ if configured with health probes, ❌ if absent or missing health probes\n","content_type":"text/markdown; charset=utf-8","language":"markdown","size":6120,"content_sha256":"b5f3700d0e40e464675d4f482a06a8046986aa61018d21d28af7eafbc9e3988d"},{"filename":"references/services/app-service/reliability.md","content":"# Azure App Service — Reliability Reference\n\n## Supported Plans & Zone Redundancy\n\n| Plan | Zone Redundancy | Min Instances | Health Check |\n|------|----------------|---------------|--------------|\n| Free/Shared (F1/D1) | ❌ Not supported | N/A | ❌ |\n| Basic (B1/B2/B3) | ❌ Not supported | N/A | ✅ |\n| Standard (S1/S2/S3) | ❌ Not supported | N/A | ✅ |\n| Premium v2 (P1v2+) | ✅ `zoneRedundant: true` + `capacity: 2` | 2 | ✅ |\n| Premium v3 (P0v3+) | ✅ `zoneRedundant: true` + `capacity: 2` | 2 (recommended) | ✅ |\n| Premium v4 (P0v4+) | ✅ `zoneRedundant: true` + `capacity: 2` | 2 (recommended) | ✅ |\n| Isolated v2 (I1v2+) | ✅ `zoneRedundant: true` + `capacity: 2` | 2 | ✅ |\n\n## Assessment Queries\n\n> **⚠️ Output format:** Use `--query \"data[]\" -o json` for `az graph query`. `-o table` only shows summary columns (`Count`, `Total_records`) and hides projected fields. Standard `az webapp` commands work fine with `-o table`.\n\n### Plan Zone Redundancy\n```bash\naz graph query -q \"\nresources\n| where resourceGroup =~ '\u003crg>'\n| where type =~ 'microsoft.web/serverfarms'\n| where kind !contains 'functionapp'\n| project name, sku=sku.name, capacity=sku.capacity, zoneRedundant=properties.zoneRedundant, location\n\" --subscriptions \u003csub-id> --query \"data[]\" -o json\n```\n\n### Health Check Configuration\n```bash\naz webapp config show --name \u003capp> --resource-group \u003crg> \\\n --query \"{healthCheckPath:healthCheckPath, alwaysOn:alwaysOn}\" -o table\n```\n\n### Client Affinity (ARR Affinity) — should be **disabled** for ZR / multi-region\n```bash\naz webapp show --name \u003capp> --resource-group \u003crg> \\\n --query \"clientAffinityEnabled\" -o tsv\n```\nWhen `true`, sticky sessions pin clients to a single instance and defeat zone load balancing.\n\n### Deployment Slots (for zero-downtime deploys)\n```bash\naz webapp deployment slot list --name \u003capp> --resource-group \u003crg> \\\n --query \"[].{name:name, state:state}\" -o table\n```\n\n### Auto Heal\n\nAuto Heal automatically restarts or mitigates your web app when it hits defined thresholds. Can be configured via Azure Portal, CLI or ARM/Bicep\n\nAzure Portal - App Service -> Diagnose and solve problems -> Auto-heal (under Diagnostic Tools) or directly: App Service -> Configuration -> General settings -> Auto Heal\n\nCLI\n```bash\naz webapp config set --resource-group \u003crg> --name \u003capp> --auto-heal-enabled true\n\n# Rules must be set via ARM PATCH (CLI doesn't expose autoHealRules directly)\naz rest --method patch \\\n --uri \"https://management.azure.com/subscriptions/{sub}/resourceGroups/{rg}/providers/Microsoft.Web/sites/{app}/config/web?api-version=2022-03-01\" \\\n --body '{\"properties\":{\"autoHealEnabled\":true,\"autoHealRules\":{...}}}'\n```\n\n## Configure: Zone Redundancy\n\n### Upgrade Plan (if needed)\n```bash\n# Check current SKU\naz appservice plan show --name \u003cplan> --resource-group \u003crg> --query \"sku\"\n\n# Upgrade to Premium v3 (if currently on lower tier)\naz appservice plan update \\\n --name \u003cplan> \\\n --resource-group \u003crg> \\\n --sku P1v3\n```\n\n### Enable Zone Redundancy\n```bash\n# Set min instances (required for ZR)\naz appservice plan update \\\n --name \u003cplan> \\\n --resource-group \u003crg> \\\n --number-of-workers 2\n\n# Enable ZR\naz resource update \\\n --resource-group \u003crg> \\\n --name \u003cplan> \\\n --resource-type \"Microsoft.Web/serverfarms\" \\\n --set properties.zoneRedundant=true\n```\n\n⚠️ Enabling zone redundancy may require scaling up first — for the supported App Service plans listed above, set the plan to at least 2 instances before enabling ZR.\n\n## Configure: Health Check\n\n```bash\n# Enable health check\naz webapp config set \\\n --name \u003capp> \\\n --resource-group \u003crg> \\\n --generic-configurations '{\"healthCheckPath\": \"/api/health\"}'\n```\n\n⚠️ **Warning:** Enabling health check causes an app restart. Configure during maintenance window.\n\n### Health Check Behavior\n- Ping interval: **1 minute**\n- Failure threshold: **10 consecutive failures** (configurable via `WEBSITE_HEALTHCHECK_MAXPINGFAILURES`)\n- After threshold: instance marked unhealthy, replaced within **1 hour**\n- Healthy threshold: **1 successful response** restores instance\n\n### Recommended: Always On\n```bash\naz webapp config set \\\n --name \u003capp> \\\n --resource-group \u003crg> \\\n --always-on true\n```\n\n## Configure: Disable Client Affinity (ARR Affinity)\n\nApp Service enables ARR affinity by default, which pins each client to a single instance via the `ARRAffinity` cookie. **This defeats zone-load-balancing and any multi-region routing**, so it should be disabled for stateless apps:\n\n```bash\naz webapp update --name \u003capp> --resource-group \u003crg> \\\n --client-affinity-enabled false\n```\n\nLeave it on **only** if your app stores state in instance memory and you cannot move it to a shared cache / database.\n\n## Configure: Deployment Slots (Zero-Downtime)\n\nDeployment slots complement reliability by enabling safe deployments:\n\n```bash\n# Create staging slot\naz webapp deployment slot create \\\n --name \u003capp> \\\n --resource-group \u003crg> \\\n --slot staging\n\n# Deploy to staging first, then swap\naz webapp deployment slot swap \\\n --name \u003capp> \\\n --resource-group \u003crg> \\\n --slot staging \\\n --target-slot production\n```\n\n## Back Up Support by SKU\n\n| Plan | Automatic Backup | Custom Backup |\n|------|----------------|---------------|\n| Free/Shared (F1/D1) | ❌ Not supported | ❌ Not supported |\n| Basic (B1/B2/B3) | ✅ | ✅ Configuration required |\n| Standard (S1/S2/S3) | ✅ | ✅ Configuration required |\n| Premium v2 (P1v2+) | ✅ | ✅ Configuration required |\n| Premium v3 (P0v3+) | ✅ | ✅ Configuration required |\n| Premium v4 (P0v4+) | ✅ | ✅ Configuration required |\n| Isolated v2 (I1v2+) | ✅ | ✅ Configuration required |\n\n- Automatic backups recommended since requires no configuration and is automatically enabled\n\n## IaC Patching: Bicep\n\n> **AVM modules:** If the project uses `br/public:avm/res/web/serverfarm` or `br/public:avm/res/web/site`, the parameter names differ from raw ARM (e.g. `zoneRedundant` and `skuCapacity` are top-level params; `siteConfig` is usually preserved). Always grep the actual module call (`Select-String -Path infra -Recurse -Pattern \"avm/res/web/\" -Context 0,15`) and patch the params already in use. The raw-Bicep examples below show the property paths to translate.\n\n### App Service Plan\n```bicep\nresource appServicePlan 'Microsoft.Web/serverfarms@2023-12-01' = {\n name: planName\n location: location\n sku: {\n name: 'P0v3'\n capacity: 2 // ← ADD (min 2 for ZR on P1v3)\n }\n properties: {\n reserved: true // Linux\n zoneRedundant: true // ← ADD\n }\n}\n```\n\n### Web App — Health Check\n```bicep\nresource webApp 'Microsoft.Web/sites@2023-12-01' = {\n name: appName\n location: location\n properties: {\n serverFarmId: appServicePlan.id\n siteConfig: {\n healthCheckPath: '/api/health' // ← ADD\n alwaysOn: true // ← ADD (recommended)\n }\n }\n}\n```\n\n## IaC Patching: Terraform\n\n### App Service Plan\n```hcl\nresource \"azurerm_service_plan\" \"plan\" {\n name = var.plan_name\n location = azurerm_resource_group.rg.location\n resource_group_name = azurerm_resource_group.rg.name\n os_type = \"Linux\"\n sku_name = \"P1v3\"\n worker_count = 2 # ← ADD (min 2 for ZR)\n zone_balancing_enabled = true # ← ADD\n}\n```\n\n### Web App — Health Check\n```hcl\nresource \"azurerm_linux_web_app\" \"app\" {\n name = var.app_name\n resource_group_name = azurerm_resource_group.rg.name\n location = azurerm_resource_group.rg.location\n service_plan_id = azurerm_service_plan.plan.id\n\n site_config {\n health_check_path = \"/api/health\" # ← ADD\n always_on = true # ← ADD\n }\n}\n```\n\n## Virtual Network Integration notes\n\n- Subnet sizing is important. VNet integration consumes IPs during scale-out slot swaps. \n- Undersized subnet cause scale or deployment failures during regional stress or failover. Recommend /26 minimum, /24 for larger plans. Zone Redundant plans require integration subnet to be sized for Zone Redundancy (more IPs).\n- Subnets cannot be resized after assignment without reconfiguring VNET integration.\n- Dependencies reached over private endpoints must have a per-region private endpoint and private DNS Zone. Sharing a single or global private DNS zone linked to the primary VNET will break failover.\n- Recommend Azure DNS Private Resolver per region, or per-region forwarders. Verify WEBSITE_DNS_SERVER/WEBSITE_DNS_ALT_SERVER are set with a fallback.\n- For predictable outbound traffic flow during failover, attach NAT Gateway to the subnet in each region to enable partner allow lists to work for all regions. Use Nat Gateway to avoid SNAT port exhaustion under load.\n- Service Endpoints vs Private Endpoints - Service endpoints are regional and don't failover. Use Private Endpoints per region for resiliency. \n\n## Multi-Region Notes\n\n- App Service supports deployment slots — use slot swap for safe regional deployments\n- Consider auto-scale rules to handle failover traffic surge\n- App Service Managed Certificates don't support custom domains on Front Door — use App Service Certificate or Key Vault\n- Client affinity (ARR Affinity) must be disabled for multi-region (see Configure: Disable Client Affinity above)\n- App Service Environment (v3) live in one subnet and is regional; multi-region still requires one ASE per region with Azure Front Door/Traffic Manager in front. \n\n\n## Reporting (for the assessment table)\n\nWhen the parent skill builds the feature-pivoted assessment table, report each App Service resource on the relevant rows:\n\n| Feature row | What to report |\n|---|---|\n| Zone redundancy — compute | `🟢 ON` if the **plan** has `zoneRedundant: true` AND `sku.capacity ≥ 2`. `🔴 OFF` if either is missing or the plan tier doesn't support ZR (Free / Shared / Basic / Standard). Annotate `(needs plan upgrade)` for unsupported tiers. |\n| Health probes | `🟢 ON` if `siteConfig.healthCheckPath` is set on the **app**. `🔴 OFF` if empty. Basic tier and above support it; Free/Shared do not — annotate `(needs plan upgrade)` in that case. |\n| Multi-region failover | `🟢 ON` if the same app is deployed in ≥2 regions behind Front Door / Traffic Manager. `🟡 PARTIAL` if multi-region is set up but `clientAffinityEnabled` is still `true` (sticky sessions break failover). `🔴 OFF` otherwise. |\n\n","content_type":"text/markdown; charset=utf-8","language":"markdown","size":10484,"content_sha256":"91f1ecd1c07826ed684a5361bcae60bca5fde4b818c4dae04ef965b8aee37077"},{"filename":"references/services/functions/reliability.md","content":"# Azure Functions — Reliability Reference\n\n## Supported Plans & Zone Redundancy\n\n| Plan | Zone Redundancy | Min Instances | Health Check |\n|------|----------------|---------------|--------------|\n| Flex Consumption (FC1) | ✅ `zoneRedundant: true` | Auto-managed | ❌ Platform health check not supported |\n| Premium (EP1/EP2/EP3) | ✅ `zoneRedundant: true` + `sku.capacity: 2` | `minimumElasticInstanceCount: 2` per app | ✅ `healthCheckPath` |\n| Consumption (Y1) | ❌ Not supported | N/A | ❌ Not supported |\n| Dedicated (P1v2+) | ✅ (treated as App Service) | `sku.capacity: 2` | ✅ `healthCheckPath` |\n\n## Assessment Queries\n\n### Zone Redundancy Check\n```bash\naz graph query -q \"\nresources\n| where resourceGroup =~ '\u003crg>'\n| where type =~ 'microsoft.web/serverfarms'\n| where kind contains 'functionapp' or kind =~ 'linux' or kind =~ 'elastic'\n| project name, sku=sku.name, zoneRedundant=properties.zoneRedundant, location\n\" --subscriptions \u003csub-id>\n```\n\n### Function App Instance Count (Premium)\n```bash\naz functionapp show --name \u003capp> --resource-group \u003crg> \\\n --query \"{minInstances:siteConfig.minimumElasticInstanceCount}\" -o table\n```\n\n## Configure: Zone Redundancy\n\n### Flex Consumption (FC1)\n```bash\n# Enable zone redundancy on plan\naz resource update \\\n --resource-group \u003crg> \\\n --name \u003cplan-name> \\\n --resource-type \"Microsoft.Web/serverfarms\" \\\n --set properties.zoneRedundant=true\n```\n\n### Premium (EP1/EP2/EP3)\n```bash\n# Enable zone redundancy + set min capacity\naz appservice plan update \\\n --name \u003cplan-name> \\\n --resource-group \u003crg> \\\n --number-of-workers 2\n\naz resource update \\\n --resource-group \u003crg> \\\n --name \u003cplan-name> \\\n --resource-type \"Microsoft.Web/serverfarms\" \\\n --set properties.zoneRedundant=true\n\n# Set minimum elastic instances per app\naz resource update \\\n --resource-group \u003crg> \\\n --name \u003capp-name> \\\n --resource-type \"Microsoft.Web/sites\" \\\n --set properties.siteConfig.minimumElasticInstanceCount=2\n```\n\n### Consumption (Y1) — upgrade path required\n\nConsumption (Y1) plans do **not** support zone redundancy. The user must upgrade the plan first:\n\n- **Recommended:** Upgrade to **Flex Consumption** — similar serverless model, supports ZR, no per-app minimum cost.\n- **Alternative:** Upgrade to **Premium (EP1+)** — more control, higher base cost (always-ready instances charged 24/7).\n\n⚠️ Inform the user of cost implications **before** initiating any plan change.\n\n## Configure: Health Endpoint\n\nFlex Consumption does NOT support platform health check (`healthCheckPath`). Instead, add an HTTP endpoint in code:\n\n### TypeScript (v4 programming model)\n```typescript\nimport { app } from \"@azure/functions\";\n\napp.http('health', {\n methods: ['GET'],\n authLevel: 'anonymous',\n route: 'health',\n handler: async () => ({ status: 200, body: 'OK' })\n});\n```\n\n### Python (v2 programming model)\n```python\nimport azure.functions as func\n\napp = func.FunctionApp()\n\[email protected](route=\"health\", methods=[\"GET\"], auth_level=func.AuthLevel.ANONYMOUS)\ndef health(req: func.HttpRequest) -> func.HttpResponse:\n return func.HttpResponse(\"OK\", status_code=200)\n```\n\n### C# (isolated worker)\n```csharp\n[Function(\"Health\")]\npublic IActionResult Health([HttpTrigger(AuthorizationLevel.Anonymous, \"get\", Route = \"health\")] HttpRequest req)\n{\n return new OkObjectResult(\"OK\");\n}\n```\n\n### Premium Functions — Platform Health Check\n```bash\naz webapp config set \\\n --name \u003capp-name> \\\n --resource-group \u003crg> \\\n --generic-configurations '{\"healthCheckPath\": \"/api/health\"}'\n```\n\n⚠️ Enabling health check causes an app restart.\n\n## IaC Patching: Bicep\n\n### App Service Plan (AVM module)\n```bicep\nmodule appServicePlan 'br/public:avm/res/web/serverfarm:0.5.0' = {\n params: {\n skuName: 'FC1'\n reserved: true\n zoneRedundant: true // ← ADD\n }\n}\n```\n\n### Premium Plan — extra settings\n```bicep\nmodule appServicePlan 'br/public:avm/res/web/serverfarm:0.5.0' = {\n params: {\n skuName: 'EP1'\n reserved: true\n zoneRedundant: true // ← ADD\n skuCapacity: 2 // ← ADD (min 2 for ZR)\n }\n}\n\n// On the function app resource:\nresource functionApp 'Microsoft.Web/sites@2023-12-01' = {\n properties: {\n siteConfig: {\n minimumElasticInstanceCount: 2 // ← ADD\n }\n }\n}\n```\n\n## IaC Patching: Terraform\n\n```hcl\nresource \"azurerm_service_plan\" \"plan\" {\n sku_name = \"FC1\"\n os_type = \"Linux\"\n zone_balancing_enabled = true # ← ADD\n}\n\n# Premium plan:\nresource \"azurerm_service_plan\" \"plan\" {\n sku_name = \"EP1\"\n os_type = \"Linux\"\n zone_balancing_enabled = true # ← ADD\n worker_count = 2 # ← ADD\n}\n\nresource \"azurerm_linux_function_app\" \"func\" {\n site_config {\n minimum_elastic_instance_count = 2 # ← ADD (Premium only)\n }\n}\n```\n\n## Multi-Region Notes\n\n- Flex Consumption standby costs ~$0 (pay-per-execution) — ideal for active-passive\n- Code must be deployed to both regions separately\n- Event Hub checkpoints are per-app — secondary starts from its own checkpoint on failover\n- Consider Event Hubs Geo-DR for true event replication\n\n## Reporting (for the assessment table)\n\nWhen the parent skill builds the feature-pivoted assessment table, report each Functions resource on the relevant rows:\n\n| Feature row | What to report |\n|---|---|\n| Zone redundancy — compute | `🟢 ON` if the **plan** has `zoneRedundant: true`. For Premium plans, also requires `sku.capacity ≥ 2` AND each Function App has `minimumElasticInstanceCount ≥ 2`. `🔴 OFF` if the plan tier doesn't support ZR (Consumption Y1) — annotate `(needs plan upgrade to Flex / Premium)`. |\n| Health probes | For Premium / Dedicated: `🟢 ON` if `siteConfig.healthCheckPath` is set, `🔴 OFF` otherwise. For Flex Consumption (FC1) / Consumption (Y1): always annotate `🔴 OFF (code-only fix)` — `healthCheckPath` is not supported on these plans, so an HTTP-triggered `/api/health` function must be added in app code (gated by user consent — see [configure-health-probes.md](../../configure-health-probes.md)). |\n| Multi-region failover | `🟢 ON` if the same Function App is deployed in ≥2 regions behind Front Door / Traffic Manager; otherwise `🔴 OFF`. |\n\n## Additional References\n\n- [Reliability in Azure Functions (Microsoft Learn)](https://learn.microsoft.com/en-us/azure/reliability/reliability-functions)\n","content_type":"text/markdown; charset=utf-8","language":"markdown","size":6389,"content_sha256":"95527035e16b597bc0e78e85b74f7e85de1a253fad76066b1d2fdbfc7cfbb5f8"},{"filename":"references/storage-redundancy-checks.md","content":"# Storage Redundancy Checks\n\n## Overview\n\nStorage accounts underpin Azure Functions, Container Apps (for host storage), and App Service. If compute is zone-redundant but storage is not, a zone failure can still cause downtime.\n\n## Replication Types\n\n| Type | Zone Redundancy | Region Redundancy | Description |\n|---|---|---|---|\n| LRS | ❌ None | ❌ None | 3 copies in one datacenter. No zone or region protection. |\n| ZRS | ✅ Zone-redundant | ❌ None | 3 copies across 3 availability zones in one region. |\n| GRS | ❌ None (LRS per region) | ✅ Region-redundant | LRS in primary + LRS in secondary region. Zone failure in primary = risk. |\n| GZRS | ✅ Zone-redundant | ✅ Region-redundant | ZRS in primary region + LRS in secondary region. Best protection. |\n| RA-GRS | ❌ None (LRS per region) | ✅ Region + read | Like GRS but secondary is readable. Still LRS within each region. |\n| RA-GZRS | ✅ Zone-redundant | ✅ Region + read | GZRS + read access to secondary. Maximum redundancy. |\n\n## Minimum Requirement\n\n- If compute is zone-redundant → storage MUST be at least **ZRS** (not GRS — GRS uses LRS in each region and is NOT zone-redundant)\n- For multi-region failover → storage should be **GZRS** (zone + region) or **GRS** (region only, accepts zone risk)\n\n## Resource Graph Queries\n\n> **⚠️ Output format:** Use `--query \"data[]\" -o json` (not `-o table`). `az graph query -o table` only renders summary columns and does not show projected fields.\n\n### Find All Storage Accounts and Their Replication\n\n```bash\naz graph query -q \"\nResources\n| where type =~ 'microsoft.storage/storageaccounts'\n| extend replication = tostring(sku.name)\n| extend tier = tostring(sku.tier)\n| project name, resourceGroup, location, replication, tier, kind\n| order by replication asc\n\" --query \"data[]\" -o json\n```\n\n> **💡 No SKU specified?** If a storage account was deployed without an explicit `sku.name` (raw ARM/Bicep) or `skuName` (AVM module), Azure defaults to **`Standard_GRS`**. Treat any storage account showing `Standard_GRS` as potentially \"defaulted\" rather than intentionally chosen — check the IaC source to confirm and recommend setting it explicitly to `Standard_ZRS` or `Standard_GZRS`.\n\n### Find Storage Accounts Using LRS (Not Zone Redundant)\n\n```bash\naz graph query -q \"\nResources\n| where type =~ 'microsoft.storage/storageaccounts'\n| where sku.name =~ 'Standard_LRS' or sku.name =~ 'Premium_LRS'\n| project name, resourceGroup, location, replication=sku.name\n\" --query \"data[]\" -o json\n```\n\n### Find Function App Host Storage Accounts\n\n```bash\n# List function apps and their storage connections\naz graph query -q \"\nResources\n| where type =~ 'microsoft.web/sites'\n| where kind contains 'functionapp'\n| project name, resourceGroup, location\n\" --query \"data[]\" -o json\n\n# Then for each function app, check its storage:\naz functionapp config appsettings list \\\n --name \u003capp-name> \\\n --resource-group \u003crg> \\\n --query \"[?name=='AzureWebJobsStorage'].value\" -o tsv\n```\n\n### Cross-Reference: Zone-Redundant Compute with Non-ZRS Storage\n\nThis is a critical gap detection query — zone-redundant compute paired with LRS storage:\n\n```bash\n# Step 1: Find zone-redundant plans\naz graph query -q \"\nResources\n| where type =~ 'microsoft.web/serverfarms'\n| where tobool(properties.zoneRedundant) == true\n| project planName=name, resourceGroup, location\n\" --query \"data[]\" -o json\n\n# Step 2: Check if associated storage accounts are ZRS\n# (Requires app-level inspection of AzureWebJobsStorage setting)\n```\n\n## Remediation\n\n### Upgrade Storage from LRS to ZRS\n\n```bash\n# Check if live migration is available (not all regions/account types support it)\naz storage account show \\\n --name \u003caccount-name> \\\n --resource-group \u003crg> \\\n --query \"{name:name, sku:sku.name, kind:kind, location:location}\"\n\n# Request live migration (Standard_LRS → Standard_ZRS)\naz storage account update \\\n --name \u003caccount-name> \\\n --resource-group \u003crg> \\\n --sku Standard_ZRS\n```\n\n⚠️ **Limitations:**\n- Live migration from LRS to ZRS is only supported for Standard general-purpose v2 accounts\n- Premium accounts and legacy account types require manual migration (create new ZRS account + copy data)\n- Migration can take hours to days depending on data volume\n\n### Upgrade Storage from LRS to GRS/GZRS\n\n```bash\n# Upgrade to GRS\naz storage account update \\\n --name \u003caccount-name> \\\n --resource-group \u003crg> \\\n --sku Standard_GRS\n\n# Upgrade to GZRS (zone + region redundant)\naz storage account update \\\n --name \u003caccount-name> \\\n --resource-group \u003crg> \\\n --sku Standard_GZRS\n```\n\n## Reporting\n\nFor the reliability checklist, mark the **ZRS Storage** column per storage account:\n- ✅ — SKU is `Standard_ZRS` or `Standard_GZRS`\n- ❌ (LRS) — `Standard_LRS` (no zone redundancy)\n- ❌ (GRS) — `Standard_GRS` or `Standard_RAGRS` (region-redundant but uses LRS in each region; still a zone-failure risk). Also flag this when no SKU is set in IaC at all — ARM/AVM defaults to `Standard_GRS`.\n\n⚠️ **Key distinction:** GRS provides region redundancy but uses LRS in each region. If compute is zone-redundant but storage is GRS (not ZRS/GZRS), a zone failure can still impact storage availability.\n","content_type":"text/markdown; charset=utf-8","language":"markdown","size":5206,"content_sha256":"2ef140db5d9e29958aa572c8bb90556d12ffc2da96881c2130c36c420bac5536"},{"filename":"references/zone-redundancy-checks.md","content":"# Zone Redundancy — Platform Overview\n\n## Overview\n\nZone redundancy distributes compute instances across availability zones within a region. If one zone fails, instances in other zones continue serving traffic automatically.\n\nThis file covers **platform-level discovery and concepts**. For service-specific assessment queries, configuration commands, and IaC patches, see:\n\n| Service | Reference |\n|---|---|\n| Azure App Service | [services/app-service/reliability.md](services/app-service/reliability.md) |\n| Azure Functions | [services/functions/reliability.md](services/functions/reliability.md) |\n\n> Azure Container Apps deep-dive references are planned for a future version of this skill. The discovery query below still surfaces those resources — just don't dispatch to a per-service reference for them yet.\n\n## Discovery: Find All Non-Zone-Redundant Compute\n\n> **⚠️ Output format:** Use `--query \"data[]\" -o json` for `az graph query`. `-o table` only renders summary columns (`Count`, `Total_records`) and hides projected fields. Pipe JSON through `jq` if you need a table view.\n\nUse this single query to discover every compute resource in scope that is **not** zone-redundant. Use it during Phase 1 (Discover Resources) to decide which service references to load.\n\n```bash\naz graph query -q \"\nResources\n| where type in~ ('microsoft.web/serverfarms', 'microsoft.app/managedenvironments')\n| extend zoneRedundant = tobool(properties.zoneRedundant)\n| where zoneRedundant == false or isnull(zoneRedundant)\n| project name, type, resourceGroup, location, sku=sku.name\n| order by type asc\n\" --query \"data[]\" -o json\n```\n\nFor each row in the result, dispatch to the matching service reference:\n- `microsoft.web/serverfarms` with `kind contains 'functionapp'` → Functions reference\n- `microsoft.web/serverfarms` (other kinds) → App Service reference\n- `microsoft.app/managedenvironments` → _planned (Container Apps)_ — surface in the discovery summary, do not deep-dive\n\n## Regions Supporting Availability Zones\n\n```bash\naz functionapp list-flexconsumption-locations --zone-redundant=true\n```\n\nCommon regions with AZ support across Functions, App Service, and Container Apps:\n\n- East US, East US 2, West US 2, West US 3\n- Central US, South Central US\n- North Europe, West Europe, UK South\n- France Central, Germany West Central, Sweden Central\n- Southeast Asia, Japan East, Australia East\n\n> **Service-specific region availability differs.** Always confirm support for the specific SKU/plan in the target region using the per-service reference.\n\n## Reporting\n\nFor the assessment table's `Zone redundancy — compute` row, the per-service references define exactly what `🟢 ON / 🟡 PARTIAL / 🔴 OFF` mean for that service (e.g. ZR + minimum instance count for App Service, Premium Functions). Only App Service and Functions have a per-service reference in this skill version; Container Apps support is planned.\n","content_type":"text/markdown; charset=utf-8","language":"markdown","size":2930,"content_sha256":"6a328177624859e845e87cec80f3fb59e434b3f077f7232669d5c795cd4310c7"}],"content_json":{"type":"doc","content":[{"type":"heading","attrs":{"level":1},"content":[{"text":"Azure Reliability Assessment & Configuration","type":"text"}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Quick Reference","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Property","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Details","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Best for","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Reliability posture assessment, zone redundancy enablement, multi-region failover setup","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Primary capabilities","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Reliability assessment table, Zone Redundancy Configuration, Multi-Region IaC Generation","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Supported services","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Azure Functions, App Service (Container Apps planned for a future version)","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"MCP tools","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Azure Resource Graph queries, Azure CLI commands","type":"text"}]}]}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"When to Use This Skill","type":"text"}]},{"type":"paragraph","content":[{"text":"Activate this skill when user wants to:","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Assess my Function app's reliability\"","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Assess my Web app's reliability\"","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Check the reliability of my resource group\" (App Service and Functions resources only)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Is my app zone redundant?\" (App Service and Functions resources only)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Is my app service plan zone redundant?\"","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Make my app zone redundant\" (App Service and Functions resources only)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Make my app service plan zone redundant\"","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Set up multi-region failover for my app\" (App Service and Functions resources only)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Check my reliability posture\"","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Find single points of failure\" (App Service and Functions resources only)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Enable high availability for my app\" (App Service and Functions resources only)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Check disaster recovery readiness\"","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"\"Improve my app's resilience\" (App Service and Functions resources only)","type":"text"}]}]}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"Scope note:","type":"text","marks":[{"type":"strong"}]},{"text":" This skill currently covers ","type":"text"},{"text":"Azure Functions and Azure App Service","type":"text","marks":[{"type":"strong"}]},{"text":" only. If the user asks about Azure Container Apps reliability, acknowledge that support is planned but not yet available, and only proceed with the parts that apply to App Service and Functions resources in scope.","type":"text"}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Prerequisites","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Authentication: user is logged in to Azure via ","type":"text"},{"text":"az login","type":"text","marks":[{"type":"code_inline"}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Permissions: Reader access on target subscription/resource group (for assessment)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Permissions: Contributor access (for configuration changes)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Azure Resource Graph extension: ","type":"text"},{"text":"az extension add --name resource-graph","type":"text","marks":[{"type":"code_inline"}]}]}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"MCP Tools","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Tool","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Purpose","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"mcp_azure_mcp_extension_cli_generate","type":"text","marks":[{"type":"code_inline"}]}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Generate ","type":"text"},{"text":"az","type":"text","marks":[{"type":"code_inline"}]},{"text":" CLI commands for resource queries and configuration","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"mcp_azure_mcp_subscription_list","type":"text","marks":[{"type":"code_inline"}]}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"List available subscriptions","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"mcp_azure_mcp_group_list","type":"text","marks":[{"type":"code_inline"}]}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"List resource groups","type":"text"}]}]}]}]},{"type":"paragraph","content":[{"text":"Primary query method: Azure Resource Graph via ","type":"text"},{"text":"az graph query","type":"text","marks":[{"type":"code_inline"}]},{"text":" (requires ","type":"text"},{"text":"az extension add --name resource-graph","type":"text","marks":[{"type":"code_inline"}]},{"text":").","type":"text"}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Assessment Workflow","type":"text"}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"Phase 1: Discover Resources","type":"text"}]},{"type":"ordered_list","attrs":{"order":1,"listStyle":"number"},"content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Identify scope","type":"text","marks":[{"type":"strong"}]},{"text":" — Ask user for resource group, subscription, or app name","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Query Azure Resource Graph","type":"text","marks":[{"type":"strong"}]},{"text":" to discover all resources in scope","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Classify resources","type":"text","marks":[{"type":"strong"}]},{"text":" by service type (Functions, Storage, etc.). If non-Functions compute (App Service sites that aren't Function Apps, Container Apps) is found, ","type":"text"},{"text":"note it but do not deep-dive","type":"text","marks":[{"type":"strong"}]},{"text":" — those services are planned for a future version of this skill.","type":"text"}]}]}]},{"type":"paragraph","content":[{"text":"Important:","type":"text","marks":[{"type":"strong"}]},{"text":" Always scope queries to the user's specified resource group or subscription. Add these filters to every Resource Graph query:","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Resource group: ","type":"text"},{"text":"| where resourceGroup =~ '\u003crg-name>'","type":"text","marks":[{"type":"code_inline"}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Subscription: Use ","type":"text"},{"text":"--subscriptions \u003csub-id>","type":"text","marks":[{"type":"code_inline"}]},{"text":" flag on ","type":"text"},{"text":"az graph query","type":"text","marks":[{"type":"code_inline"}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"App name: ","type":"text"},{"text":"| where name =~ '\u003capp-name>'","type":"text","marks":[{"type":"code_inline"}]}]}]}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"Phase 2: Assess Reliability","type":"text"}]},{"type":"paragraph","content":[{"text":"Two-step assessment: ","type":"text"},{"text":"platform-level discovery first, then per-service deep dive.","type":"text","marks":[{"type":"strong"}]}]},{"type":"paragraph","content":[{"text":"Step 1 — Platform discovery (find what's there).","type":"text","marks":[{"type":"strong"}]},{"text":" Use these to enumerate resources in scope and detect cross-cutting reliability gaps:","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Platform check","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Reference","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Zone redundancy — discovery","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/zone-redundancy-checks.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/zone-redundancy-checks.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Storage redundancy (cross-service)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/storage-redundancy-checks.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/storage-redundancy-checks.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Multi-region & global load balancers","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/multi-region-checks.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/multi-region-checks.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Front Door / Traffic Manager / App Insights probes","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/health-probe-checks.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/health-probe-checks.md","title":null}}]}]}]}]}]},{"type":"paragraph","content":[{"text":"Step 2 — Per-service deep dive.","type":"text","marks":[{"type":"strong"}]},{"text":" For each compute resource discovered in Step 1, load the matching service reference. The service reference is the single source of truth for that service's plan/SKU rules, assessment queries, CLI commands, IaC patches (Bicep + Terraform + AVM), and reporting hints.","type":"text"}]},{"type":"paragraph","content":[{"text":"This skill version ships ","type":"text"},{"text":"only the Azure Functions and App Service","type":"text","marks":[{"type":"strong"}]},{"text":" per-service references. Other compute services are listed below explicitly so the dispatch logic is unambiguous: if a resource matches an unsupported row, do ","type":"text"},{"text":"not","type":"text","marks":[{"type":"strong"}]},{"text":" attempt to load a reference, fabricate CLI commands, or generate IaC patches for it.","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Service detected","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Reference","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Azure Functions (","type":"text"},{"text":"microsoft.web/serverfarms","type":"text","marks":[{"type":"code_inline"}]},{"text":" with ","type":"text"},{"text":"kind contains 'functionapp'","type":"text","marks":[{"type":"code_inline"}]},{"text":")","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/services/functions/reliability.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/services/functions/reliability.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Azure App Service (non-Functions sites: ","type":"text"},{"text":"microsoft.web/sites","type":"text","marks":[{"type":"code_inline"}]},{"text":" without ","type":"text"},{"text":"kind contains 'functionapp'","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"microsoft.web/serverfarms","type":"text","marks":[{"type":"code_inline"}]},{"text":" without ","type":"text"},{"text":"kind contains 'functionapp'","type":"text","marks":[{"type":"code_inline"}]},{"text":")","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/services/app-service/reliability.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/services/app-service/reliability.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Azure Container Apps (","type":"text"},{"text":"microsoft.app/containerapps","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"microsoft.app/managedenvironments","type":"text","marks":[{"type":"code_inline"}]},{"text":")","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"⚪ Not yet shipped — planned for a future version","type":"text"}]}]}]}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"Handling unsupported services:","type":"text","marks":[{"type":"strong"}]},{"text":" If a resource matches an unsupported row above, surface it in the discovery summary, mark it as ","type":"text"},{"text":"⚪ not assessed (planned)","type":"text","marks":[{"type":"code_inline"}]},{"text":" in the Phase 3 table, and skip the per-service remediation steps for it. Do ","type":"text"},{"text":"not","type":"text","marks":[{"type":"strong"}]},{"text":" attempt to fabricate CLI commands or IaC patches for those services.","type":"text"}]}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"Phase 3: Generate Reliability Checklist","type":"text"}]},{"type":"paragraph","content":[{"text":"Present findings as a ","type":"text"},{"text":"feature-pivoted","type":"text","marks":[{"type":"strong"}]},{"text":" table: one row per reliability feature (Zone redundancy on compute, Zone-redundant storage, Health probes, Multi-region failover), with a single status indicator and the ","type":"text"},{"text":"specific resources","type":"text","marks":[{"type":"strong"}]},{"text":" that are relevant to that feature. This avoids the noise of one-row-per-resource with mostly ","type":"text"},{"text":"n/a","type":"text","marks":[{"type":"code_inline"}]},{"text":" cells. Do ","type":"text"},{"text":"not","type":"text","marks":[{"type":"strong"}]},{"text":" assign numeric scores or grades.","type":"text"}]},{"type":"code_block","attrs":{"wrap":false,"language":""},"content":[{"text":"🔍 Reliability Assessment — {scope}\n─────────────────────────────────────────────────────────────────────────────────────────────\nReliability Feature Status Resources\n─────────────────────────────────────────────────────────────────────────────────────────────\nZone redundancy — compute 🔴 OFF • plan-web-ii5trxva2ark4 (P1v3)\n • plan-ii5trxva2ark4 (FC1)\n\nZone-redundant storage 🔴 GRS • stii5trxva2ark4 (defaulted; no SKU set in IaC)\n\nHealth probes 🔴 OFF • func-api-ii5trxva2ark4 — needs code change (FC1)\n • app-web-ii5trxva2ark4 — no health check path\n\nMulti-region failover 🔴 OFF • Single region (eastus) only — Front Door not configured\n─────────────────────────────────────────────────────────────────────────────────────────────\n\nWant me to fix the 🔴 items? I'll do the quick wins first (App\nplan zone redundancy + health checks on supported plans), then ask before\nstorage migration and multi-region setup. (yes/no)","type":"text"}]},{"type":"paragraph","content":[{"text":"Rules for the table:","type":"text","marks":[{"type":"strong"}]}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Four feature rows, in this order:","type":"text","marks":[{"type":"strong"}]},{"text":" Zone redundancy — compute · Zone-redundant storage · Health probes · Multi-region failover. Omit a row entirely only if no resource in scope could ever apply to it.","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Status column","type":"text","marks":[{"type":"strong"}]},{"text":" is one symbol + one short word, no other characters:","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"🟢 ON","type":"text","marks":[{"type":"code_inline"}]},{"text":" — feature is fully enabled across all relevant resources in scope","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"🟡 PARTIAL","type":"text","marks":[{"type":"code_inline"}]},{"text":" — some resources have it, some don't (or partial config like liveness-only)","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"🔴 OFF","type":"text","marks":[{"type":"code_inline"}]},{"text":" — feature is missing on all relevant resources","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"For storage, replace ","type":"text"},{"text":"OFF","type":"text","marks":[{"type":"code_inline"}]},{"text":" with the current SKU when relevant (","type":"text"},{"text":"🔴 LRS","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"🔴 GRS","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"🟢 ZRS","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"🟢 GZRS","type":"text","marks":[{"type":"code_inline"}]},{"text":"). When no SKU is set in IaC, label as ","type":"text"},{"text":"🔴 GRS","type":"text","marks":[{"type":"code_inline"}]},{"text":" (ARM/AVM default) and note that in the resource line.","type":"text"}]}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Resources column","type":"text","marks":[{"type":"strong"}]},{"text":" lists only what's relevant to that feature, one bullet per resource:","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"For \"needs fixing\" resources, include a short inline reason (","type":"text"},{"text":"(FC1)","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"(defaulted; no SKU set)","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"liveness only","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"needs code change (FC1)","type":"text","marks":[{"type":"code_inline"}]},{"text":").","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"For resources that are ","type":"text"},{"text":"already ON","type":"text","marks":[{"type":"strong"}]},{"text":" for that feature, mention them on the same row with ","type":"text"},{"text":"— already ON","type":"text","marks":[{"type":"code_inline"}]},{"text":" so the user sees credit for what's right.","type":"text"}]}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Do not","type":"text","marks":[{"type":"strong"}]},{"text":" include ","type":"text"},{"text":"n/a","type":"text","marks":[{"type":"code_inline"}]},{"text":", ","type":"text"},{"text":"—","type":"text","marks":[{"type":"code_inline"}]},{"text":", or empty cells. If a feature doesn't apply to any resource in scope, drop the row.","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Do not","type":"text","marks":[{"type":"strong"}]},{"text":" include numeric scores, grades, or point totals.","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"End the assessment with a ","type":"text"},{"text":"single yes/no question","type":"text","marks":[{"type":"strong"}]},{"text":" that kicks off the staged remediation flow. Do not enumerate the per-resource fix list here — the user will see it after they say yes (Configuration Workflow Step 1).","type":"text"}]}]}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"UX Note:","type":"text","marks":[{"type":"strong"}]},{"text":" If the assessment finds the app ","type":"text"},{"text":"already has","type":"text","marks":[{"type":"strong"}]},{"text":" all core reliability features (zone redundancy, ZRS/GZRS storage, health probes), skip the fix-it question and jump straight to Configuration Workflow ","type":"text"},{"text":"Step 3","type":"text","marks":[{"type":"link","attrs":{"href":"#step-3-both-paths-multi-region-followup--ask-and-wait","title":null}}]},{"text":" (Multi-region follow-up). Do ","type":"text"},{"text":"NOT","type":"text","marks":[{"type":"strong"}]},{"text":" start any multi-region work without explicit consent.","type":"text"}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Configuration Workflow","type":"text"}]},{"type":"paragraph","content":[{"text":"When user wants to ","type":"text"},{"text":"fix","type":"text","marks":[{"type":"strong"}]},{"text":" findings from the assessment:","type":"text"}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"⛔ ALWAYS confirm with user before executing changes.","type":"text","marks":[{"type":"strong"}]},{"text":" Show what will change, any cost implications, and any destructive actions (e.g., environment recreation).","type":"text"}]}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"Step 1: Present Fix Plan + Choose Path","type":"text"}]},{"type":"paragraph","content":[{"text":"After assessment, if user says \"fix it\" / \"improve my reliability\" / \"enable zone redundancy\":","type":"text"}]},{"type":"ordered_list","attrs":{"order":1,"listStyle":"number"},"content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"List each fixable finding with the specific action","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Flag any cost implications or breaking changes","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Ask user which path they want:","type":"text","marks":[{"type":"strong"}]}]}]}]},{"type":"code_block","attrs":{"wrap":false,"language":""},"content":[{"text":"I'll start with the quick wins (no downtime, fast):\n\n1. ✏️ Enable zone redundancy on plan-ii5trxva2ark4 (Flex Consumption — no cost change)\n2. ✏️ Set health check path to /api/health on func-api-ii5trxva2ark4\n\nThen, separately, I'll ask if you want to upgrade storage:\n\n3. 🕒 Upgrade stii5trxva2ark4 from LRS → ZRS (small cost increase, migration takes hours)\n — Required for full zone redundancy, but I'll confirm with you before starting.\n\nHow would you like to apply these changes?\n\n A) Fix now — Run az CLI commands against your live resources (immediate, one-time)\n B) Patch my IaC — Update your Bicep/Terraform files so changes persist across deploys\n\n(If you use azd or Terraform, option B is recommended so `azd up` won't overwrite changes.)","type":"text"}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"Path A: Fix Now (CLI)","type":"text"}]},{"type":"paragraph","content":[{"text":"Run fixes against live resources using ","type":"text"},{"text":"az","type":"text","marks":[{"type":"code_inline"}]},{"text":" CLI commands. ","type":"text"},{"text":"Quick wins first, then ask before the slow storage migration.","type":"text","marks":[{"type":"strong"}]}]},{"type":"paragraph","content":[{"text":"The exact CLI commands per service live in the per-service references — pick the one(s) matching the resources discovered in Phase 2:","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Fix","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Reference","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Enable zone redundancy / configure health probes (Functions)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/services/functions/reliability.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/services/functions/reliability.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Enable zone redundancy / configure health probes (App Service)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/services/app-service/reliability.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/services/app-service/reliability.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Upgrade storage replication (cross-service)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/configure-storage.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/configure-storage.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Set up multi-region (cross-service)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/configure-multi-region.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/configure-multi-region.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Platform overview / verification","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/configure-zone-redundancy.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/configure-zone-redundancy.md","title":null}}]},{"text":", ","type":"text"},{"text":"references/configure-health-probes.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/configure-health-probes.md","title":null}}]}]}]}]}]},{"type":"paragraph","content":[{"text":"Execution order — always quick wins first:","type":"text","marks":[{"type":"strong"}]}]},{"type":"ordered_list","attrs":{"order":1,"listStyle":"number"},"content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Zone redundancy on compute","type":"text","marks":[{"type":"strong"}]},{"text":" (fast, in-place property update on the App's plan).","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Health probes","type":"text","marks":[{"type":"strong"}]},{"text":" (Premium / Dedicated only — in-place; for FC1 / Consumption, follow the consent gate in ","type":"text"},{"text":"configure-health-probes.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/configure-health-probes.md","title":null}}]},{"text":").","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Verify","type":"text","marks":[{"type":"strong"}]},{"text":" the compute changes succeeded before doing anything else.","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"⛔ STOP — Ask about storage upgrade.","type":"text","marks":[{"type":"strong"}]},{"text":" Compute is now zone-redundant, but storage may still be LRS or GRS. Ask the user explicitly:","type":"text"}]},{"type":"code_block","attrs":{"wrap":false,"language":""},"content":[{"text":"✅ Compute is now zone-redundant.\n\nTo be **fully zone-redundant**, your storage account also needs to be upgraded:\n • stii5trxva2ark4: currently `Standard_LRS` → needs `Standard_ZRS`\n\n⚠️ This is a live storage redundancy conversion:\n • Takes hours to days depending on data volume\n • Small ongoing cost increase (~$0.01/GB/month more)\n • Only supported for Standard general-purpose v2 accounts\n\nDo you want me to start the storage migration now? (yes / no / later)","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"yes","type":"text","marks":[{"type":"strong"}]},{"text":" → run ","type":"text"},{"text":"az storage account update --sku Standard_ZRS","type":"text","marks":[{"type":"code_inline"}]},{"text":" (or ","type":"text"},{"text":"migration start","type":"text","marks":[{"type":"code_inline"}]},{"text":" if needed); poll ","type":"text"},{"text":"az storage account show --query sku.name","type":"text","marks":[{"type":"code_inline"}]},{"text":" until it reports ","type":"text"},{"text":"Standard_ZRS","type":"text","marks":[{"type":"code_inline"}]},{"text":".","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"no / later","type":"text","marks":[{"type":"strong"}]},{"text":" → leave storage as-is; note in the re-assessment that ZR storage remains a gap.","type":"text"}]}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Multi-region","type":"text","marks":[{"type":"strong"}]},{"text":" — do NOT auto-run. Handled in ","type":"text"},{"text":"Step 3","type":"text","marks":[{"type":"strong"}]},{"text":" below as an explicit follow-up after re-assessment.","type":"text"}]}]}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"⚠️ Warning:","type":"text","marks":[{"type":"strong"}]},{"text":" If the user uses ","type":"text"},{"text":"azd up","type":"text","marks":[{"type":"code_inline"}]},{"text":" or ","type":"text"},{"text":"terraform apply","type":"text","marks":[{"type":"code_inline"}]},{"text":" later, CLI-only changes may be overwritten by the IaC definitions. Recommend also patching IaC after CLI fixes.","type":"text"}]}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"Path B: Patch IaC","type":"text"}]},{"type":"paragraph","content":[{"text":"Update the user's Bicep or Terraform files so reliability settings are persistent.","type":"text"}]},{"type":"paragraph","content":[{"text":"Step 1: Detect IaC type","type":"text","marks":[{"type":"strong"}]}]},{"type":"ordered_list","attrs":{"order":1,"listStyle":"number"},"content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Look for ","type":"text"},{"text":"infra/","type":"text","marks":[{"type":"code_inline"}]},{"text":" folder in project root","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"If not found, check project root for ","type":"text"},{"text":"*.bicep","type":"text","marks":[{"type":"code_inline"}]},{"text":" or ","type":"text"},{"text":"*.tf","type":"text","marks":[{"type":"code_inline"}]},{"text":" files","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"If still not found, ask user: \"Where are your IaC files located?\"","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Check for ","type":"text"},{"text":"*.bicep","type":"text","marks":[{"type":"code_inline"}]},{"text":" files → use Bicep patching","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Check for ","type":"text"},{"text":"*.tf","type":"text","marks":[{"type":"code_inline"}]},{"text":" files → use Terraform patching","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"If both exist, ask user which to patch","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"If no IaC exists, fall back to Path A (CLI) and inform user","type":"text"}]}]}]},{"type":"paragraph","content":[{"text":"Step 2: Classify each fix by risk level","type":"text","marks":[{"type":"strong"}]}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Fix","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Risk Level","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"What Happens","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Zone redundancy (App plan)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"🟢 Safe patch","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"In-place property update on next deploy","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Storage LRS → ZRS","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"🟡 Pre-migration required","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Live storage migration must complete before the IaC SKU change can deploy. ","type":"text"},{"text":"Never bundle with safe patches","type":"text","marks":[{"type":"strong"}]},{"text":" — use the two-deploy flow in Steps 3–5.","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Health check path (Basic/Standard/Premium / Dedicated)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"🟢 Safe patch","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"In-place update, but causes app restart","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Health check path (FC1 / Consumption)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"⚪ Code-only — ask first","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"healthCheckPath","type":"text","marks":[{"type":"code_inline"}]},{"text":" is unsupported. Adding a health endpoint requires adding an HTTP-triggered ","type":"text"},{"text":"/api/health","type":"text","marks":[{"type":"code_inline"}]},{"text":" function to ","type":"text"},{"text":"app code","type":"text","marks":[{"type":"strong"}]},{"text":". ","type":"text"},{"text":"Always ask the user for explicit consent before touching source code.","type":"text","marks":[{"type":"strong"}]},{"text":" Do ","type":"text"},{"text":"not","type":"text","marks":[{"type":"strong"}]},{"text":" patch IaC.","type":"text"}]}]}]}]},{"type":"paragraph","content":[{"text":"Step 3: Apply patches in two deploys (quick wins first)","type":"text","marks":[{"type":"strong"}]}]},{"type":"paragraph","content":[{"text":"The IaC patching framework (detection, AVM-module guidance, deploy-order rule, storage SKU patch) lives in:","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"IaC Type","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Framework reference","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Bicep","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/iac-patching-bicep.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/iac-patching-bicep.md","title":null}}]}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Terraform","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"references/iac-patching-terraform.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/iac-patching-terraform.md","title":null}}]}]}]}]}]},{"type":"paragraph","content":[{"text":"The actual ","type":"text"},{"text":"per-service compute patches","type":"text","marks":[{"type":"strong"}]},{"text":" (Function App plan ZR, App Service Plan ZR, etc.) live in the per-service references — load the matching service file from Phase 2 for the exact Bicep / Terraform / AVM snippets. Only Azure Functions and App Service have per-service references in this skill version; Container Apps is out of scope.","type":"text"}]},{"type":"paragraph","content":[{"text":"Deploy 1 — Quick wins only.","type":"text","marks":[{"type":"strong"}]},{"text":" Patch the 🟢 Safe items (zone redundancy on the App Service/Function App plan, health probes on Basic/Standard/Premium / Dedicated). Do ","type":"text"},{"text":"NOT","type":"text","marks":[{"type":"strong"}]},{"text":" include the storage SKU patch in this deploy.","type":"text"}]},{"type":"paragraph","content":[{"text":"After patching, ","type":"text"},{"text":"the skill runs the deploy itself","type":"text","marks":[{"type":"strong"}]},{"text":" (do not stop and tell the user to run it). Detect the deployment tool and confirm once before executing:","type":"text"}]},{"type":"code_block","attrs":{"wrap":false,"language":""},"content":[{"text":"📦 Patches applied to your IaC. Ready to deploy:\n Tool detected: azd (found azure.yaml)\n Command: azd up\n\nProceed with deployment? (yes / no)","type":"text"}]},{"type":"paragraph","content":[{"text":"On ","type":"text"},{"text":"yes","type":"text","marks":[{"type":"strong"}]},{"text":", run the appropriate command, stream output back to the user, and continue to the next step on success:","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"AZD project (has ","type":"text"},{"text":"azure.yaml","type":"text","marks":[{"type":"code_inline"}]},{"text":"): ","type":"text"},{"text":"azd up","type":"text","marks":[{"type":"code_inline"}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Bicep-only: ","type":"text"},{"text":"az deployment group create --resource-group \u003crg> --template-file infra/main.bicep --parameters @infra/main.parameters.json","type":"text","marks":[{"type":"code_inline"}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Terraform: ","type":"text"},{"text":"terraform plan -out tfplan","type":"text","marks":[{"type":"code_inline"}]},{"text":" → (show plan summary) → ","type":"text"},{"text":"terraform apply tfplan","type":"text","marks":[{"type":"code_inline"}]}]}]}]},{"type":"paragraph","content":[{"text":"On ","type":"text"},{"text":"no","type":"text","marks":[{"type":"strong"}]},{"text":", stop and report the patched files; do not proceed to Step 4 / Re-Assess.","type":"text"}]},{"type":"paragraph","content":[{"text":"If deployment fails, surface the error and stop — do not continue to the storage step.","type":"text"}]},{"type":"paragraph","content":[{"text":"⛔ STOP — Ask about storage upgrade before Deploy 2.","type":"text","marks":[{"type":"strong"}]},{"text":" After Deploy 1 succeeds, ask the user explicitly:","type":"text"}]},{"type":"code_block","attrs":{"wrap":false,"language":""},"content":[{"text":"✅ Quick-win patches deployed. Compute is now zone-redundant.\n\nTo be **fully zone-redundant**, your storage account also needs to be upgraded:\n • stii5trxva2ark4: currently `Standard_LRS` → needs `Standard_ZRS`\n\n⚠️ This is a two-part change:\n 1. Live storage migration (`az storage account migration start`) — takes hours to days\n 2. A second deploy to update your IaC's storage SKU to match\n\nDo you want me to start the storage migration now? (yes / no / later)","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"yes","type":"text","marks":[{"type":"strong"}]},{"text":" → the skill runs the migration command itself, polls until complete, then patches the storage SKU in IaC and runs ","type":"text"},{"text":"Deploy 2","type":"text","marks":[{"type":"strong"}]},{"text":" (now a no-op confirmation). The user does not need to run anything manually.","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"no / later","type":"text","marks":[{"type":"strong"}]},{"text":" → leave the storage SKU patch unapplied. Note in the re-assessment that ZR storage remains a gap; suggest revisiting later.","type":"text"}]}]}]},{"type":"paragraph","content":[{"text":"Step 4: Storage migration (only if user said yes in Step 3)","type":"text","marks":[{"type":"strong"}]}]},{"type":"paragraph","content":[{"text":"The skill runs these commands itself — do not ask the user to run them. Show progress as you go:","type":"text"}]},{"type":"code_block","attrs":{"wrap":false,"language":""},"content":[{"text":"🔄 Starting storage migration (this can take up to 72 hours)...\n\n az storage account migration start --name stii5trxva2ark4 \\\n --resource-group rg-example --sku Standard_ZRS --no-wait\n\n Polling: az storage account show --name stii5trxva2ark4 --query sku.name\n ...\n ✅ Migration complete: sku.name = Standard_ZRS","type":"text"}]},{"type":"paragraph","content":[{"text":"For very long migrations, you may surface a checkpoint to the user (\"this is still running, check back later\") rather than blocking the entire conversation.","type":"text"}]},{"type":"paragraph","content":[{"text":"Step 5: Deploy 2 — storage SKU patch","type":"text","marks":[{"type":"strong"}]}]},{"type":"paragraph","content":[{"text":"After the migration completes, the skill patches the storage SKU in IaC and runs the same deploy command as Step 3 (e.g. ","type":"text"},{"text":"azd up","type":"text","marks":[{"type":"code_inline"}]},{"text":"). This deploy is a no-op confirmation that the IaC matches the live state. Confirm once with the user before executing, then run it directly.","type":"text"}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"Step 2 (both paths): Re-Assess","type":"text"}]},{"type":"paragraph","content":[{"text":"After changes are applied (CLI) or deployed (IaC), automatically re-run the assessment and show the ","type":"text"},{"text":"same feature-pivoted table","type":"text","marks":[{"type":"strong"}]},{"text":" as Phase 3, with each feature row's status updated to reflect the new state. Briefly call out what changed since the previous run.","type":"text"}]},{"type":"code_block","attrs":{"wrap":false,"language":""},"content":[{"text":"🔄 Reliability Re-Assessment — rg-eventhubs-python-jan13 (eastus)\n───────────────────────────────────────────────────────────────────────────────────────\nReliability Feature Status Resources\n───────────────────────────────────────────────────────────────────────────────────────\nZone redundancy — compute 🟢 ON • plan-ii5trxva2ark4 (FC1) — now ON\n • plan-web-ii5trxva2ark4 (P1v3) — now ON\n\nZone-redundant storage 🟢 ZRS • stii5trxva2ark4 — GRS → ZRS\n\nHealth probes 🟡 PARTIAL • func-api-ii5trxva2ark4 — still off (FC1, code change declined)\n • app-web-ii5trxva2ark4 — now ON\n\nMulti-region failover 🔴 OFF • Single region (eastus) only\n───────────────────────────────────────────────────────────────────────────────────────\n\nWhat changed: Function App and App Service plan zone redundancy, storage replication and health probes on App Service.\n(Multi-region offered next — see Step 3.)","type":"text"}]},{"type":"heading","attrs":{"level":3},"content":[{"text":"Step 3 (both paths): Multi-region follow-up — ASK and WAIT","type":"text"}]},{"type":"paragraph","content":[{"text":"Multi-region is a significant cost/complexity step. Do ","type":"text"},{"text":"NOT","type":"text","marks":[{"type":"strong"}]},{"text":" start it automatically. After re-assessment, only if ","type":"text"},{"text":"all core single-region reliability features are 🟢 ON","type":"text","marks":[{"type":"strong"}]},{"text":" (zone-redundant compute, ZRS/GZRS storage, health probes), explicitly ask the user and ","type":"text"},{"text":"wait for their response","type":"text","marks":[{"type":"strong"}]},{"text":" before doing anything:","type":"text"}]},{"type":"code_block","attrs":{"wrap":false,"language":""},"content":[{"text":"🟢 Your app is now fully zone-redundant in {region}.\n\nThe next step (optional) is multi-region failover with Azure Front Door:\n • Deploys compute + storage in a second region (paired region recommended)\n • Adds Azure Front Door for global load balancing with health-probe-driven failover\n • Protects against full region outages\n • Estimated additional cost: ~2x compute (active-passive); Front Door ~$35/month base\n\nDo you want me to set up multi-region failover now? (yes / no / later)","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"yes","type":"text","marks":[{"type":"strong"}]},{"text":" → proceed with ","type":"text"},{"text":"references/configure-multi-region.md","type":"text","marks":[{"type":"link","attrs":{"href":"references/configure-multi-region.md","title":null}}]},{"text":". Confirm secondary region choice with the user, then:","type":"text"}]},{"type":"ordered_list","attrs":{"order":1,"listStyle":"number"},"content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Generate the multi-region IaC (Bicep / Terraform additions for the secondary region + Front Door).","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Confirm once with the user: ","type":"text"},{"text":"📦 Multi-region IaC generated. Ready to deploy with \\","type":"text","marks":[{"type":"code_inline"}]},{"text":"azd up`. Proceed? (yes / no)`","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"On ","type":"text"},{"text":"yes","type":"text","marks":[{"type":"strong"}]},{"text":", ","type":"text"},{"text":"the skill runs the deploy itself","type":"text","marks":[{"type":"strong"}]},{"text":" (","type":"text"},{"text":"azd up","type":"text","marks":[{"type":"code_inline"}]},{"text":" / ","type":"text"},{"text":"az deployment group create","type":"text","marks":[{"type":"code_inline"}]},{"text":" / ","type":"text"},{"text":"terraform apply","type":"text","marks":[{"type":"code_inline"}]},{"text":") and streams output. Do not stop and tell the user to run it.","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"After successful deploy, run a final re-assessment so the user sees Multi-region failover flip to 🟢 ON.","type":"text"}]}]}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"no / later","type":"text","marks":[{"type":"strong"}]},{"text":" → leave the deployment as-is. Note that single-region zone-redundant is a reliable end state; multi-region can be revisited anytime.","type":"text"}]}]}]},{"type":"blockquote","content":[{"type":"paragraph","content":[{"text":"⛔ Do not skip the wait.","type":"text","marks":[{"type":"strong"}]},{"text":" Do not generate multi-region IaC, deploy a Front Door, or modify any files until the user has explicitly said yes. If core reliability is not yet all 🟢, do ","type":"text"},{"text":"not","type":"text","marks":[{"type":"strong"}]},{"text":" ask about multi-region — finish the core gaps first.","type":"text"}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Priority Classification","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Priority","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Criteria","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Action","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Critical","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"No zone redundancy AND production workload","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Fix immediately","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"High","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"LRS storage on zone-redundant compute","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Fix within days","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Medium","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"No multi-region (single region but zone-redundant)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Plan for next sprint","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Low","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Missing health probes or monitoring gaps","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Track and fix","type":"text"}]}]}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Error Handling","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Error","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Message","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Remediation","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Authentication required","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"\"Please login\"","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Run ","type":"text"},{"text":"az login","type":"text","marks":[{"type":"code_inline"}]},{"text":" and retry","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Access denied","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"\"Forbidden\"","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Confirm Reader/Contributor role assignment","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Plan doesn't support ZR","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"\"Upgrade required\"","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Inform user of plan upgrade path + cost delta","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Region doesn't support AZ","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"\"Region limitation\"","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Suggest supported regions","type":"text"}]}]}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Best Practices","type":"text"}]},{"type":"bullet_list","content":[{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Run reliability assessments after every significant infrastructure change","type":"text"}]}]},{"type":"list_item","content":[{"type":"paragraph","content":[{"text":"Test failover scenarios periodically (at least quarterly)","type":"text"}]}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Skill Boundaries","type":"text"}]},{"type":"table","attrs":{"layout":null},"content":[{"type":"tr","content":[{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Action","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"This skill does","type":"text"}]}]},{"type":"th","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Hand off to","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Assess reliability posture","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"✅ Yes","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"—","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Recommend improvements","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"✅ Yes","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"—","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Enable zone redundancy (CLI commands)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"✅ Yes","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"—","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Patch Bicep/Terraform for reliability","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"✅ Yes","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"—","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Generate multi-region IaC","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"✅ Yes (additions for the secondary region + Front Door)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"azure-prepare","type":"text","marks":[{"type":"code_inline"}]},{"text":" for full new-app IaC scaffolding","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Deploy IaC for reliability changes","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"✅ Yes (runs ","type":"text"},{"text":"azd up","type":"text","marks":[{"type":"code_inline"}]},{"text":" / ","type":"text"},{"text":"terraform apply","type":"text","marks":[{"type":"code_inline"}]},{"text":" / ","type":"text"},{"text":"az deployment","type":"text","marks":[{"type":"code_inline"}]},{"text":" itself, after user confirmation)","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"azure-deploy","type":"text","marks":[{"type":"code_inline"}]},{"text":" for general/non-reliability deploys","type":"text"}]}]}]},{"type":"tr","content":[{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Validate pre-deployment","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"Reliability checks only","type":"text"}]}]},{"type":"td","attrs":{"colspan":1,"rowspan":1,"colwidth":null,"alignment":""},"content":[{"type":"paragraph","content":[{"text":"azure-validate","type":"text","marks":[{"type":"code_inline"}]},{"text":" for full validation","type":"text"}]}]}]}]},{"type":"hr","attrs":{"markup":"---"}}]},"metadata":{"date":"2026-06-05","name":"azure-reliability","author":"@skillopedia","source":{"stars":1127,"repo_name":"azure-skills","origin_url":"https://github.com/microsoft/azure-skills/blob/HEAD/skills/azure-reliability/SKILL.md","repo_owner":"microsoft","body_sha256":"263db1f9ec216e5c17570df669d6cdd99047dd9a62a4441c1ab43adfedbe7d58","cluster_key":"f4a68929932cb99682aac4d393776f8c18a3708ecc148e736cfdb30da0ff9ec3","clean_bundle":{"format":"clean-skill-bundle-v1","source":"microsoft/azure-skills/skills/azure-reliability/SKILL.md","attachments":[{"id":"f453642f-aefe-5d46-af75-6db44507431d","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/f453642f-aefe-5d46-af75-6db44507431d/attachment.md","path":"references/configure-health-probes.md","size":2190,"sha256":"76cbd211b7787625ea07502972c2214fe80e12262997880017bcc13bf2b13f62","contentType":"text/markdown; charset=utf-8"},{"id":"b6ce1412-8543-5b24-9519-b14b8a359e8a","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/b6ce1412-8543-5b24-9519-b14b8a359e8a/attachment.md","path":"references/configure-multi-region.md","size":18934,"sha256":"b68152b061e021402a5862ca7d3de2364bb9a30c00a586c67c68c4a0b16b5ddc","contentType":"text/markdown; charset=utf-8"},{"id":"c1440aef-f8b3-54e6-94ab-b05c525b5a04","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/c1440aef-f8b3-54e6-94ab-b05c525b5a04/attachment.md","path":"references/configure-storage.md","size":5481,"sha256":"e2e4c1ec09593e2e7bc1bfdab69404a361424ef1507348e1ead0efb88d0fad60","contentType":"text/markdown; charset=utf-8"},{"id":"dcc18e89-0bcc-5a5e-b3ef-51d9069ac9ef","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/dcc18e89-0bcc-5a5e-b3ef-51d9069ac9ef/attachment.md","path":"references/configure-zone-redundancy.md","size":1955,"sha256":"e07930ab95e876dce61e013fa55abd62ec7e292056ac62e6cda6e68910f54f4f","contentType":"text/markdown; charset=utf-8"},{"id":"403697d8-85a2-59b4-be9b-e50b0ae7114c","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/403697d8-85a2-59b4-be9b-e50b0ae7114c/attachment.md","path":"references/health-probe-checks.md","size":3544,"sha256":"7a84e9d5120e51b5c10d74d21421b5224dcbea95bfd7ae38bd0966deff971b61","contentType":"text/markdown; charset=utf-8"},{"id":"459c6792-5295-5c97-9a06-727d2cb78092","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/459c6792-5295-5c97-9a06-727d2cb78092/attachment.md","path":"references/iac-patching-bicep.md","size":7031,"sha256":"96d86b9dc0f0bbf4a9b012e7f97bf36dbd6358b942f96c6cb9f217f0d3447f49","contentType":"text/markdown; charset=utf-8"},{"id":"47d588bf-f634-5f8b-b62b-5a2c7b7b337a","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/47d588bf-f634-5f8b-b62b-5a2c7b7b337a/attachment.md","path":"references/iac-patching-terraform.md","size":4815,"sha256":"e9ad7a612fe483d9bcd0298907487b5a8756bc6e7faeee0cca1ed58d45cbfe72","contentType":"text/markdown; charset=utf-8"},{"id":"df63f8cf-5dba-5cd7-98b1-71e2c5a73633","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/df63f8cf-5dba-5cd7-98b1-71e2c5a73633/attachment.md","path":"references/multi-region-checks.md","size":6120,"sha256":"b5f3700d0e40e464675d4f482a06a8046986aa61018d21d28af7eafbc9e3988d","contentType":"text/markdown; charset=utf-8"},{"id":"88684d6f-cb8e-50e7-8d64-a8a438ceffee","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/88684d6f-cb8e-50e7-8d64-a8a438ceffee/attachment.md","path":"references/services/app-service/reliability.md","size":10484,"sha256":"91f1ecd1c07826ed684a5361bcae60bca5fde4b818c4dae04ef965b8aee37077","contentType":"text/markdown; charset=utf-8"},{"id":"f19ad9fe-3b92-573e-9bfd-56a702bf7da7","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/f19ad9fe-3b92-573e-9bfd-56a702bf7da7/attachment.md","path":"references/services/functions/reliability.md","size":6389,"sha256":"95527035e16b597bc0e78e85b74f7e85de1a253fad76066b1d2fdbfc7cfbb5f8","contentType":"text/markdown; charset=utf-8"},{"id":"81c9720b-8638-53ac-9e7d-e5f1e59098bb","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/81c9720b-8638-53ac-9e7d-e5f1e59098bb/attachment.md","path":"references/storage-redundancy-checks.md","size":5206,"sha256":"2ef140db5d9e29958aa572c8bb90556d12ffc2da96881c2130c36c420bac5536","contentType":"text/markdown; charset=utf-8"},{"id":"3d95d189-ec38-5a95-86e0-ca6f6d276e31","key":"uploads/10433ee7-ad12-4ae0-b34e-97553e46c6c8/3d95d189-ec38-5a95-86e0-ca6f6d276e31/attachment.md","path":"references/zone-redundancy-checks.md","size":2930,"sha256":"6a328177624859e845e87cec80f3fb59e434b3f077f7232669d5c795cd4310c7","contentType":"text/markdown; charset=utf-8"}],"bundle_sha256":"2b79266c05bf4ba19d72ee1c95fde3ea91e974a5818205ff3e05e8e4b34792ba","attachment_count":12,"text_attachments":12,"attachment_storage":"skillopedia-attachments-v1","binary_attachments":0,"excluded_attachments":[]},"cluster_size":2,"skill_md_path":"skills/azure-reliability/SKILL.md","import_metadata":{"date":"2026-06-05","author":"@skillopedia","version":"v1","category":"devops-infrastructure","category_label":"DevOps"},"exact_dupes_collapsed_into_this":1},"license":"MIT","version":"v1","category":"devops-infrastructure","metadata":{"author":"Microsoft","version":"1.0.2"},"import_tag":"clean-skills-v1","description":"Assess and improve the reliability posture of PaaS Applications (Azure Functions and Azure App Service). Scans deployed resources for zone redundancy, ZRS storage, health probes, and multi-region failover. Presents a feature-pivoted checklist, then drives staged remediation (CLI or IaC patches) end-to-end with user confirmation. WHEN: \"assess reliability\", \"check reliability\", \"zone redundant\", \"multi-region failover\", \"high availability\", \"disaster recovery\", \"single points of failure\", \"reliability posture\", \"resiliency\"."}},"renderedAt":1782981894067}

Azure Reliability Assessment & Configuration Quick Reference | Property | Details | |---|---| | Best for | Reliability posture assessment, zone redundancy enablement, multi-region failover setup | | Primary capabilities | Reliability assessment table, Zone Redundancy Configuration, Multi-Region IaC Generation | | Supported services | Azure Functions, App Service (Container Apps planned for a future version) | | MCP tools | Azure Resource Graph queries, Azure CLI commands | When to Use This Skill Activate this skill when user wants to: - "Assess my Function app's reliability" - "Assess my Web…