Update06/10/2025

Generally Available: Workspace Gateway Autoscaling in Azure API Management

Azure
Microsoft

Update from:
patrick-devicebase (expert)

Generally Available: Workspace Gateway Autoscaling in Azure API Management
Announcing the general availability of autoscaling in Azure API Management workspace gateways.
With the introduction of autoscaling in workspace gateways, customers can now increase the reliability of their deployments. Autoscaling dynamically adjusts the scale of a workspace gateway based on real-time demand, ensuring seamless performance during traffic spikes while maintaining efficient usage during quieter periods. Autoscaling can be configured using rules—for example, based on gateway metrics, such as CPU and memory utilization, or a schedule.
This capability not only enhances operational reliability but also optimizes platform costs by scaling resources precisely to the needs of your API environment.

Generally Available: Federated Logging in Azure API Management
Announcing the General Availability of Federated Logging in Azure API Management!

With federated logging, platform teams gain a centralized view of logs across the entire API Management instance, while API teams get scoped access to logs for APIs within their own workspaces enabled through the integration of Azure API Management workspaces and Azure Log Analytics.

What this means for you:

Centralized observability for platform teams across the full API estate
Scoped, role-based access for API teams to view only relevant logs
Streamlined troubleshooting, monitoring, and performance tracking
Improved planning and decision-making with greater insights into API trends

Generally Available: Workspace Gateway Metrics in Azure API Management
Announcing the General Availability of Workspace Gateway Metrics in Azure API Management!
Two new metrics—CPU Utilization (%) and Memory Utilization (%)—are now available for API Management workspace gateways, offering deeper visibility into resource usage and helping teams make informed decisions around scaling and cost management.
New metrics now available:

CPU Utilization (%) – Tracks CPU usage across workspace gateway units
Memory Utilization (%) – Tracks memory consumption across workspace gateway units

These metrics enable:

Proactive scaling to meet performance demands
Optimized resource allocation for greater cost efficiency
Better operational insights to support platform reliability and growth

Generally Available: LLM Logging in AI Gateway Capabilities in Azure API Management
Announcing the general availability of LLM logging in Azure API Management’s AI Gateway. This capability allows you to capture and store LLM prompts, completions, and token usage metrics, enabling deeper insights into AI usage and the performance of intelligent applications.

By enabling this feature through diagnostic settings, you can send logs to Azure Monitor, Azure Storage, or Azure Event Hubs for long-term retention, real-time processing, and advanced analytics.

Key benefits:

End-to-end visibility into LLM usage across your applications
Support for streaming completions without disrupting user experience
Enterprise-grade observability aligned with your compliance and monitoring requirements

Generally Available: Session Aware Load Balancing in Azure API Management
Announcing the general availability of session-aware load balancing in Azure API Management. With this feature, requests from the same user session are consistently routed to the same backend, enabling support for advanced stateful scenarios such as the OpenAI Assistant API and Batch API.

Key benefits:

Enables stateful session management across backend services
Supports AI scenarios requiring session continuity, including OpenAI Assistant API
Fully managed within Azure API Management with no custom routing logic required

Generally Available: Change streams for Azure Cosmos DB for MongoDB (vCore)
Now you can easily capture real-time data changes using change streams in Azure Cosmos DB for MongoDB (vCore). This new capability eliminates polling overhead and enables real-time analytics, automation, and synchronization across distributed systems. And with native support for the PyMongo driver, Kafka, and Azure Functions, integrating change streams into your workflows is simple and scalable.

Generally Available: Versionless customer managed keys in Azure Database for PostgreSQL flexible server

You no longer need to manage the key version for customer-managed keys (CMKs) in Azure Database for PostgreSQL flexible server. Versionless keys automate the propagation of newly rotated CMKs independent of the rotation method (auto-rotation or manual).

Generally Available: Cross-cloud data governance with Unity Catalog on Azure Databricks
Unity Catalog access to Amazon S3 data on Azure Databricks is now generally available. This release gives you seamless, multicloud data governance by unifying permissions and access controls across S3 and Azure Data Lake Storage.

With cross-cloud data governance on Unity Catalog, you benefit from:

Unified governance. Manage access policies, security controls, and compliance standards from one place without juggling siloed systems.
Frictionless data access. Securely discover, query, and analyze data across clouds in a single workspace, eliminating silos and reducing complexity.
Stronger security and compliance. Gain centralized visibility, tagging, lineage, data classification, and auditing across all your cloud storage.