Performance Benchmarks
MuleSoft Intelligent Document Processing (IDP) performance benchmarks provide a clear view of how IDP performs under real-world workloads. The benchmarks highlight document processing speed, system scaling behavior, and expected response times under normal and peak load conditions.
These insights help you plan capacity, establish SLAs, and build scalable and stable IDP implementations. The metrics serve as design guidance rather than hard limits for understanding expected performance envelopes and optimizing IDP deployments.
Understanding Benchmark Metrics
The benchmark data serves as planning and design guidance for IDP implementations. These results support:
-
Capacity planning: P95 latency values provide baselines for SLA planning
-
Workload design: Typical loads near "moderate" ranges deliver predictable performance
-
Load management: Peak throughput is supported, though brief queuing may occur
-
Expectation setting: Performance can vary by document size, complexity, or network conditions
Use P95 latency as the baseline for SLA and capacity planning while keeping typical loads within moderate ranges for optimal performance stability.
Test Configuration
The benchmarks were conducted using controlled test scenarios to measure IDP performance across multiple workload conditions.
| Component | Details |
|---|---|
Environment |
STGX |
Document Type |
4.5MB single-page document |
Pods |
12-15 (autoscaling enabled) |
Concurrency |
30 concurrent threads |
Testing Tool |
JMeter |
Test Duration |
15 minutes per scenario |
Latency Types
When submitting documents to IDP, two timing measurements are critical for understanding system performance.
| Latency Type | Description |
|---|---|
Initial Response Latency |
Time from submission (POST) to acknowledgment that processing started |
Complete Operation Latency |
End-to-end time from submission until processed data is available via GET or callback |
Performance Benchmarks
SLA Performance Guidelines
| Load Scenario | Requests per Minute | Average Latency | P95 Latency | Error Rate | Configuration |
|---|---|---|---|---|---|
Moderate Load |
~600 requests per minute |
Initial: 0.5s |
Initial: 0.5s |
0.49% |
5 threads for 15 minutes |
Peak Load |
2,000+ requests per minute |
Initial: 1.3s |
Initial: 1.32s |
0% |
30 threads for 15 minutes |
Performance Insights
The benchmark results provide key insights into IDP response times and system reliability characteristics under various load conditions.
Response Times
-
IDP acknowledges document submissions in under 1.5 seconds, even during heavy load
-
End-to-end document processing completes in 7-11 seconds on average
-
Response times remain consistent across different load scenarios
Scaling and Reliability
-
System auto-scales under pressure to sustain throughput up to ~2,000 requests per minute per tenant
-
No processing errors or performance degradation observed up to 30 concurrent threads
-
Workloads can be confidently planned within these ranges for stable, high-volume document processing
System Specifications
The following specifications define system limits and infrastructure capabilities for IDP document processing and platform requirements.
Considerations
The following considerations provide guidance for planning IDP implementations and understanding factors that influence performance in production environments.
Planning Recommendations
-
Use these benchmarks as design guidance rather than hard performance limits
-
Plan capacity using P95 latency values for SLA establishment
-
Maintain workloads within moderate ranges for predictable performance
-
Monitor queue depth and scaling behavior during production deployments



