What does Multi-Jurisdiction Redundancy Pack include?
Eight components engineered together for resilient cross-region email operations. First: two or three datacenter regions selected based on your customer footprint, regulatory posture, and threat model (typical pairings are EU primary plus EU candidate, or EU primary plus non-EU secondary). Second: independent IP pools per region with separate reputation maintenance, so a deliverability problem in one region does not propagate to the other. Third: DNS health-check failover with default 90-second detection window (tightenable to 30 seconds for critical operations). Fourth: cross-region SMTP suppression list synchronisation every 60 seconds, so unsubscribes and hard bounces captured in one region are honoured in all regions within one minute. Fifth: bounce and complaint event aggregation across regions into a unified view. Sixth: quarterly disaster recovery drill with documented RTO and RPO achieved against the targets defined for your operation. Seventh: daily replication lag monitoring with alert thresholds tied to your RPO target. Eighth: incident response playbook covering six standard failure modes with rollback procedures. Setup EUR 2,999 once, EUR 599 monthly recurring.
Why does email infrastructure need cross-region redundancy?
Real-world data from 2025-2026. On October 20, 2025, an AWS DNS resolution failure in DynamoDB regional endpoints triggered a worldwide business outage affecting major customer-facing services across multiple industries. In March 2026, AWS UAE went partially offline after physical incidents at the datacenter. The pattern is consistent: regional outages happen, they affect entire regions simultaneously, and operations without tested failover stay down until the upstream provider restores service. For email operations specifically, downtime translates directly into business impact. Hostperl reported in 2026 that database failures cost on average EUR 5,600 per minute across industries; e-commerce sites during peak hours reach EUR 50,000 per minute. Email is the backbone for transactional notifications (order confirmations, password resets, security alerts, billing), marketing campaign delivery (revenue events with hard deadlines), and customer support (ticket responses, account updates). An email infrastructure outage during a product launch, regulatory notification window, or peak commercial period creates revenue loss and compliance exposure that vastly exceeds the cost of a redundancy bundle.
What is the difference between active-passive and active-active configurations?
Two architectural patterns with different cost and recovery characteristics. Active-passive runs the primary region serving all production traffic and a secondary region maintained as a warm standby (infrastructure ready, data replicated, but not serving traffic). On failure detection, DNS health checks fail over to the secondary region within 90 seconds (or 30 seconds with tightened settings). RTO measured in minutes, RPO measured in seconds to minutes depending on replication mode. Cost is lower because secondary capacity does not need to match primary capacity at all times. Active-active runs both regions serving production traffic simultaneously with traffic split by latency or weight. On failure, the surviving region absorbs all traffic with RTO measured in seconds because no infrastructure needs to start up; only traffic routing adjusts. Cost is approximately double because both regions run full production capacity. We recommend active-passive for most email operations because the failover RTO is well within acceptable bounds for email workloads and the cost differential is material. Active-active is appropriate for transactional email handling payment confirmations, security-critical notifications, or compliance-driven communications where seconds of downtime have measurable business cost.
What replication mode does the bundle use?
Default is asynchronous replication with sub-minute lag targets. Synchronous replication (RPO zero, zero data loss) is available for operations requiring guaranteed-zero data loss but carries a write latency penalty of 30-100ms depending on inter-region distance, which can affect throughput for high-volume SMTP submission. Asynchronous replication (RPO measured in seconds, low write latency) is the standard configuration for email workloads where a few seconds of replication lag is acceptable in exchange for full local write throughput. Eventual consistency (RPO in minutes, lowest write latency) is available for non-critical aggregation pipelines like deliverability analytics. We document which replication mode applies to which data class in your specific deployment: suppression lists use sub-minute async with 60-second sync interval; bounce events use eventual consistency with 5-minute aggregation; subscriber list changes use sub-30-second async. The choice of mode per data class is documented in the deployment runbook and reviewed during the quarterly DR drill.
How are quarterly DR drills conducted?
Scheduled exercise with documented procedures and measured outcomes. The drill cycle runs every three months. Each drill targets one failure mode from the six covered in the playbook (primary datacenter outage, network partition between regions, IP blacklist event in primary region, DNS hijack scenario, regional regulatory event forcing emergency departure, BGP hijack). The drill begins with notification to the customer 5 business days before execution, runs during a low-traffic maintenance window agreed in advance, executes the failover procedure end-to-end with measured timing at each step, validates the secondary region handles real production traffic successfully, runs failback to primary after validation, and produces a written report covering RTO achieved (measured), RPO achieved (measured), any procedure deviations, any rollback rough edges identified, and corrective actions for the next drill. The report is delivered within 10 business days of drill completion. The first drill happens within 60 days of bundle onboarding to validate that the deployed architecture meets the documented targets before relying on it in a real incident.
What does Multi-AZ is not multi-region mean and why does it matter?
The most common misconception in disaster recovery planning. Multi-AZ (multiple availability zones within one region) protects against single-datacenter failures and is the default high-availability pattern on cloud platforms. It does not protect against region-wide outages because all availability zones in a region share underlying control-plane dependencies, regional DNS infrastructure, and sometimes physical infrastructure or upstream connectivity. The October 2025 AWS US-EAST-1 outage affected all availability zones in that region simultaneously because the failure was in regional DynamoDB DNS infrastructure. Operations that thought they had high availability through Multi-AZ discovered during the outage that they had high availability against datacenter failures but not against regional failures. Multi-jurisdiction redundancy means deploying production capability in genuinely separate regions with separate upstream providers, separate control planes, separate physical infrastructure, and separate regulatory jurisdictions. The bundle deploys to ASH datacenter regions that are operationally and physically independent: an outage in Romania does not affect Bulgaria, Panama, Hong Kong, or Singapore. That independence is what genuine cross-region redundancy delivers.
How does the suppression list sync work across regions?
Real-time concern with a real-time mechanism. When a recipient unsubscribes through any region, that suppression must propagate to all regions before any region attempts to send to that recipient again. The standard implementation uses a centralised suppression authority with sub-minute synchronisation: each region writes suppression events to a globally replicated store, each region polls the store every 60 seconds for new entries, and the SMTP send path consults the local suppression cache before every send. If the primary region experiences a 90-second outage during DNS failover, the secondary region picks up SMTP submission with a suppression cache that may be up to 60 seconds stale; the next sync cycle (within 60 seconds of failover completion) catches up. For operations requiring stricter guarantees (paid newsletter operators, regulated communications), we offer a synchronous suppression mode that confirms write to all regions before acknowledging an unsubscribe to the user; this trades a few hundred milliseconds of unsubscribe latency for strict cross-region consistency. The choice is documented per customer in the onboarding runbook.
What about IP reputation in a secondary region that rarely sends?
Independent and continuously maintained. The most common operational mistake in cross-region email DR is letting the secondary region IPs sit idle until failover, at which point they have no reputation and the failover triggers deliverability problems on top of the original outage. The bundle prevents this by maintaining secondary region IPs as an active sending pool throughout normal operations. Configurations vary: some customers split sending 70/30 between primary and secondary continuously, some send specific traffic categories (transactional, lower-volume) from the secondary, some run scheduled reputation maintenance sends. Whichever pattern fits, the secondary IPs maintain warm reputation with mainstream mailbox providers at all times. When failover triggers, the secondary IPs absorb the full traffic load with established reputation rather than starting from zero. We document the specific reputation maintenance pattern in your deployment runbook and review it during the quarterly DR drill.