Guest

Design Zone for WAN/MAN

Cisco Unified Communications Voice over Spoke-to-Spoke DMVPN Test Results and Recommendations

Table Of Contents

Cisco Unified Communications Voice over Spoke-to-Spoke DMVPN Test Results and Recommendations

Overview

Issues to Address with Voice Over Spoke-to-Spoke DMVPN

Limitations of this Document

Solution Description

Co-location of Voice Main Site and DMVPN Hub

Benefits and Drawbacks of the Solution

Influence of DMVPN on Latency, Jitter, and Packet Loss

Latency

Jitter

Risk of Packet Loss

Planning and Best Practices

Physical Bandwidth Sizing

Allocation of Available Bandwidth Among Traffic Classes

Management of Traffic Using the Low-Latency Queue

Location-based CAC

Call Admission Control for Internet Key Exchange

Assessment of WAN Performance

Mitigation of Path Switch Jitter Distortion

Selection of Spoke Router Hardware

Monitoring DMVPN Performance for Voice

Using IP SLA for Monitoring of DMVPN Performance

Realtime Monitoring Tool

Spoke Router CLI

Summary of Best Practices

Implementation and Configuration

Topology

Location-Based Call Admission Control

Automated Alternate Routing

Music On Hold

SRST and PSTN Gateway at Each Remote Site

WAN Requirements

Differentiated Services

Quality of Service

G.729 Codec

IP SLA Options Relevant to Voice Over DMVPN

UDP Jitter Operation

UDP Jitter With Codec and TOS

VoIP Operation

UDP Echo

Test Approach

Software and Hardware Environment

Voice Traffic Model

Firmware/Software

Test Coverage and Results

Coverage

Results

Performance Information

CPU Load versus Throughput

Tunnel Overhead for Idle Tunnels

Tunnel Overhead for Active Tunnels

PPS versus BPS

Tunnel Setup Load

Serial Interface Overhead

Path Switch Distortion

Typical IP Phone Statistics

Example Configuration Files

Hub Routers Configuration

Cisco 7206 Hub Router Configuration

7609 IPsec Concentrator Configuration

Spoke Router Configuration

MGCP Gateway and SRST Router Configuration

References


Cisco Unified Communications Voice over Spoke-to-Spoke DMVPN Test Results and Recommendations


This document describes the interoperability of the Cisco Dynamic Multipoint VPN (DMVPN) solution with voice over IP (VoIP), which is part of the Cisco Unified Communications solution. This document provides some specific results and recommendations resulting from testing conducted by the Cisco Global Government Systems Group (GGSG) Government Systems Engineering team. The testing was specifically intended to validate the Unified Communications solution in conjunction with a specific spoke-to-spoke DMVPN solution already deployed by a service provider on behalf of a U.S. government agency; however, many of the concepts discussed are applicable to any deployment of VoIP with spoke-to-spoke DMVPN.

The following detailed design guides for the solutions tested already exist:

Cisco Unified Communications SRND Based on Cisco Unified CallManager 4.x— http://www.cisco.com/en/US/docs/voice_ip_comm/cucm/srnd/4x/uc4_2.html

Dynamic Multipoint VPN (DMVPN) Design Guide— http://www.cisco.com/en/US/docs/solutions/Enterprise/WAN_and_MAN/DMVPDG.html

Voice and Video Enabled IPsec (V3PN) SRND— http://www.cisco.com/en/US/docs/solutions/Enterprise/WAN_and_MAN/V3PN_SRND/V3PN_SRND.html

This document extends the scope of the existing documents with a focus on deploying a Cisco Unified Communications solution together with a spoke-to-spoke DMVPN network. The purpose is to provide additional solution design guidance, including predicting performance of particular platforms as spoke routers and configuring the solution to ensure high voice quality.

The data in this document is the result of testing in the GGSG test lab with a sufficient scale of the DMVPN and Unified Communications solutions to mimic real-world behavior. More information on the test methodology and results is provided in References.

Overview

Cisco Unified Communications products are designed and tested with various types of WANs in mind. For example, Call Admission Control (CAC) takes into account the possibility of limited WAN bandwidth over which to transport media. Such features work in conjunction with QoS policies on WAN gateway routers to ensure that available media and call control signaling bandwidth is available.

The standard Unified Communications configurations apply to a deployment using the DMVPN network for media transport. This includes configuration of the Cisco Unified CallManager and Unity servers, gateways, IP phones, and LAN infrastructure.

To account for DMVPN network topology attributes and behavior, some additional requirements and best practices apply. These are discussed throughout this document.

Issues to Address with Voice Over Spoke-to-Spoke DMVPN

There are the following main concerns about the interoperability of Cisco Unified Communications with spoke-to-spoke DMVPN networks:

Caveats exist related to voice quality because of differences in latency between the spoke-hub-spoke path and the spoke-to-spoke path when the spoke-to-spoke tunnel is initially established.

The point-to-multipoint nature of the tunnel interfaces on the spoke routers creates the possibility that a spoke router access link (or associated low-latency queue) could be overwhelmed with traffic from multiple sources.

See References for more information on these issues.

These issues are addressed by using the following voice solution attributes:

Call admission control (location-based CAC)

Quality of service (QoS)

Automated alternate routing (AAR) for calls encountering bandwidth/CAC limits

Survivable Remote Site Telephony (SRST) and Public Switched Telephone Network (PSTN) gateway at each remote site

Adequate WAN performance and capacity, latency, and bandwidth

This remainder of this document addresses these issues and provides additional best practices information.

Limitations of this Document

Significant variations are possible when designing a DMVPN network and the underlying WAN. The test approach, results, measurements, and conclusions in this document are most applicable to the DMVPN network used by the target government customers with a single-cluster Cisco Unified Communications topology. Although the strategies and best practices discussed are generally applicable, the detailed characteristics of other DMVPN deployments need to be considered. Cisco did not examine other variations in topology and scale for this document.

Solution Description

This document is focused on a government network deploying voice over IP using an existing DMVPN network for media and signaling transport between the main office and branch offices. This deployment uses the following solutions in combination:

Cisco Unified Communications (based on the Cisco Unified CallManager version 4.x)

Voice and Video Enabled IPsec VPN (V3PN)

DMVPN

The existing DMVPN network of the customer has a dual-hub, dual DMVPN cloud topology with a dual-tier headend architecture; and approximately 2000 spoke routers connected via the MPLS network of the WAN service provider. The same service provider operates the DMVPN network on behalf of the government customers. This includes hub complex routers at centralized data centers and spoke routers at each customer site. The second DMVPN cloud uses a different service provider network.

The DMVPN network is shared among multiple agencies in a U.S. government department. The DMVPN solution is configured to provide spoke-to-spoke tunnels between any two spoke routers. The DMVPN network has a single level of spokes subtending the hub level.

V3PN QoS recommendations were followed when deploying the DMVPN components, including Low-Latency Queuing (LLQ) and Class-Based Weighted Fair Queuing (CBWFQ) at the interfaces between the spoke routers and the service provider network. The service provider network maintains the Differentiated Services Code Point (DSCP) information in each packet as the packet traverses the WAN. Enhanced Interior Gateway Protocol (EIGRP) distributes routes to each spoke router on the DMVPN cloud, and Next Hop Routing Protocol (NHRP) resolves the DMVPN subnet (tunnel) address to the non-broadcast multiple access (NBMA) IP address of the destination spoke router. Figure 1 shows the general DMVPN topology.

Hub-and-spoke DMVPN topologies were not considered because the target customers have access only to a spoke-to-spoke DMVPN network.

Figure 1 DMVPN Topology

The Cisco Unified Communications solution to be deployed uses the "multisite WAN with centralized call control" topology described in the Cisco Unified Communications SRND based on Cisco Unified CallManager 4.x. The scalability needs of the target customers range from a single main site with 400 users and 3 remote sites with 200 users each (total 1000 users) to a main site with 5000 users and up to 100 remote sites with up to 250 users each (total 30,000 users). The testing did not include clustering over the WAN or multiple clusters with intercluster trunks because these topologies are not required for the deployment scale. Specifically, clustering over the WAN is an option for branch office sites requiring more lines than supported by SRST and require multiple-cluster topologies when the number of lines is more than the capacity of a single cluster or the customer has more large branch office sites than supported by the clustering-over-the-WAN topology. Figure 2 shows the multisite WAN with centralized call control topology integrated with a DMVPN WAN.

Each agency deploys a separate Cisco Unified Communications solution using PSTN connectivity between agencies.

Figure 2 Multisite WAN with Centralized Call Control Topology

Each remote site is equipped with a PSTN gateway router with SRST capability. The new systems are replacing legacy PBX systems or upgrading standalone Cisco Unified CallManager clusters or multisite WAN with centralized call control that are currently using the PSTN for intersite traffic.

Notice that the main site connects to the DMVPN as a spoke, so communication between the main site and any remote site (or between remote sites) is via a spoke-to-spoke tunnel. The permanent tunnel to the hub from each spoke router is normally only briefly used for voice traffic while the spoke-to-spoke tunnel is first being established. The details of the DMVPN and WAN topologies are transparent to the voice solution except for their impact on latency, jitter, and packet loss.

Co-location of Voice Main Site and DMVPN Hub

This test effort did not examine the topology where the Cisco Unified CallManager cluster is located behind a DMVPN because the target customers have access only to the DMVPN via the spoke routers. This is a characteristic of this particular DMVPN network that does not apply to all DMVPN deployments. If the Cisco Unified CallManager cluster is placed behind a DMVPN hub, the voice requirements would be similar because dynamic spoke-to-spoke tunnels would still be generated between branch office sites. The main difference is the nature of the connections between the branch offices and the main site. These would be permanent static tunnels instead of dynamic tunnels kept active by call control keepalive traffic.

Benefits and Drawbacks of the Solution

Spoke-to-spoke DMVPN provides clear benefits for Cisco Unified Communications compared with a hub-and-spoke topology. Spoke-to-spoke tunnels can provide a reduction in end-to-end latency by reducing the number of WAN hops and decryption/encryption stages. In addition, DMVPN offers a simplified means of configuring the equivalent of a full mesh of point-to-point tunnels without the associated administrative and operational overhead. The use of spoke-to-spoke tunnels also reduces traffic on the hub, permitting bandwidth and processing capacity savings.

Spoke-to-spoke DMVPN networks, however, have the following caveats associated with them:

Spoke-to-spoke tunnels are not as resilient to some forms of failure as spoke-to-hub tunnels because there is no routing protocol running through the tunnel.

A spoke-to-spoke tunnel may take a path through the underlying WAN that is more congested than the spoke-hub-spoke path tunnel.

Without careful planning, spoke routers may become overrun with incoming traffic from multiple remote spokes, resulting in degraded voice quality.

The transition of Real-Time Protocol (RTP) packet routing from the spoke-hub-spoke path to the spoke-to-spoke path can create a momentary audio distortion under certain circumstances.

Influence of DMVPN on Latency, Jitter, and Packet Loss

As with any WAN-based transport for voice, the voice quality of intersite calls over a DMVPN network is impacted by latency, jitter, and packet loss. The following sections describe in more detail how the DMVPN solution influences these factors.

Latency

The following factors can contribute to latency on the DMVPN network:

Encryption/decryption delay

Serialization delay

Routing/switching/queuing delay

WAN propagation time

Delays because of DMVPN spoke and hub congestion

A WAN connection between sites has some latency associated with normal operation. DMVPN induces an additional small amount of latency from encrypting and decrypting packets. This was measured as an average of 2 ms on the spoke-to-spoke path (a single encryption and decryption). Other sources of WAN latency are not unique to DMVPN. Latency is higher via the spoke-hub-spoke path because of additional hops, the possibility of additional propagation time, additional decryption/encryption, and the processing overhead on the hub router(s).

Jitter

The amount of jitter (delay variation between packets) is influenced by the following:

Rerouting packets along a different path

Congestion/queuing

Other network conditions

One factor that introduces a transient jitter into a VoIP stream is a change in the path of the RTP stream from spoke-hub-spoke to direct spoke-to-spoke. If two spokes have an established spoke-to-spoke tunnel before initiation of the RTP stream, there is no cut-through issue. The spoke-to-spoke tunnel may have been initiated by a previous VoIP call or some data traffic between the two spokes.


Note IP phones on a spoke regularly exchange keepalive messages with the Cisco Unified CallManager servers on a different spoke. These keepalives keep the tunnel between the spokes established. Any other data traffic (for example, IP SLA probes, network management, and so on) between the two sites can also keep the tunnel established.


Spoke-to-spoke tunnels are dynamically created as the need for them arises. This process takes a relatively small amount of time (less than one second), but the sequence of events has the potential of momentarily impacting voice quality. While establishing the spoke-to-spoke tunnel, the network routes the packets via the spoke-hub-spoke path. This requires a packet to traverse the WAN twice (once from the originating spoke to the hub and another from the hub to the terminating spoke) and to be decrypted/encrypted at the hub, which results in additional latency compared with the spoke-to-spoke path.

Immediately after the spoke-to-spoke tunnel becomes available, new voice packets are transmitted along this new path. This scenario can cause out-of-sequence packets and momentary high inter-arrival jitter at the terminating spoke router as packets arrive from both paths. The jitter buffer algorithm on the terminating VoIP device attempts to account for the jitter but may sometimes be unable to conceal the discontinuity without some audible distortion. A user might describe the sounds as a glitch, gap, or splice, or the sound might not be noticed at all.


Note This effect does not apply to calls traversing an established tunnel between the two spokes.


There are jitter two scenarios caused by the path switch to the spoke-to-spoke tunnel. The usual case (see Figure 3) is when the spoke-hub-spoke path has more latency than the spoke-to-spoke path. When traffic from the spoke-to-spoke tunnel begins arriving at the destination endpoint, it overlaps with an earlier portion of the media stream that traversed the hub path. The endpoint sees the out-of-sequence packets and reorders them before playing them out; or, if the jitter buffer is too full to accommodate the new packets, discards the older packets and retains the newer packets for playout. If the latency difference is significant enough (greater than 100ms), a user may notice a portion of a syllable missing (a "splice").

Figure 3 Jitter and Out-Of-Sequence Packets Because of Path Switch—Spoke-To-Spoke Path Faster

Figure 4 shows the other case. If the spoke-to-spoke path has more latency than the spoke-hub-spoke path, the terminating endpoint detects an unexpected increase in delay (jitter) between the last packet traversing the spoke-hub-spoke path and the first packet traversing the spoke-to-spoke path. If the latency difference is significant enough, a user may notice a period of silence or white noise within the speech.

Figure 4 Jitter Because of Path Switch—Hub Path Faster

Measurements taken for this test effort show that all tested Cisco endpoints were able to avoid or conceal an audio distortion provided that the difference in latency between the spoke-hub-spoke path and the direct spoke-to-spoke is 50 ms or less. Higher latency differences up to 100 ms provided acceptable voice quality for the overlapping stream case (hub path slower) because the momentary audio distortion was almost imperceptible. When there is more latency on the spoke-to-spoke path, the tolerance is lower. Most Cisco endpoints concealed the effects of jitter up to 25 ms, but at 50 ms and higher, some distortion begins to be audible.

Although this behavior impacts voice quality, it has proven acceptable to the user community. Further details and mitigation approaches are provided in Mitigation of Path Switch Jitter Distortion.

Risk of Packet Loss

Packet loss can occur for a variety of reasons, but most often oversubscription of allocated bandwidth is the cause. This can be because of overloading physical interfaces or exceeding bandwidth allocated to QoS queues.

Because spoke-to-spoke tunnels are established dynamically, there is no QoS policy per spoke-to-spoke tunnel that queues, shapes, or polices traffic to manage bandwidth associated with that specific tunnel. Instead, per-interface policies are used. These policies maintain control over the traffic traversing the entire interface, which may include multiple spoke-to-spoke tunnels. There is the potential for multiple spokes to direct more voice traffic to a particular spoke than the interface or low-latency queue of the spoke has bandwidth to receive. Likewise, traffic sources within the local network may attempt to direct more voice traffic out through the DMVPN spoke router than there is available bandwidth. This can lead to poor voice quality.

The components of the Cisco Unified Communications solution are not aware of the specifics of the DMVPN network. CAC can deny a call over the WAN if all bandwidth, at either the originating or terminating spoke, has been allocated to other calls. CAC protects existing calls from the risk of an additional call overflowing the low-latency queue resulting in packet loss across all calls on that interface. Although a Cisco Unified CallManager cluster can be configured to keep track of the WAN bandwidth used between regions (with one region per spoke site, in general), it is otherwise considered "topology unaware". Thus, there is a risk that changes in network topology or bandwidth availability could impact voice quality. Per-location bandwidth values must take LLQ bandwidth allocations and encryption and data link layer protocol overhead into account.

Figure 5 shows a simplified example of voice quality degradation in the absence of CAC. The low-latency queue for traffic leaving the WAN toward Spoke 4 is configured to support two calls. Each originating spoke router also has the low-latency queue bandwidth for two calls in the outgoing direction to the WAN. Assuming congestion conditions on all interfaces, each originating spoke router is easily able to route the media stream for a single call with no packet loss, but when all three streams converge at Spoke 4, one-third of the voice packets are dropped because of the policing function of the low-latency queue.

Figure 5 Degradation Of Voice Traffic Without Call Admission Control

If location-based CAC is enabled (see Figure 6), a third call is denied access to the WAN so that the capacity of the low-latency queue to and from spoke 4 is not exceeded.

Figure 6 Operation Of Call Admission Control

Planning and Best Practices

When you plan to deploy Cisco Unified Communications capability using a DMVPN network for signaling and media transport between sites, consider the following factors:

Sizing of physical bandwidth from the spoke router to the WAN

Allocation of available bandwidth for expected classes of traffic on the spoke router

Management of traffic that uses the low-latency queue

Assessment of WAN performance

Selection of spoke router hardware

Monitoring DMVPN performance to ensure voice quality

Additional WAN planning strategies are provided in Cisco Unified Communications SRND Based on Cisco Unified CallManager 4.x.

Physical Bandwidth Sizing

The amount of bandwidth between the spoke router and the WAN is based on standard traffic engineering methodology. Among the factors influencing this decision are the following:

Average and peak traffic rates

Protocols and applications using the bandwidth

Expected amount of voice traffic

Availability of alternate paths

Traffic growth rate

Encryption and tunneling overhead

Layer 2 encapsulation overhead


Note The voice topology you select may lead to additional bandwidth utilization for music on hold, recorded announcements, auto attendant functionality concentrated at the main site for PSTN calls arriving at a branch site, and voice mail access.


When you calculate required bandwidth, consider that IPsec encryption and Multipoint Generic Routing Encapsulation (mGRE) tunneling for DMVPN adds significant overhead compared with the small payload size of a G.729 RTP packet. The Layer 3 data rate for a G.729 call (50 pps) is 24 kbps in each direction. Encrypting the media stream using IPsec Tunnel Mode for mGRE increases that rate to approximately 56 kbps. Added to this is the Layer 2 overhead, which varies by interface type from 4 bytes to 14 bytes per packet. See Voice and Video Enabled IPsec VPN (V3PN) SRND for more details. Table 1 lists the estimated bandwidth per call for the interfaces on the target DMVPN spoke routers for both G.729 and G.711. Use of the G.729 codec is generally recommended for bandwidth conservation on WAN links.


Note Additional overhead may be required if voice signaling and media are encrypted by the endpoints.


Table 1 Per-Call Bandwidth Including IPsec, GRE, and Layer 2 Encapsulation 

Interface type
Per-call bandwidth (kbps) with IPsec, GRE, and Layer 2 Overhead
G.711
G.729

Gigabit Ethernet

127.2

71.2

OC3 ATM AAL5

237.6

118

2xT1 Multilink PPP

114.4

58.4

Frame Relay

112.4

56.4


In the case of the target government customers, the uplinks to the service provider WANs are already established and administered by the service provider, but deployment of additional voice traffic may require increases in bandwidth on those uplinks after considering call traffic patterns. A standard method of estimating intersite call traffic is to study current traffic patterns on the system being replaced and to add additional volume based on interviewing the customer regarding call rates, durations, peak traffic times, growth plans, and so on. See Cisco Unified Communications SRND Based on Cisco Unified CallManager 4.x and Voice and Video Enabled IPsec VPN (V3PN) SRND for more information.

Allocation of Available Bandwidth Among Traffic Classes

Each spoke site has different bandwidth requirements and differences in expected amounts of traffic for each class. In this case for the targeted government customers, the primary WAN service provider offers a range of pre-defined QoS policies that can be selected for deployment on the spoke router. Table 2 lists the bandwidth allocation by traffic class offered by the service provider. The example sites modeled for this test used profile 110, which provides 40 percent of interface bandwidth for CoS1 (DSCP 46; per-hop behavior EF) and a percentage of the remaining bandwidth for other classes of traffic (Figure 7 and Figure 8).

To make the best use of location-based call admission control (CAC), the bandwidth allocated to voice needs to be allocated symmetrically at each end of the serial link. This means that if 40 percent of the interface bandwidth is allocated to voice on the spoke router uplink to the WAN, the router at the other end needs to have the same percentage allocated for voice traffic toward the spoke router. In addition, the spoke router link to a secondary DMVPN cloud also needs to have at least the same bandwidth allocated to voice as the link to the primary DMVPN cloud.

Table 2 Example COS Packages Offered by Service Provider 

CoS
Package
Classes Available
Profiles
Available
Profile
Number
Bandwidth Allocation
COS1, COS2, COS3, COS4

Multimedia
High

4

24

101

90

0

0

100

102

80

80

10

10

103

80

60

30

10

104

80

40

30

30

105

60

80

10

10

106

60

60

30

10

107

60

40

30

30

108

50

0

0

100

Multimedia
Low

4

16

109

40

80

10

10

110

40

60

30

10

111

40

40

30

30

112

20

80

10

10

113

20

60

30

10

114

20

40

30

30

115

10

80

10

10

116

10

60

30

10

117

10

40

30

30

Critical
Data

3

7

118

0

100

0

0

119

0

80

10

10

120

0

60

30

10

121

0

40

30

30

Business
Data

2

3

122

0

0

100

0

123

0

0

90

10

124

0

0

50

50


Figure 7 Classification of Application Traffic to DSCP Marking

Figure 8 Mapping of DSCP Per-Hop Behavior to Service Provider Classes

Management of Traffic Using the Low-Latency Queue

Packets marked as DSCP 46 (per-hop behavior EF) retain this marking after encryption by the DMVPN spoke router and are routed through a dedicated low-latency queue. The service provider network keeps the real-time priority across the WAN because of the marking. If adequate bandwidth is reserved for the low-latency queue, latency, jitter, and loss should be minimized for these packets. However, voice traffic can exceed reserved bandwidth and, under congestion conditions, voice quality can be impacted as packets drops increase.

Location-based CAC

For DMVPN, Cisco recommends configuring location-based CAC on the Cisco Unified CallManager. Location-based CAC tracks the bandwidth in use for media streams controlled by the Cisco Unified CallManager. When insufficient bandwidth remains to support a new media stream, calls are intercepted or rerouted. This helps prevent the low-latency queue from overflowing and causing voice quality problems because of packet loss. When a call ends, its bandwidth is made available to new calls.

Provided that all sources of DSCP 46 (per-hop behavior EF) traffic are accounted for by the value for maximum voice bandwidth for each Cisco Unified CallManager location, location-based CAC prevents voice quality issues in many situations. However, because the Cisco Unified CallManager does not have any information about the actual network topology, location-based CAC cannot account for temporary network conditions such as a partial loss of bandwidth along a certain path (for example, one link in a multilink interface), so its effectiveness has some limits. An alternate strategy using RSVP for CAC is not yet supported on DMVPN tunnel interfaces.

See the Cisco Unified Communications SRND Based on Cisco Unified CallManager 4.x ("Call Admission Control" chapter) for more information regarding the capabilities and limitations of topology-unaware CAC. See Location-Based Call Admission Control for an example of configuring location-based CAC for DMVPN.

Call Admission Control for Internet Key Exchange

Cisco IOS-based routers provide the capability of limiting the number of Internet Security Association and Key Management (ISAKMP) security associations (SA). This can be based on the number of existing ISAKMP SAs or on the percentage of critical system resources (CPU utilization and memory buffers). IKE CAC can prevent a spoke router from being overwhelmed if it is suddenly inundated with SA requests. This feature does not restrict voice bandwidth utilization directly but it can safeguard spoke router performance from a DMVPN perspective.

See the DMVPN Design Guide and Cisco IOS Security Configuration Guide, Release 12.4—Call Admission Control for IKE for more information.

Assessment of WAN Performance

According to recommendation G.114 of the International Telecommunication Union (ITU), if end-to-end latency is kept below 150 ms, most applications experience essentially transparent interactivity. In practice, satisfactory results are attainable end-to-end latency greater than 150 ms, but the user experience approaches dissatisfaction with latency greater than 300 ms.

To meet this recommendation for IP voice over DMVPN, Cisco recommends following the guidelines in the Voice and Video Enabled VPN SRND regarding WAN service provider performance. Specifically, WAN contribution to jitter should be less than or equal to 20 ms, one-way latency across the WAN should be less than or equal to 60 ms, and packet loss less should be less than or equal to 0.5 percent. WAN QoS, particularly low-latency queuing with guaranteed bandwidth for voice packets, is essential for voice over DMVPN deployments to meet these requirements.

Cisco recommends that you monitor the performance of your WAN to ensure that these guidelines are being met. The IP Service Level Agreements (IP SLA) command provides effective ways of measuring the performance of the WAN and the DMVPN for these recommendations. See G.729 Codec for more information.

Because of the dynamics of establishing a spoke-to-spoke tunnel and the effect on endpoint jitter buffers, Cisco recommends that the difference in latency between the spoke-hub-spoke path and the spoke-to-spoke path be minimized. See Mitigation of Path Switch Jitter Distortion for more information.

Mitigation of Path Switch Jitter Distortion

Even if the average one-way latency across a spoke-to-spoke DMVPN network is within the guidelines, it is possible that a brief period of higher latency will occur when the initial Real-Time Protocol (RTP) packets of a call traverse the spoke-hub-spoke path if that call triggers a new spoke-to-spoke tunnel. Test results show that momentary audio distortion because of jitter when the RTP packets switch from the spoke-hub-spoke path to the spoke-to-spoke path is avoided when the difference in latency between the paths is 50 ms or less. Latency differences between 50 ms and 100 ms produce acceptable audio quality, but if the latency difference is above 100 ms, a momentary audio distortion may be audible at the beginning of a call that triggers a new spoke-to-spoke tunnel. Latency differences higher than 100 ms are associated with increasingly noticeable momentary audio distortions at the time the spoke-to-spoke tunnel starts to be used.

In most cases, the latency difference between the spoke-hub-spoke path and the direct spoke-to-spoke path is under 100 ms. If the latency difference between paths on your network is more than 100 ms, consider one of the following approaches:

Seek WAN performance improvements through modifications of the WAN infrastructure; for example, increasing bandwidth at bottlenecks, validating QoS configurations, reducing the number of hops, and so on. Confirm WAN service provider conformance to contractual service level agreements.

Establish the spoke-hub-spoke path as the only path for packets to a particular remote spoke by creating a static NHRP association on the tunnel interface that directs traffic for that spoke to the hub.

Deploy a point-to-point tunnel interface or dedicated WAN link for traffic to another spoke router.

Establish light traffic between the involved spokes to keep the spoke-to-spoke tunnel active. The performance results in Performance Information confirm that there is only a small overhead associated with maintaining a spoke-to-spoke tunnel carrying little traffic. A simple IP SLA operation such as UDP echo repeated with an interval shorter than the NHRP hold time of the tunnel keeps the tunnel active.


Note On a small scale, techniques used to avoid transition to the spoke-to-spoke tunnel during a voice call are feasible, but those that require destination-specific configuration on the spoke routers diminish a key advantage of DMVPN in which any-to-any tunneling is supported without destination-specific configuration commands. Therefore, you should consider one of the other approaches if an excessive momentary jitter problem is being encountered with a large number of voice-supporting remote spokes.


Selection of Spoke Router Hardware

The spoke router needs to be adequately sized for the expected level of traffic. Considering the performance information in this document and in the Voice and Video Enabled IPsec VPN SRND and the Dynamic Multipoint VPN Design Guide documents, select a platform that supports your estimates of traffic through the DMVPN spoke router.

Table 3 lists the data and voice call capacity of three spoke router platforms used on the customer DMVPN network. For each of these platforms, two physical interfaces are shown to permit easier comparison between the platform performance of each router. The number of voice calls shown is based on 50 percent of the total bit rate divided by the bit rate per G.729 call with encryption, tunneling, and Layer 2 overhead.

Table 3 Comparison of Spoke Router Hardware Performance

Platform
Interface
Number of G.729 calls (50% voice, 50% Imix) single tunnel
Packet rate @ 60% CPU (kpps) single tunnel
Bit rate @ 60% CPU (Mbps), single tunnel

7206 NPE-G1 VAM2+

ATM OC-3

181

17.1

23.2

7206 NPE-G1 VAM2+

GigE

193

18.3

24.7

2821 EPII-PLUS

2xT1 PPP Multilink

38

4.8

4.9

2821 EPII-PLUS

GigE

41

5.1

5.2

1841 BPII-PLUS

Frame relay

11

1.4

1.4

1841 BPII-PLUS

FE

18

2.3

2.3


Monitoring DMVPN Performance for Voice

There are several useful techniques for monitoring DMVPN performance. Some require access to the spoke router console or network management interface, others can be used inside the local network. The following sections discuss these techniques.

Using IP SLA for Monitoring of DMVPN Performance

Useful statistics maintained on the DMVPN spoke routers help to assess the condition of the router and the DMVPN network. For example, QoS policy map statistics, DMVPN, IPsec and ISAKMP statistics, interface counters, and CPU and memory statistics. However, in the case of a DMVPN network operated by a service provider, access to the spoke router console may not be readily available.

IP SLA operations are useful for monitoring DMVPN performance without requiring access to the DMVPN spoke router consoles. You can figure them on a Cisco router that has access to the DMVPN network. Preferably, the source and destination routers should be as close as possible to the DMVPN spoke routers so that the results can more easily be attributed to DMVPN and underlying WAN network conditions.

The following information relevant to voice performance is available through IP SLA operations:

One-way and round-trip latency, jitter, lost and out-of-sequence packet measurements

Estimation of audio quality given measured network conditions

In addition, you may configure the measurements to automatically repeat on a regular basis and have SNMP traps associated with pre-determined thresholds.

For more information, see the "IP SLAs--Proactive Threshold Monitoring of IP SLAs Operations" section of the Cisco IOS IP SLAs Configuration Guide.

Realtime Monitoring Tool

The Cisco Unified CallManager Realtime Monitoring Tool (RTMT) provides visual tracking of many Cisco Unified CallManager performance and configuration metrics. It is helpful to use the RTMT when configuring and monitoring location-based CAC. Bandwidth in use and calls in progress per location can be displayed as shown in Figure 9.

Figure 9 Realtime Monitoring Tool

Spoke Router CLI

Table 4 lists commands that provide information relevant to voice traffic over a DMVPN network from the perspective of the spoke routers.

Table 4 Useful CLI Commands On The Spoke Router For Voice Over DMVPN

Command
Description

show dmvpn

Shows summary of DMVPN tunnels, including peers and uptime; aggregates output from related commands

show ip nhrp

Shows NHRP associations, timers, tunnel uptime

show crypto isakmp

Shows ISAKMP security associations

show policy-map interface

Shows QoS metrics including traffic levels for each queue, packet drops, queue size, and so on.

show interface

Shows interface status, traffic counts


Summary of Best Practices

This section summarizes the best practices for deploying Cisco Unified Communications using a DMVPN network for signaling and media transport between sites:

Adequately size the physical bandwidth between the spoke router and the WAN

Allocate adequate bandwidth for expected classes of traffic on the spoke router

Classify traffic with end-to-end Diffserv QoS markings

Dedicate sufficient bandwidth to low-latency queues along media transmission path

Configure location-based CAC to manage traffic using the low-latency queues

Assess and monitor DMVPN and WAN performance; minimize underlying WAN latency through selection of a Cisco Powered Network service provider

Deploy adequately powered spoke router platforms for the anticipated traffic load

Use available techniques to monitor DMVPN performance to ensure voice quality

Implementation and Configuration

This section provides configuration recommendations for deploying a Cisco Unified Communications voice solution with a DMVPN for intersite media and signaling transport. The recommendations are based on the assumption that an existing spoke-to-spoke DMVPN network is used. See the Dynamic Multipoint VPN Design Guide for more information on configuring DMVPN.

The configuration information in this section is based on the following assumptions:

The customer deploys and uses Skinny (SCCP)-based IP phones and MGCP-based gateways.

A single-cluster solution meets expected scaling requirements for these deployments.

The DMVPN spoke routers are maintained by the service providers and are already configured with QoS policies providing low-latency queuing for IP voice packets and Class-Based Weighted Fair Queuing (CBWFQ) for other types of traffic.

The LAN infrastructure within each spoke site follows Cisco recommendations and preserves the DSCP markings required to identify IP voice traffic.

Adequate bandwidth is provisioned to support the expected peaks in call traffic traversing the DMVPN network.

Topology

In the multisite WAN with centralized call processing model used for this set of customers, a single Cisco Unified CallManager cluster provides call processing for all locations on the IP telephony network. The Cisco Unified CallManager cluster usually resides at the main (central or headquarters) location, along with other devices such as phones and gateways. The remote locations contain additional devices, but no Cisco Unified CallManager servers. IP WAN links connect the remote locations to the main location.

Location-Based Call Admission Control

The Cisco Unified CallManager uses "locations" to implement CAC, which enables regulation of audio quality by limiting the amount of bandwidth that is available for calls over links between the locations. For more information, see the "Call Admission Control" section in the Cisco Unified CallManager System Guide.

If CAC is not used to limit the bandwidth on WAN links, an unlimited number of calls may be active on that link concurrently. This situation can cause the audio quality of all calls to degrade as the physical interface of the local or remote DMVPN uplink becomes oversubscribed.

The main location and each remote site has a unique name configured on the Cisco Unified CallManager. The low-latency queue on the spoke router drops packets that exceed the maximum bandwidth of its queue if all bandwidth allocated to other queues is in use (that is, congestion condition exists). Cisco recommends that you specify only the bandwidth dedicated to voice packets for each location.

The Cisco Unified CallManager automatically assumes a specific bandwidth value per call. Because of the overhead associated with mGRE, IPsec, and the Layer 2 protocol being used, a conversion is required when specifying the bandwidth for the location. To limit the number of calls with a higher encapsulation overhead than the Cisco Unified CallManager assumes, the available bandwidth needs to be adjusted downward to achieve the call limit desired. Table 5 lists the per-call bandwidth assumed by the Cisco Unified CallManager in the CAC calculation.

Table 5 Bandwidth Assumed By The Cisco Unified CallManager Location-Based CAC

Codec
Bandwidth

G.711

80 kbps

G.722

80 kbps

G.723

24 kbps

G.728

16 kbps

G.729

24 kbps

GSM

29 kbps

Wideband

272 kbps


There is no provision in the Cisco Unified CallManager GUI to perform this calculation, so it must be performed outside of the GUI, and the result of the calculation must be used in place of the actual voice bandwidth available. The same concept applies to video bandwidth.

Use the following steps to determine the bandwidth for each location:

1. Determine the available (dedicated) voice bandwidth this should be the bandwidth dedicated to the low-latency queue minus the bandwidth (with overhead) any other known sources of voice traffic using that queue

2. Divide this number by the voice bandwidth per call with encryption, tunneling and Layer 2 encapsulation (refer to the "Bandwidth Provisioning for WAN Edge QoS" section in the Voice and Video Enabled VPN SRND for information on calculating this value)

3. Multiply the result by the Cisco Unified CallManager default bandwidth (Table 6):


Table 6 and Table 7 list an example of the calculation in which the codec is G.729 and the Layer 2 protocol is Frame Relay. In this case, the low-latency queue on the spoke router is allocated 312 kbps. You need to determine this information for each physical interface type.

Table 6 Bandwidth Comparisons At Points In Network IPsec/GRE/FR

 
G.729 Payload
IP + UDP + RTP + payload
Ethernet + 802.1Q + IP + UDP + RTP + Payload
F/R + IPsec + GRE + IP + UDP + RTP + Payload

Bytes

20

60

80

140

Packet rate

50 pps

50 pps

50 pps

50 pps

Bandwidth

8 kbps

24 kbps

34 kbps

56 kbps


Table 7 shows some sample adjustments for location configuration because of encapsulation overhead (Frame Relay).

Table 7 Example Adjustments for Location Configuration

 
Cisco Unified CallManager Default for G.729
(from Table 5)
Actual Utilization with Encapsulation Overhead

Bandwidth per call leg

24 kbps

56 kbps

Calls per 312 kbps

312 / 24 = 13

312 / 56 = 5.57

Bandwidth to specify in location configuration for 312 kbps

312 K

5.57 * 24 kbps = 133 kbps


The value you need to input into the Cisco Unified CallManager Location Configuration screen in the Audio Bandwidth field is 133 kbps. If the actual bandwidth allocated to the low-latency queue, 312 kbps is entered without this adjustment, the Cisco Unified CallManager assumes 24 kbps per G.729 call leg, and allows 13 calls to use the 312 kbps bandwidth even though there is only capacity low-latency queue of the DMVPN uplink interface for 5 calls.

Figure 10 shows a location configuration example.

Figure 10 Location Configuration Example

The effectiveness of the location-based CAC feature requires strict control of traffic using the low-latency queue (in this case DSCP 46; per-hop behavior EF voice traffic) through the spoke router. If necessary, the bandwidth specified for a location can be further reduced to account for other traffic using the same code point or another Cisco Unified CallManager cluster directing calls to the DMVPN via the same tunnel interface.

After the locations have been configured, you need to assign to the endpoints (gateway, IP phone, and so on) that are located behind the WAN interface.

Automated Alternate Routing

Automated Alternate Routing (AAR) is a mechanism to provide an alternate route to remote sites when insufficient bandwidth is available to route the call over the primary route. In this deployment, Cisco recommends having the WAN as the primary route for media and a PSTN connection as the alternate route. There are no DMVPN-specific configuration steps for this feature. See the "Dial Plan/AAR" section of the Cisco Unified Communications SRND Based on Cisco Unified CallManager 4.x document for configuration details.

Music On Hold

Cisco recommends that you enable the SRST-based Music on Hold (MoH) feature to minimize bandwidth consumption on the DMVPN uplinks for sites where bandwidth is constrained. This feature requires setting up the MoH server in the main/headquarters site to multicast MoH for the IP phones that are local to that location; for example, on an address such as 239.192.240.1. The SRST router is then set to multicast an audio stream on the same address so that the IP phones near the SRST router in the remote site can access the MoH audio. In this case, multicast routing of the MoH stream in the main site needs to be blocked from traversing the WAN, either through a reduced time-to-live count or an access list. When a party at the remote site is placed on hold, the Cisco Unified CallManager instructs the IP phone at that site to listen to 239.192.240.1. It receives the music media stream from the local SRST router instead of from the main site music source across the WAN.

The following steps describe the configuration.


Step 1 Create the MoH server with the multicast option and limited hop count (see Figure 11) on the Cisco Unified CallManager.

Figure 11 Creation Of MoH Server

Step 2 Place the server in a media resource group (MRG). (See Figure 12.)

Figure 12 Insertion Of The MoH Server In The Media Resource Group

Step 3 Place the MRG in a media resource group list (See Figure 13.)

Figure 13 Insertion Of Media Resource Group Into Media Resource Group List

Step 4 Put the MoH server in its own region (with intersite codec selection set to G.711) via a dedicated device pool (See Figure 14.)

Figure 14 Creation Of A Dedicated Region For The MoH

Step 5 Enable MoH multicast with the same multicast destination address as on the Cisco Unified CallManager on the SRST router.

ccm-manager music-on-hold
call-manager-fallback
  moh music-on-hold.au
    multicast moh 239.192.240.1 port 16384 


If, as an alternative, you choose to have MoH media unicast across the DMVPN, you can reduce the bandwidth consumed by the music stream by using the G.729 codec and accepting a small reduction in audio fidelity compared with the default G.711 codec.

MoH servers automatically mark audio stream traffic the same as voice bearer traffic DSCP 46 (per-hop behavior EF). Therefore, as long as QoS is properly configured on the network, MoH streams receive the same classification and low-latency queuing treatment as phone-originated RTP media traffic.

For additional information and best practices for MoH, see the Cisco Unified Communications SRND based on Cisco Unified CallManager 4.x and the Integrating Cisco CallManager and Cisco SRST to Use Cisco SRST As a Multicast MOH Resource feature guide.

SRST and PSTN Gateway at Each Remote Site

Each remote site requires local PSTN access, both for PSTN-destined calls and as an alternate route for calls denied bandwidth because of location-based CAC. SRST and gateway configuration for DMVPN follows the standard procedures defined for the Multisite WAN with Centralized Call Control.

WAN Requirements

No configuration is required to verify the WAN requirements, but they are mentioned here because they may require measurement and coordination with the service provider as part of deployment. See Assessment of WAN Performance for best practices related to WAN performance.

Differentiated Services

To minimize latency and jitter, the service provider needs to provide a priority queue for voice packets outbound from the provider edge (PE routers) of the WAN. The bandwidth allocated to the priority queue should mirror the policy on the spoke router. No additional configuration for this requirement is needed because this is already the case for the DMVPN network of the target customer.

Quality of Service

In the case of the target government customers, the QoS configuration on the DMVPN spoke routers is administered by the primary WAN service provider and already supports dedicated bandwidth for voice in the form of a low-latency queue. The following example shows the QoS configuration for the tunnel interface from a spoke router to the DMVPN. The low-latency queue configuration allocates 312 kbps for DSCP 46 (per-hop behavior EF) packets (RTP voice packets) out of an overall T1 frame relay bandwidth of 768 kbps.

! Standard QoS config

class-map match-any PCLASS_COS2_SAA
 match  dscp 27 
class-map match-any PCLASS_COS3_SAA
 match  dscp 19 
class-map match-any PCLASS_COS1_SAA
 match  dscp 47 
class-map match-any PCLASS_COS4_SAA
 match  dscp 19 
class-map match-any PCLASS_COS4
 match not  dscp 1 
class-map match-any QCLASS_COS4
 match not  dscp 1 
class-map match-any PCLASS_COS2
 match  dscp af31 
class-map match-any QCLASS_COS3
 match  dscp 19 
 match  dscp af21 
class-map match-any PCLASS_COS3
 match  dscp af21 
class-map match-any QCLASS_COS2
 match  dscp 27 
 match  dscp af31 
class-map match-any QCLASS_COS1
 match  dscp 47 
 match  dscp ef 
class-map match-any PCLASS_COS1
 match  dscp ef 
class-map match-any PCLASS_NM
 match access-group 180
class-map match-any PCLASS_RP
 match  dscp cs6 
class-map match-any QCLASS_NM
 match  dscp cs6 


class-map match-any Prec_3
 match ip precedence 3 
class-map match-any Prec_2
 match ip precedence 2 
class-map match-any Prec_1
 match ip precedence 1 
class-map match-any Prec_0
 match ip precedence 0 
class-map match-any Prec_7
 match ip precedence 7 
class-map match-any Prec_6
 match ip precedence 6 
class-map match-any Prec_5
 match ip precedence 5 
class-map match-any Prec_4
 match ip precedence 4 
!         
!         
policy-map remark
 class Prec_0
  set ip dscp default
 class Prec_1
  set ip dscp af21
 class Prec_2
  set ip dscp af21
 class Prec_3
  set ip dscp af31
 class Prec_4
  set ip dscp af31
 class Prec_5
  set ip dscp ef
 class Prec_6
  set ip dscp af21
 class Prec_7
  set ip dscp af21


policy-map NM
 class PCLASS_RP
  police cir 8000 bc 8000 be 8000
    conform-action set-dscp-transmit cs6
    exceed-action set-dscp-transmit cs6
 class PCLASS_NM
  police cir 8000 bc 8000 be 8000
    conform-action set-dscp-transmit af21
    exceed-action set-dscp-transmit af21

policy-map COS4
 class PCLASS_COS4
  police cir 80000 bc 10000
    conform-action set-dscp-transmit default
    exceed-action set-dscp-transmit default
 class PCLASS_COS4_SAA
  set dscp default

policy-map COS3
 class PCLASS_COS3
  police cir 144000 bc 18000
    conform-action set-dscp-transmit af21
    exceed-action set-dscp-transmit af22
 class PCLASS_COS3_SAA
  set dscp af21

policy-map COS2
 class PCLASS_COS2
  police cir 280000 bc 35000
    conform-action set-dscp-transmit af31
    exceed-action set-dscp-transmit af32
 class PCLASS_COS2_SAA
  set dscp af31

policy-map COS1
 class PCLASS_COS1
  police cir 312000 bc 39000
    conform-action set-dscp-transmit ef
    exceed-action drop 
 class PCLASS_COS1_SAA
  set dscp ef

policy-map D_768K
 class QCLASS_NM
  bandwidth remaining percent 10
  random-detect dscp-based
  random-detect dscp 18   100   200   10   
  random-detect dscp 48   200   300   10   
  service-policy NM
 class QCLASS_COS1
  priority 312
   compress header ip rtp
  service-policy COS1
 class QCLASS_COS2
  bandwidth remaining percent 60
  random-detect dscp-based
  random-detect dscp 26   200   300   10   
  service-policy COS2
 class QCLASS_COS3
  bandwidth remaining percent 30
  random-detect dscp-based
  random-detect dscp 18   200   300   10   
  service-policy COS3

G.729 Codec

When deploying voice in a WAN environment, Cisco recommends the lower-bandwidth G.729 codec for any voice calls that traverse WAN links because this practice provides bandwidth savings compared to G.711 at the expense of a small reduction in audio fidelity.

To configure a codec for WAN calls, configure a "region" to represent points in the network that impact codec selection. To configure G.729 for intersite calls and G.711 for intrasite calls, a unique region can be created for each remote site, and a codec option can be selected both for calls within that region and for calls to other regions. See the Cisco Unified Communications SRND Based on Cisco Unified CallManager 4.x for more information.