Effects of Packet Loss and Jitter on VoLTE Call Quality
Elena Cipressi, Maria Luisa Merani

TL;DR
This study analyzes how packet loss and jitter affect VoLTE call quality, comparing AMR and AMR-WB codecs over millions of calls using an enhanced E-Model, revealing differences in robustness and quality dependence.
Contribution
It provides a comparative analysis of VoLTE call quality with two codecs, highlighting their robustness and the different relationships between packet loss and call quality.
Findings
AMR-WB calls are more robust against network impairments.
Call quality dependence on packet loss is exponential for AMR.
Call quality dependence on packet loss is nearly linear for AMR-WB.
Abstract
This work performs a preliminary, comparative analysis of the end-to-end quality guaranteed by Voice over LTE (VoLTE), examining several millions of VoLTE calls that employ two popular speech audio codecs, namely, Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate WideBand (AMR-WB). To assess call quality, VQmon, an enhanced version of the standardized E-Model, is utilized. The study reveals to what extent AMR-WB based calls are more robust against network impairments than their narrowband counterparts; it further shows that the dependence of call quality on the packet loss rate is approximately exponential when the AMR codec is used, whereas it is nearly linear for the AMR-WB codec.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWireless Communication Networks Research · Advanced MIMO Systems Optimization · Advanced Wireless Network Optimization
Effects of Packet Loss and Jitter on
VoLTE Call Quality
Elena Cipressi1,2 and Maria Luisa Merani1
1Dipartimento di Ingegneria “Enzo Ferrari”, University of Modena and Reggio Emilia, Modena, Italy
2Empirix Inc., Modena, Italy
email: elena.cipressi, [email protected]
Abstract.
This work performs a preliminary, comparative analysis of the end-to-end quality guaranteed by Voice over LTE (VoLTE), examining several millions of VoLTE calls that employ two popular speech audio codecs, namely, Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate WideBand (AMR-WB). To assess call quality, VQmon®, an enhanced version of the standardized E-Model, is utilized. The study reveals to what extent AMR-WB based calls are more robust against network impairments than their narrowband counterparts; it further shows that the dependence of call quality on the packet loss rate is approximately exponential when the AMR codec is used, whereas it is nearly linear for the AMR-WB codec.
Call Detail Record; Network Measurements; Speech Codecs; Voice Quality; VoLTE
††journalyear: 2018††conference: CoNEXT ’18; December 4–7, 2018; Heraklion/Crete, Greece
1. Introduction and Background
VoLTE is a recent technology, allowing to perform voice calls in LTE cellular systems via the IP Multimedia Subsystem (IMS), an all-IP architectural framework integrated on top of the LTE core network. As for the majority of Voice over IP (VoIP) services, in VoLTE digital voice samples are properly placed in Real-Time Transport Protocol (RTP) packets and constitute the data session of the voice call, whereas the signaling of the call is handled via the Session Initiation Protocol (SIP). Various codecs can be employed to produce the digital version of the speech: unlike the Public Switched Telephone Network (PSTN), where the codec choice is limited to equipments covering voice components within the Hz frequency window, LTE can also leverage on wideband codecs, that span a wider frequency range, from to Hz and higher frequencies; this guarantees an increased intelligibility and naturalness to the reconstructed speech, and ultimately an improved quality being experienced by VoLTE users. AMR (3GPP, 2018c) and AMR-WB (3GPP, 2018d) are among the most commonly deployed codecs in contemporary LTE networks, and as such are the subject of the current investigation. As regards end-to-end voice quality assessment, VQmon® (Clark, 2001) is the objective, non-intrusive tool employed in this study; it is an extension of the well-established E-Model (ITU-T, 2005), and exactly like the latter, it provides an output value between [math] and , the so-called Rating factor, R-factor for short, to grade the overall call quality. A suitable correspondence allows to map the R-factor to the more popular Mean Opinion Score (MOS) on a to scale. Such correspondence has been recently updated for the Wide-band version of the E-Model (ITU-T, 2015), where the R-factor can reach values up to , as it is the case for the AMR-WB codec. On the literature rim, numerous studies have been recently conducted to understand VoLTE behavior. In (Elnashar et al., 2017), the performance of VoLTE and of Circuit-Switched Fall Back was benchmarked, pinpointing what values of call set up delay can be achieved under various radio conditions. In (Abichandani et al., 2017), the authors’ objective was to understand whether the adoption of a lower bit rate of the AMR-WB codec could result in an augmented coverage for VoLTE users. Differently from the previous contributions, the aim of this paper is to discern the dependency of VoLTE call quality on network impairments, i.e., packet loss and jitter, and to grasp the influence that different codec choices, namely, AMR or AMR-WB, have on end-to-end speech quality. Accordingly, a significantly large set of VoLTE calls is examined: the network conditions they encountered were recorded and their quality estimated via VQmon®. The obtained results allow to realistically compare the behavior of AMR and WB-AMR codecs and to shed light on VoLTE performance.
2. Setting
We conducted this study on over ten millions of VoLTE calls, collected within a single commercial LTE network from an urban area, over the first half of 2018. Several relevant information and metrics about the RTP voice flows were captured by a proprietary probe on the interface(3GPP, 2018b), anonymized and aggregated in a .csv file. Positioning the tapping point at the interface allowed to collect call detail records for both directions, i.e., for the voice flow being generated by the calling party and for the flow originated by the callee. We chose to analyze the worst among the two directions, i.e., the uplink, therefore capturing the negative effects that the Radio Access Network (RAN) traversal has on voice packets. For each flow, the examined records were the number of transmitted packets, the number of received packets, the average and maximum jitter, , the R-factor computed according to VQmon®, and the type of codec being used. Moreover, a jitter buffer emulator (JBE) was instantiated, in order to realistically model the compensation that takes place receiver side, smoothing out the delay variations that voice packets exhibit after traversing the network. The emulator forced a delay on packets that arrived early, and immediately forwarded late packets. In our system, the JBE was set to receive initial packets with a ms delay, then to dynamically modify its play-out delay according to the average jitter of the previous packets. Under these assumptions, we were able to estimate the packet loss rate, , evaluating the ratio of the number of lost/excessively delayed packets to the total number of received packets after the JBE. Filtering out invalid data and neglecting the calls that either employ the Enhanced Voice Services (EVS) wideband codec (3GPP, 2018a) or alternative, less popular speech codecs, we were left with voice flows. They were further distinguished in AMR and AMR-WB based, amounting to and , respectively.
3. Results and Discussion
Figs.1(a) and 1(b) display the R-factor of the examined flows as a function of and , for the AMR and AMR-WB case, respectively. The comparison between the two figures indicates that the adoption of the AMR-WB codec guarantees higher R-factor values and leads to a less pronounced dependence of it on and . The jagged behavior appearing in Fig.1(b) is due to the lack of points in the region of high values of packet loss rate and maximum jitter; as a matter of fact, the examined LTE network is well designed, so that the values such network impairments exhibit are modest. Moreover, both figures demonstrate that it is the packet loss rate to most significantly influence the end-to-end quality of VoLTE calls. Next, Fig. 2 reports the average R-factor and its standard deviation values over uniform intervals of packet loss rate, when varies between [math] and .
It is interesting to observe the sharp decay that the average R-factor displays for the AMR case, whereas the decrease is less pronounced for the AMR-WB case. The standard deviation tends to increase for increasing values of the packet loss rate, but this has to be mainly ascribed to a decreasing size of the population of samples. For the AMR case, this figure also shows the first order, exponential fit performed on the set of points, , where represents the median value of in every interval and the value of the corresponding average R-factor. We have used Levenberg-Marquardt algorithm, choosing as the fit function (dashed line). The , and values are , and , respectively. The fitting is truly satisfying, and leads to the conclusion that the quality dependence of AMR VoLTE calls on replicates the Quality of Experience (QoE) exponential dependence on the Quality of Service (QoS) parameters first outlined in (Fiedler et al., 2010). This figure also reports the linear regression for the AMR-WB case, using as the fit function (solid line). The and values are and , respectively. The coefficient of determination that quantifies the fitting goodness (a.k.a. ) is equal to , demonstrating that the linear fit is adequate. Although not reported on the figure, we verified that the exponential fit is not satisfying for the AMR case.
4. Conclusions
Examining over ten million VoLTE calls, this study has demonstrated to what extent AMR-WB flows are more robust than AMR ones against packet losses and jitter. Moreover, the analysis has revealed that the R-factor dependency on the packet loss rate is successfully captured by an exponential law for the calls performed via the AMR codec, whereas it follows a linear decay trend for the AMR-WB case.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1)
- 23GPP (2018 a) 3GPP. June 2018 a. Codec for Enhanced Voice Services (EVS); General overview. 3GPP TS 26.441, V.15.0.0 (June 2018).
- 33GPP (2018 b) 3GPP. June 2018 b. IP Multimedia Subsystem (IMS); Multimedia Telephony; Media Handling and Interaction. 3GPP TS 26.114, V.15.3.0 (June 2018).
- 43GPP (2018 c) 3GPP. June 2018 c. Mandatory speech CODEC speech processing functions; AMR speech Codec; General description. 3GPP TS 26.071, V 15.0.0 (June 2018).
- 53GPP (2018 d) 3GPP. June 2018 d. Speech codec speech processing functions; Adaptive Multi-Rate - Wideband (AMR-WB) speech codec; General description. 3GPP TS 26.171 V 15.0.0 (June 2018).
- 6Abichandani et al . (2017) J. Abichandani, J. Baenke, M. S. Irizarry, N. Saxena, P. Vyas, S. Prasad, S. Mada, and Y. Z. Tafesse. 2017. A Comparative Study of Voice Quality and Coverage for Voice over Long Term Evolution Calls Using Different Codec Mode-sets. IEEE Access 5 (2017), 10315–10322. https://doi.org/10.1109/ACCESS.2017.2707080 · doi ↗
- 7Clark (2001) A.D. Clark. 2001. Modeling the effects of burst packet loss and recency on subjective voice quality. Internet Telephony Workshop (IP-Tel) (2001).
- 8Elnashar et al . (2017) A. Elnashar, M. A. El-Saidny, and M. Mahmoud. 2017. Practical Performance Analyses of Circuit-Switched Fallback and Voice Over LTE. IEEE Transactions on Vehicular Technology 66, 2 (Feb 2017), 1748–1759. https://doi.org/10.1109/TVT.2016.2560962 · doi ↗
