Google Research

Optimizing user experience and operational efficiencies in wireless service provider networks

  • Deepak Kakadia
Reliability Engineering & System Safety, ScienceDirect, Elsevier B.V., Netherlands (2019), pp. 40

Abstract

Since the 1980s, and in particular since 1996, telecom operators and, more recently, mobile operators have faced increasingly fierce competition combined with flat subscriber growth and increased data usage. The result is tremendous downward pressure on profitability, forcing operators to differentiate themselves by offering network services with a better customer experience at lower operational cost. Wireless operators are challenged with measuring user experience, which is itself subjective, in a manner that accurately reflects the functional and emotional aspects of perceived quality, and with linking it to network resiliency, which characterizes how the network behaves as it responds to disruptions. Current network faults and alarms consider only device failures and not the actual impact on user experience. For instance, a failed router may not affect the user's experience because of redundancies built into the network. Studies to date have proposed methods and models that focus on specific aspects of user experience in wired and cellular networks. However, to the best of our knowledge, there is currently very little research that links poor user network experience to its root cause. Recent work in this area focuses on identifying what to measure and where in order to gauge subscriber QoE, on modeling, and on high-level concepts, but does not address realistic challenges or approaches that can be automated to materially improve customer experience at lower operational expense. There is a gap in how operators can automatically associate poor user experience, relevant network metrics and root causes with a suitable model that can be analyzed and optimized. We propose a general framework for a solution that links these entities together, with a quantified approach that optimizes user network experience by optimizing network resilience, using a model that can be analyzed and optimized with machine learning methods.
Directly applying existing machine learning algorithms to network telemetry data to identify root causes has proven ineffective in practice, because these algorithms are designed for prediction, classification and ranking, not for identifying causal relationships. The problem is further complicated by the fact that these algorithms make assumptions about the data, whereas in reality network data distributions vary wildly during network disturbances. The proposed general framework combines existing methods for anomaly detection with machine learning algorithms; the novel contribution centers on improving the accuracy of finding associated root causes by dynamically selecting the optimal machine learning algorithm based on network telemetry data features that are recomputed before, during and after network disturbances. The proposed approach then allows us to automate the time-consuming manual tasks of network engineers, who proactively monitor key performance metrics for anomalies and correlate them with other data sources to ultimately determine actionable insights for maintaining an acceptable level of user experience. We describe an example case study specific to a wireless provider environment, illustrating the potential viability of the approach with promising results from actual wireless operations data (approximately 8 million monthly subscribers). The prototype implementation was able to programmatically detect anomalies and identify potential root causes using different algorithms suitable for the given data and time frame, which dramatically increased the accuracy and efficiency of the small network engineering team and hence improved the user experience by improving network resiliency.
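
As a rough illustration of the idea of detecting anomalies in a telemetry window and then choosing an algorithm from that window's data features, a minimal Python sketch follows. It is not the paper's implementation: the abstract does not say which features, thresholds or algorithms are used, so the robust z-score detector, the feature set (sample count, standard deviation, skewness), the cut-off values and the algorithm labels (`rule_based`, `tree_ensemble`, `linear_model`) are all assumptions made here for illustration.

```python
from statistics import mean, median, pstdev


def robust_z_scores(values):
    """Distance of each point from the median, scaled by the MAD."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    # 1.4826 makes the MAD comparable to a standard deviation for normal data.
    scale = 1.4826 * mad if mad > 0 else 1.0
    return [(v - med) / scale for v in values]


def find_anomalies(values, threshold=3.5):
    """Indices whose robust z-score exceeds the (assumed) threshold."""
    return [i for i, z in enumerate(robust_z_scores(values)) if abs(z) > threshold]


def window_features(values):
    """Features recomputed per window (before, during or after a disturbance)."""
    mu = mean(values)
    sigma = pstdev(values)
    skew = 0.0 if sigma == 0 else mean([((v - mu) / sigma) ** 3 for v in values])
    return {"n": len(values), "std": sigma, "skew": skew}


def select_algorithm(features):
    """Hypothetical selection rule: the labels and cut-offs are illustrative only."""
    if features["n"] < 30:
        return "rule_based"      # too little data to fit a model
    if abs(features["skew"]) > 1.0:
        return "tree_ensemble"   # heavily skewed data, avoid normality assumptions
    return "linear_model"        # roughly symmetric data


if __name__ == "__main__":
    # Made-up latency samples (ms) with one spike at the end.
    latency_ms = [50, 51, 49, 50, 52, 48, 50, 51, 49, 50, 120]
    print("anomalous indices:", find_anomalies(latency_ms))
    print("selected:", select_algorithm(window_features(latency_ms)))
```

The point of the sketch is only the control flow: features are recomputed for each window, and the choice of algorithm follows from those features rather than being fixed in advance.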
