肖明坤, 王吉顺. Study of big data cleaning solution based on DPI system of operators[J]. 2016, 29(2): 40-43. DOI: 10.13992/j.cnki.tetas.2016.02.011.
Big Data Cleaning Solution Based on the Fixed-Network DPI System of Telecom Operators
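Summary
This paper proposes a cleaning scheme for HTTP information from telecom operators' fixed networks.
Verified through a pilot deployment on the live network, the scheme achieves a cleaning rate above 80%,
greatly reducing storage space and network transmission bandwidth,
and offers a useful reference for operators launching fixed-network big data services.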
Abstract
This paper presents a data cleaning solution based on raw DPI data provided by operators. Deployment on a live network verified that the data cleaning rate can reach 80% without affecting big data services. The solution is valuable for operators deploying big data services, since it saves considerable bandwidth and storage.