Abstract
With the widely use of smart meters in the energy sector, anomaly detection becomes a crucial mean to study the unusual consumption behaviors of customers, and to discover unexpected events of using energy promptly. Detecting consumption anomalies is, essentially, a real-time big data analytics problem, which does data mining on a large amount of parallel data streams from smart meters. In this paper, we propose a supervised learning and statistical-based anomaly detection method, and implement a Lambda system using the in-memory distributed computing framework, Spark and its extension Spark Streaming. The system supports not only iterative refreshing the detection models from scalable data sets, but also real-time anomaly detection on scalable live data streams. This paper empirically evaluates the system and the detection algorithm, and the results show the effectiveness and the scalability of the lambda detection system.
Original language | English |
---|---|
Title of host publication | 18th International Conference on Big Data Analytics and Knowledge Discovery, DaWaK 2016 |
Editors | Sanjay Madria, Takahiro Hara |
Number of pages | 14 |
Publisher | Springer |
Publication date | 2016 |
Pages | 193-209 |
ISBN (Print) | 978-3-319-43945-7 |
ISBN (Electronic) | 978-3-319-43946-4 |
DOIs | |
Publication status | Published - 2016 |
Event | 18th International Conference on Big Data Analytics and Knowledge Discovery - Porto, Portugal Duration: 6 Sept 2016 → 8 Sept 2016 Conference number: 18 |
Conference
Conference | 18th International Conference on Big Data Analytics and Knowledge Discovery |
---|---|
Number | 18 |
Country/Territory | Portugal |
City | Porto |
Period | 06/09/16 → 08/09/16 |
Keywords
- it