TY - JOUR
T1 - Two approaches for synthesizing scalable residential energy consumption data
AU - Liu, Xiufeng
AU - Iftikhar, Nadeem
AU - Huo, Huan
AU - Rongling, Li
AU - Nielsen, Per Sieverts
N1 - Publisher Copyright: © 2019 Elsevier B.V.
PY - 2019/6
Y1 - 2019/6
N2 - Many fields require scalable and detailed energy consumption data for different study purposes. However, due to privacy issues, it is often difficult to obtain sufficiently large datasets. This paper proposes two different methods for synthesizing fine-grained energy consumption data for residential households, namely a regression-based method and a probability-based method. They each use a supervised machine learning method, which trains models with a relatively small real-world dataset and then generates large-scale time series based on the models. This paper describes the two methods in details, including data generation process, optimization techniques, and parallel data generation. This paper evaluates the performance of the two methods, which compare the resulting consumption profiles with real-world data, including patterns, statistics, and parallel data generation in the cluster. The results demonstrate the effectiveness of the proposed methods and their efficiency in generating large-scale datasets.
AB - Many fields require scalable and detailed energy consumption data for different study purposes. However, due to privacy issues, it is often difficult to obtain sufficiently large datasets. This paper proposes two different methods for synthesizing fine-grained energy consumption data for residential households, namely a regression-based method and a probability-based method. They each use a supervised machine learning method, which trains models with a relatively small real-world dataset and then generates large-scale time series based on the models. This paper describes the two methods in details, including data generation process, optimization techniques, and parallel data generation. This paper evaluates the performance of the two methods, which compare the resulting consumption profiles with real-world data, including patterns, statistics, and parallel data generation in the cluster. The results demonstrate the effectiveness of the proposed methods and their efficiency in generating large-scale datasets.
KW - construction, environment and energy
UR - http://www.scopus.com/inward/record.url?scp=85060912381&partnerID=8YFLogxK
U2 - 10.1016/j.future.2019.01.045
DO - 10.1016/j.future.2019.01.045
M3 - Journal article
SN - 0167-739X
VL - 95
SP - 586
EP - 600
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -