This work is made available under the Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) License.
This means you are free to:
Under the following terms:
Notices:
For the avoidance of doubt and explanation of terms please refer to the full license notice and legal code.
If you wish to use any of the material from this report please cite as:
This work is (c) 2019 the authors.
You may not be reading the most recent version of this report. Please check:
the overall package documentation;
This work was supported by:
The NZ GREEN Grid household electricity demand study recruited a sample of c 25 households in each of two regions of New Zealand (Stephenson et al. 2017). The first sample was recruited in early 2014 and the second in early 2015. Research data includes:
NB: Version 1 of the data package does not include the time-use diaries.
This report provides an overview of the GREEN Grid project (Stephenson et al. 2017) research data.
Version 1.0 of the data package contains:
The project research sample comprises 44 households who were recruited via the local power lines companies in two areas: Taranaki starting in May 2014 and Hawkes Bay starting in November 2014.
Recruitment was via a non-random sampling method and a number of households were intentionally selected for their ‘complex’ electricity consumption (and embedded generation) patterns and appliances (Giraldo Ocampo 2015; Stephenson et al. 2017; Jack et al. 2018; Suomalainen et al. 2019).
The lines companies invited their own employees and those of other local companies to participate in the research and ~80 interested potential participants completed short or long forms of the Energy Cultures 2 household survey (Wooliscroft 2015). Households were then selected from this pool by the project team based on selection criteria relevant to the GREEN Grid project. These included:
After informed consent was obtained from each household, an electrician contracted by the two lines companies completed an appliance survey to record detailed information about the appliances in each house. This survey contained information about the number of appliances owned, brand, model number, efficiency and age. The electrician also installed the GridSpy units which recorded electricity power demand at a circuit level. The GridSpy units automatically upload the monitoring data to the GridSpy company’s secure database from where it was downloaded by the GREEN Grid research team.
As a result of this process the sample cannot be assumed to represent the population of customers (or employees) of any of the companies involved, nor the populations in each location (Stephenson et al. 2017).
Table 4.1 shows the number in each sample.
Location | n Households |
---|---|
Hawkes Bay | 20 |
Taranaki | 24 |
Table 4.2 shows the number for whom valid appliance and survey data is available in this data package. Note that even those which appear to lack appliance data may have sufficient survey data to deduce appliance ownership (see question numbers Q19_*
and Q40_*
).
Location | hasShortSurvey | hasLongSurvey | hasApplianceSummary | n Households |
---|---|---|---|---|
Hawkes Bay | NA | NA | NA | 1 |
Hawkes Bay | NA | NA | Yes | 1 |
Hawkes Bay | NA | Yes | Yes | 5 |
Hawkes Bay | Yes | NA | Yes | 13 |
Taranaki | NA | Yes | NA | 12 |
Taranaki | NA | Yes | Yes | 12 |
Figure 5.1 shows the total number of households for whom GridSpy data exists on a given date by sample. The plot includes any data, including partial data and suggests that for analytic purposes the period from April 2015 to March 2016 (indicated) would offer the maximum number of households.
Table 6.1 shows key attributes for the recruited sample. Note that two GridSpy monitors were re-used and so require new hhIDs to be set from the date of re-use using the linkID
variable. This is explained in more detail in the GridSpy processing report. Linkage between the survey and GridSpy data should therefore always use linkID
to avoid errors.
hhID | linkID | Location | surveyStartDate | nAdults | nChildren0_12 | nTeenagers13_18 | notes | r_stopDate | hasApplianceSummary |
---|---|---|---|---|---|---|---|---|---|
rf_06 | rf_06 | Taranaki | 2014-05-19 09:49:00 | 2 | 0 | 0 | NA | NA | NA |
rf_07 | rf_07 | Taranaki | 2014-06-23 21:25:00 | 2 | 2 | 0 | NA | NA | NA |
rf_08 | rf_08 | Taranaki | 2014-05-14 12:21:00 | 2 | 0 | 0 | NA | NA | NA |
rf_09 | rf_09 | Taranaki | 2014-06-19 11:33:00 | 2 | 1 | 0 | NA | NA | NA |
rf_10 | rf_10 | Taranaki | 2014-05-20 17:01:00 | 2 | 1 | 0 | NA | NA | Yes |
rf_11 | rf_11 | Taranaki | 2014-06-06 12:16:00 | 2 | NA | NA | NA | NA | Yes |
rf_12 | rf_12 | Taranaki | 2014-06-16 07:34:00 | 1 | 0 | 0 | NA | NA | NA |
rf_13 | rf_13 | Taranaki | 2014-05-14 12:07:00 | 2 | 1 | 1 | NA | NA | Yes |
rf_14 | rf_14 | Taranaki | 2014-06-10 11:51:00 | 1 | 1 | 0 | NA | NA | Yes |
rf_15 | rf_15a | Taranaki | 2014-06-17 15:38:00 | 1 | 0 | 0 | Disconnected 15/01/2015 | 2015-01-15 | NA |
rf_15 | rf_15b | Taranaki | 2014-05-16 17:36:00 | 2 | 0 | 0 | Re-used 15. Then disconnected 02/04/2016 | 2016-04-02 | NA |
rf_16 | rf_16 | Taranaki | 2014-06-10 15:29:00 | 2 | 0 | 0 | NA | NA | NA |
rf_17 | rf_17a | Taranaki | 2014-05-14 20:04:00 | 2 | 3 | 1 | Unusual & specialist energy tech configuration. Disconnected 28/03/2016. | 2016-03-28 | NA |
rf_17 | rf_17b | Taranaki | 2014-05-22 09:16:00 | NA | NA | NA | Re-used 17 | NA | NA |
rf_18 | rf_18 | Taranaki | 2014-05-14 11:20:00 | 2 | 1 | 0 | NA | NA | NA |
rf_19 | rf_19 | Taranaki | 2014-05-22 13:37:00 | 1 | 0 | 0 | NA | NA | Yes |
rf_20 | rf_20 | Taranaki | 2014-05-14 11:46:00 | 2 | 2 | 0 | NA | NA | NA |
rf_21 | rf_21 | Taranaki | 2014-05-20 16:30:00 | 2 | 0 | 0 | NA | NA | Yes |
rf_22 | rf_22 | Taranaki | 2014-05-14 11:39:00 | 2 | 0 | 0 | NA | NA | Yes |
rf_23 | rf_23 | Taranaki | 2014-05-15 15:51:00 | 1 | 0 | 0 | NA | NA | Yes |
rf_24 | rf_24 | Taranaki | 2014-05-14 11:36:00 | 2 | 2 | 0 | NA | NA | Yes |
rf_25 | rf_25 | Taranaki | 2014-06-18 13:57:00 | 1 | 1 | 0 | NA | NA | Yes |
rf_26 | rf_26 | Taranaki | 2014-06-11 13:34:00 | 2 | 0 | 0 | NA | NA | Yes |
rf_27 | rf_27 | Taranaki | 2014-07-03 15:37:00 | 2 | 1 | 1 | NA | NA | Yes |
rf_28 | rf_28 | Hawkes Bay | 2015-01-20 12:15:00 | 2 | 2 | 0 | NA | NA | Yes |
rf_29 | rf_29 | Hawkes Bay | 2015-02-10 11:39:00 | 2 | 1 | 0 | NA | NA | Yes |
rf_30 | rf_30 | Hawkes Bay | 2015-02-03 10:58:00 | 2 | 0 | 2 | NA | NA | Yes |
rf_31 | rf_31 | Hawkes Bay | 2015-02-09 08:05:00 | 3 | 2 | 0 | NA | NA | Yes |
rf_32 | rf_32 | Hawkes Bay | 2015-02-09 08:35:00 | 2 | 2 | 0 | NA | NA | Yes |
rf_33 | rf_33 | Hawkes Bay | 2015-02-09 16:05:00 | 2 | 1 | 1 | NA | NA | Yes |
rf_34 | rf_34 | Hawkes Bay | 2015-01-06 10:50:00 | 3 | 0 | 0 | NA | NA | Yes |
rf_35 | rf_35 | Hawkes Bay | 2015-02-05 16:00:00 | 2 | 2 | 0 | NA | NA | Yes |
rf_36 | rf_36 | Hawkes Bay | 2015-02-10 20:25:00 | 1 | 0 | 2 | NA | NA | Yes |
rf_37 | rf_37 | Hawkes Bay | 2015-02-09 18:49:00 | 2 | 2 | 0 | NA | NA | Yes |
rf_38 | rf_38 | Hawkes Bay | 2015-02-05 15:30:00 | 2 | 2 | 0 | NA | NA | Yes |
rf_39 | rf_39 | Hawkes Bay | 2015-02-05 15:43:00 | 3 | 0 | 1 | NA | NA | Yes |
rf_40 | rf_40 | Hawkes Bay | NA | 2 | 0 | 0 | NA | NA | Yes |
rf_41 | rf_41 | Hawkes Bay | 2015-01-12 13:16:00 | 2 | 3 | 0 | NA | NA | Yes |
rf_42 | rf_42 | Hawkes Bay | 2015-02-10 18:04:00 | 2 | 3 | 0 | NA | NA | Yes |
rf_43 | rf_43 | Hawkes Bay | NA | 2 | 1 | 0 | NA | NA | NA |
rf_44 | rf_44 | Hawkes Bay | 2015-02-04 20:47:00 | 2 | 2 | 1 | NA | NA | Yes |
rf_45 | rf_45 | Hawkes Bay | 2015-02-09 13:26:00 | 2 | 3 | 0 | NA | NA | Yes |
rf_46 | rf_46 | Hawkes Bay | 2014-12-19 08:40:00 | 2 | 1 | 0 | very large number of circuits including voltage and reactive (imaginary) power and possible typos or relabelling? | NA | Yes |
rf_47 | rf_47 | Hawkes Bay | 2015-01-06 09:01:00 | 3 | 0 | 1 | NA | NA | Yes |
We have provided a number of code examples for suggestions on how to load, further process and analyse the data.
We maintain a known data issues list via our GitHub repository. If you think there is a data issue please check the repo list first and then add a new one if appropriate.
Analysis completed in 20.22 seconds ( 0.34 minutes) using knitr in RStudio with R version 3.5.2 (2018-12-20) running on x86_64-apple-darwin15.6.0.
## R version 3.5.2 (2018-12-20)
## Platform: x86_64-apple-darwin15.6.0 (64-bit)
## Running under: macOS High Sierra 10.13.6
##
## Matrix products: default
## BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
## LAPACK: /Library/Frameworks/R.framework/Versions/3.5/Resources/lib/libRlapack.dylib
##
## locale:
## [1] en_NZ.UTF-8/en_NZ.UTF-8/en_NZ.UTF-8/C/en_NZ.UTF-8/en_NZ.UTF-8
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] skimr_1.0.7 readxl_1.3.1 kableExtra_1.1.0
## [4] lubridate_1.7.4 readr_1.3.1 ggplot2_3.2.1
## [7] data.table_1.12.2 bookdown_0.13 rmarkdown_1.15
## [10] here_0.1 GREENGridData_1.0
##
## loaded via a namespace (and not attached):
## [1] progress_1.2.2 tidyselect_0.2.5 xfun_0.9
## [4] purrr_0.3.2 reshape2_1.4.3 colorspace_1.4-1
## [7] vctrs_0.2.0 htmltools_0.3.6 viridisLite_0.3.0
## [10] yaml_2.2.0 rlang_0.4.0 pillar_1.4.2
## [13] glue_1.3.1 withr_2.1.2 lifecycle_0.1.0
## [16] plyr_1.8.4 stringr_1.4.0 munsell_0.5.0
## [19] gtable_0.3.0 cellranger_1.1.0 rvest_0.3.4
## [22] evaluate_0.14 labeling_0.3 knitr_1.24
## [25] highr_0.8 Rcpp_1.0.2 scales_1.0.0
## [28] backports_1.1.4 webshot_0.5.1 hms_0.5.1
## [31] packrat_0.5.0 digest_0.6.20 stringi_1.4.3
## [34] dplyr_0.8.3 grid_3.5.2 rprojroot_1.3-2
## [37] tools_3.5.2 magrittr_1.5 lazyeval_0.2.2
## [40] tibble_2.1.3 tidyr_1.0.0 crayon_1.3.4
## [43] pkgconfig_2.0.2 zeallot_0.1.0 xml2_1.2.2
## [46] prettyunits_1.0.2 assertthat_0.2.1 httr_1.4.1
## [49] rstudioapi_0.10 R6_2.4.0 compiler_3.5.2
Allaire, JJ, Yihui Xie, Jonathan McPherson, Javier Luraschi, Kevin Ushey, Aron Atkins, Hadley Wickham, Joe Cheng, and Winston Chang. 2018. Rmarkdown: Dynamic Documents for R. https://CRAN.R-project.org/package=rmarkdown.
Anderson, Ben, and David Eyers. 2018. GREENGridData: Processing Nz Green Grid Project Data to Create a ’Safe’ Version for Data Archiving and Re-Use. https://github.com/CfSOtago/GREENGridData.
Csárdi, Gábor, and Rich FitzJohn. 2016. Progress: Terminal Progress Bars. https://CRAN.R-project.org/package=progress.
Dowle, M, A Srinivasan, T Short, S Lianoglou with contributions from R Saporta, and E Antonyan. 2015. Data.table: Extension of Data.frame. https://CRAN.R-project.org/package=data.table.
Giraldo Ocampo, Diana. 2015. “Developing an Energy-Related Time-Use Diary for Gaining Insights into New Zealand Households’ Electricity Consumption.” Master’s thesis, Centre for Sustainability: University of Otago. http://hdl.handle.net/10523/5957.
Grolemund, Garrett, and Hadley Wickham. 2011. “Dates and Times Made Easy with lubridate.” Journal of Statistical Software 40 (3): 1–25. http://www.jstatsoft.org/v40/i03/.
Jack, M. W., K. Suomalainen, J. J. W. Dew, and D. Eyers. 2018. “A Minimal Simulation of the Electricity Demand of a Domestic Hot Water Cylinder for Smart Control.” Applied Energy 211: 104–12. https://www.sciencedirect.com/science/article/pii/S0306261917316197.
Müller, Kirill. 2018. Hms: Pretty Time of Day. https://CRAN.R-project.org/package=hms.
R Core Team. 2016. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.
Stephenson, Janet, Rebecca Ford, Nirmal-Kumar Nair, Neville Watson, Alan Wood, and Allan Miller. 2017. “Smart Grid Research in New Zealand–A Review from the GREEN Grid Research Programme.” Renewable and Sustainable Energy Reviews 82 (1): 1636–45. https://doi.org/10.1016/j.rser.2017.07.010.
Suomalainen, Kiti, David Eyers, Rebecca Ford, Janet Stephenson, Ben Anderson, and Michael Jack. 2019. “Detailed Comparison of Energy-Related Time-Use Diaries and Monitored Residential Electricity Demand.” Energy and Buildings 183: 418–27. https://doi.org/10.1016/j.enbuild.2018.11.002.
Wickham, Hadley. 2007. “Reshaping Data with the reshape Package.” Journal of Statistical Software 21 (12): 1–20. http://www.jstatsoft.org/v21/i12/.
———. 2009. Ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. http://ggplot2.org.
Wickham, Hadley, and Jennifer Bryan. 2017. Readxl: Read Excel Files. https://CRAN.R-project.org/package=readxl.
Wickham, Hadley, and Romain Francois. 2016. Dplyr: A Grammar of Data Manipulation. https://CRAN.R-project.org/package=dplyr.
Wickham, Hadley, Jim Hester, and Romain Francois. 2016. Readr: Read Tabular Data. https://CRAN.R-project.org/package=readr.
Wooliscroft, B. 2015. “National Household Survey of Energy and Transportation: Energy Cultures Two.” Centre for Sustainability: University of Otago. http://hdl.handle.net/10523/5634.
Xie, Yihui. 2016a. Bookdown: Authoring Books and Technical Documents with R Markdown. Boca Raton, Florida: Chapman; Hall/CRC. https://github.com/rstudio/bookdown.
———. 2016b. Knitr: A General-Purpose Package for Dynamic Report Generation in R. https://CRAN.R-project.org/package=knitr.
Zhu, Hao. 2018. KableExtra: Construct Complex Table with ’Kable’ and Pipe Syntax. https://CRAN.R-project.org/package=kableExtra.