Energy Use Intensity

Data Science side project

All rights reserved by Thaiprayoon N.

Import data

This data set comes from Kaggle.com. It contains building simulation results from more than 250k EnergyPlus simulations, with 48 columns of data relevant to analyzing building energy consumption. The background of the data set is described below.
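A minimal sketch of the import step, assuming the Kaggle download is a CSV file. The helper name `load_simulations` and the column names in the in-memory sample are hypothetical stand-ins, not the real file layout.

```python
import io

import pandas as pd


def load_simulations(source):
    """Read simulation results from a CSV path or file-like object."""
    return pd.read_csv(source)


# Tiny in-memory stand-in for the real Kaggle CSV (hypothetical columns).
sample_csv = io.StringIO(
    "floor_area,window_to_wall_ratio,eui\n"
    "1000,0.3,120.5\n"
    "2500,0.4,98.2\n"
)

df = load_simulations(sample_csv)
print(df.shape)   # (rows, columns)
print(df.head())
```

With the real file, the same function would be called with the path of the downloaded CSV instead of the in-memory sample.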

Universal Design Space Exploration

Design Space Exploration (DSE) analysis techniques represent a data-centric approach to integrating performance analysis in early design phases when there is the greatest potential to cheaply improve the energy efficiency of a building. We focus on a novel extension of DSE called Universal Design Space Exploration (UDSE), which leverages massive databases of pre-simulated analysis that represent all possible outcomes of common analysis workflows. These databases, called Design Spaces, become “universal” when a single pre-simulated design space can be re-applied to future, unknown projects. Unlike current simulation methods, which require a design to exist before it can be analyzed and often take minutes or hours to simulate, UDSE leverages pre-simulation to deliver rapid and relevant insight as new designs are conceptualized. The data underpinning UDSE enables advanced statistical and Artificial Intelligence methods, allowing UDSE to deliver a greater understanding of the larger problem being explored rather than simply delivering analysis of several pre-conceived design options. We believe that UDSE can provide instantaneous, relevant analysis for all building design projects at negligible cost.

AutoBEM

Oak Ridge National Laboratory has developed a collection of software and algorithms, collectively referred to as “Automatic Building Energy Modeling” (AutoBEM), which allows building energy modeling of each building at large geographic scales (AutoBEM). Within AutoBEM, building properties are detected, inferred, or predicted as inputs to generate building energy models using OpenStudio and simulate these buildings using EnergyPlus. OpenStudio is a collection of software tools to support energy modeling in EnergyPlus, which is a physical building energy simulation engine (OpenStudio) (EnergyPlus).

See more on Kaggle

Check values in each column
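One way to check the columns is to collect the data type, the number of missing values, and the number of distinct values per column. The sketch below assumes a pandas DataFrame; `summarize_columns` and the demo columns are hypothetical names used only for illustration.

```python
import pandas as pd


def summarize_columns(df):
    """Return one row per column with its dtype, missing count, and unique count."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "n_missing": df.isna().sum(),
        "n_unique": df.nunique(),  # NaN is not counted as a distinct value
    })


# Small demo frame (hypothetical columns, not the real data set).
demo = pd.DataFrame({
    "floor_area": [1000.0, 2500.0, None],
    "climate_zone": ["4A", "4A", "5B"],
})

summary = summarize_columns(demo)
print(summary)
```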

EDA and Data preparation.

EDA: Exploratory Data Analysis

Cut off the outliers

Plot some columns to explore outliers

IQR (Interquartile Range)

$$Interquartile\ Range = Q_{3} - Q_{1}$$

Cut off outlier data that is below the $1^{st}$ quartile ($25^{th}$ percentile) and above the $3^{rd}$ quartile ($75^{th}$ percentile).

Ref: Interquartile Range

iqr_quartiles.png
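A minimal sketch of an IQR-based filter with pandas. It is not necessarily the exact rule used in this notebook: the fence multiplier `k` is an assumption. The common convention `k=1.5` keeps values between $Q_{1} - 1.5 \cdot IQR$ and $Q_{3} + 1.5 \cdot IQR$, while `k=0` cuts exactly at the two quartiles. The function name and the `eui` demo column are hypothetical.

```python
import pandas as pd


def remove_outliers_iqr(df, column, k=1.5):
    """Keep rows whose value in `column` lies within [Q1 - k*IQR, Q3 + k*IQR]."""
    q1 = df[column].quantile(0.25)
    q3 = df[column].quantile(0.75)
    iqr = q3 - q1
    lower = q1 - k * iqr
    upper = q3 + k * iqr
    # between() is inclusive on both ends by default
    return df[df[column].between(lower, upper)]


# Demo data (made up): one value is far away from the rest.
demo = pd.DataFrame({"eui": [90.0, 95.0, 100.0, 105.0, 110.0, 900.0]})

cleaned = remove_outliers_iqr(demo, "eui")
print(cleaned)
```

Note that cutting exactly at the quartiles (`k=0`) discards roughly half of the rows, so the multiplier is worth choosing deliberately.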

Min-Max Normalization
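Min-max normalization rescales each column to the range from 0 to 1 with $x' = (x - x_{min}) / (x_{max} - x_{min})$. The sketch below applies that formula column by column. `min_max_normalize` and the demo columns are hypothetical, and mapping a constant column to 0 is an assumption made to avoid division by zero.

```python
import pandas as pd


def min_max_normalize(df, columns):
    """Return a copy of df with the given columns rescaled to [0, 1]."""
    out = df.copy()
    for col in columns:
        col_min = out[col].min()
        col_max = out[col].max()
        span = col_max - col_min
        if span == 0:
            # Constant column: no spread to rescale, so map everything to 0.
            out[col] = 0.0
        else:
            out[col] = (out[col] - col_min) / span
    return out


# Demo data (made up).
demo = pd.DataFrame({
    "floor_area": [1000.0, 2000.0, 3000.0],
    "eui": [80.0, 100.0, 120.0],
})

scaled = min_max_normalize(demo, ["floor_area", "eui"])
print(scaled)
```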

Heat map plot to show the data correlation
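The values behind such a heat map are the pairwise correlation coefficients of the numeric columns. A minimal sketch of that computation with pandas follows. The demo columns and numbers are made up, and Pearson correlation is an assumption about the method.

```python
import pandas as pd

# Demo data (made up, not taken from the real data set).
demo = pd.DataFrame({
    "floor_area": [1000.0, 2000.0, 3000.0, 4000.0],
    "window_to_wall_ratio": [0.2, 0.5, 0.3, 0.6],
    "eui": [120.0, 110.0, 100.0, 90.0],
})

# Pairwise Pearson correlation between the numeric columns.
corr = demo.select_dtypes("number").corr(method="pearson")
print(corr.round(2))
```

The resulting square matrix can then be handed to a plotting call such as `seaborn.heatmap(corr, annot=True)` to draw the heat map.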

Predict using XGBoost

Show some of the predicted data

Mean Absolute Error

MAE: Mean Absolute Error

Source: wikipedia.org
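For $n$ observations with real values $y_{i}$ and predicted values $\hat{y}_{i}$, the MAE is $\frac{1}{n}\sum_{i=1}^{n} |y_{i} - \hat{y}_{i}|$. A small sketch of that calculation with NumPy follows. The function name and the demo numbers are made up for illustration.

```python
import numpy as np


def mean_absolute_error(y_true, y_pred):
    """Average of the absolute differences between real and predicted values."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean(np.abs(y_true - y_pred)))


# Absolute errors are 2, 3, and 0, so the mean is 5/3.
mae = mean_absolute_error([100.0, 110.0, 120.0], [98.0, 113.0, 120.0])
print(mae)
```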

Show the first predictions and the MAE

3D Scatter Plot of Predicted Data vs. Real Data vs. Error

Line Plot of Predicted Data, Real Data, and Error for 100 Observations