Curious about applying machine learning to solve real industrial problems? Here, I demonstrate how I leveraged my data science knowledge to develop an AI solution for a real company.
In this post, we will walk through the various stages of the project to understand our starting point and our end goal. For the sake of readability, we will focus on the main aspects of developing the AI rather than delving into the technical details of data science or data engineering.
If you would like to know more about this project, I encourage you to contact me directly.
ARCHITECTURE
To develop our machine learning algorithm, it is essential to analyze all available tools that may help us reach our goals:
OPC UA SERVER
Production data repository, which functions as a database. This allows for easy extraction of historical data to process according to our needs, making it a perfect data source.
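As a minimal sketch of what such a historical extraction could look like in Python (assuming the open-source FreeOpcUa `opcua` package; the endpoint address and node id below are purely hypothetical, not the company's real ones):

```python
from datetime import datetime, timedelta

from opcua import Client  # pip install opcua (FreeOpcUa)

# Hypothetical endpoint and node id, for illustration only
client = Client("opc.tcp://plc.example.local:4840")
client.connect()
try:
    node = client.get_node("ns=2;s=Machine1.EnergyConsumption")
    end = datetime.utcnow()
    start = end - timedelta(days=30)
    # Read the stored history of this node for the last 30 days
    for datavalue in node.read_raw_history(start, end):
        print(datavalue.SourceTimestamp, datavalue.Value.Value)
finally:
    client.disconnect()
```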
JUPYTER
Programming environment where we will train and deploy our AI forecaster. Python will be used as the main programming language for machine learning purposes.
SQL SERVER
Relational database where we will store the processed production data and the forecasts generated by our AI, making them available to the rest of the pipeline.
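As a hedged sketch of how that data could be pushed from Python into SQL Server (assuming SQLAlchemy with the pyodbc driver; the connection string, table and column names are hypothetical):

```python
import pandas as pd
from sqlalchemy import create_engine

# Hypothetical connection string and table name, for illustration only
engine = create_engine(
    "mssql+pyodbc://user:password@SQLSERVER01/EnergyDB"
    "?driver=ODBC+Driver+17+for+SQL+Server"
)

processed = pd.DataFrame(
    {
        "timestamp": pd.date_range("2023-01-01", periods=3, freq="h"),
        "machine_id": ["M1", "M1", "M1"],
        "energy_kwh": [120.5, 118.2, 125.0],
    }
)

# Append the processed rows to a table that Power BI can later read from
processed.to_sql("energy_history", engine, if_exists="append", index=False)
```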
POWER BI
Data visualization tool that will help us connect the world of big data with the company's daily activities, making every milestone visible to everyone at any moment.
DATA PIPELINE
By putting together all the pieces of the puzzle, we can create a data pipeline, which can be summarized in the following outline:
DATASET
To enhance comprehension, below is an example of the dataset’s format and size, where we can highlight:
AI FORECASTER
We must take into account how critical the training stage of our algorithm is. Remember:
"Teach them how to think, not what to think."
This quote reflects the main goal when training an AI. Too often, people focus excessively on the training data and end up overfitting the model, which means the system is biased towards that data and will not return reliable forecasts when faced with new data.
For this reason, thorough data preprocessing and a well-designed training procedure are needed; a minimal sketch of such a setup follows the dataset description below.
This is how our dataset looks right before training the ML algorithm, where each record contains:
- The time frame during which the data was collected.
- The identifier of the affected machine.
- Production during this specific time frame.
- Energy consumption during this specific time frame.
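To make the training stage more concrete, here is a minimal, hedged sketch of such a setup using scikit-learn. The data is a synthetic stand-in for the real dataset, and the column names, features and model are illustrative assumptions, not necessarily those of the real project:

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import cross_val_score, train_test_split

# Synthetic stand-in for the real dataset; column names are hypothetical
rng = np.random.default_rng(42)
n = 500
df = pd.DataFrame(
    {
        "timestamp": pd.date_range("2023-01-01", periods=n, freq="h"),
        "machine_id": rng.choice(["M1", "M2"], size=n),
        "production": rng.uniform(50, 200, size=n),
    }
)
df["energy_consumption"] = 2.5 * df["production"] + rng.normal(0, 10, size=n)

# Simple feature set: production volume, machine identity and hour of day
X = pd.get_dummies(
    df.assign(hour=df["timestamp"].dt.hour)[["production", "hour", "machine_id"]],
    columns=["machine_id"],
)
y = df["energy_consumption"]

# Keep a test set the model never sees during training
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

model = RandomForestRegressor(n_estimators=200, random_state=42)

# Cross-validation on the training data estimates how well the model
# generalizes, which is our safeguard against overfitting
cv_scores = cross_val_score(model, X_train, y_train, cv=5, scoring="r2")
print("Cross-validated R2:", round(cv_scores.mean(), 3))

model.fit(X_train, y_train)
```

The key point is that the cross-validation score and the held-out test set let us judge the model on data it has never seen, which is exactly the safeguard against overfitting mentioned above.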
Below we see the output of our AI after the training stage, where we compare real consumption data against the AI's predictions:
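As a hedged illustration of how such a comparison could be plotted (continuing the training sketch above, so `model`, `X_test` and `y_test` are assumed from there):

```python
import matplotlib.pyplot as plt

# model, X_test and y_test come from the training sketch above
y_pred = model.predict(X_test)

plt.figure(figsize=(10, 4))
plt.plot(y_test.to_numpy(), label="Real consumption")
plt.plot(y_pred, label="AI prediction")
plt.xlabel("Sample")
plt.ylabel("Energy consumption")
plt.title("Real consumption vs AI forecast on unseen data")
plt.legend()
plt.show()
```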
Reaching this point, we should ask ourselves:
Is it enough?
And the answer should always be no. As data scientists, it is our responsibility to quantify and contrast how good the model really is: never rely on an image alone, rely on the numbers!
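For illustration, the kind of numbers meant here could be computed as follows (again continuing the sketch above; MAE, RMSE and R² are common regression metrics, not necessarily the exact ones used in the real project):

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

# y_test and y_pred come from the comparison sketch above
mae = mean_absolute_error(y_test, y_pred)
rmse = np.sqrt(mean_squared_error(y_test, y_pred))
r2 = r2_score(y_test, y_pred)

print(f"MAE:  {mae:.2f}")
print(f"RMSE: {rmse:.2f}")
print(f"R2:   {r2:.3f}")
```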
That said, a deeper discussion of model evaluation is beyond the scope of this post.
DATA VISUALIZATION
This final step may not be the most challenging, but it holds a significant truth:
"A picture is worth a thousand words."
The most effective approach to incorporating big data into your company is to provide a clear demonstration of its capabilities and establish a user-friendly management system that enables employees to interact with the data and achieve their objectives.
By creating the following Power BI interactive screen, we share some important insights:
- Improve your understanding of machine behaviour by tracking monthly energy consumption.
- Check the main quality metrics of the ML model.
- Easily track the energy consumption forecast and compare it with real-life data.
Big J Insights
This is a brief introduction to my AI project. I hope you find it inspiring and can't wait for you to start your own!