type
Browse files
README.md
CHANGED
@@ -18,9 +18,18 @@ Demonstrates timeseries forecasting using a [LSTM model.](https://keras.io/api/l
|
|
18 |
|
19 |
|
20 |
|
21 |
-
## Intended uses & limitations
|
22 |
|
23 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
|
25 |
## Training and evaluation data
|
26 |
|
|
|
18 |
|
19 |
|
20 |
|
|
|
21 |
|
22 |
+
## Data Preprocessing
|
23 |
+
|
24 |
+
Here we are picking ~300,000 data points for training. Observation is recorded every 10 mins, that means 6 times per hour. We will resample one point per hour since no drastic change is expected within 60 minutes. We do this via the sampling_rate argument in timeseries_dataset_from_array utility.
|
25 |
+
|
26 |
+
We are tracking data from past 720 timestamps (720/6=120 hours). This data will be used to predict the temperature after 72 timestamps (72/6=12 hours).
|
27 |
+
|
28 |
+
Since every feature has values with varying ranges, we do normalization to confine feature values to a range of [0, 1] before training a neural network. We do this by subtracting the mean and dividing by the standard deviation of each feature.
|
29 |
+
|
30 |
+
71.5 % of the data will be used to train the model, i.e. 300,693 rows. split_fraction can be changed to alter this percentage.
|
31 |
+
|
32 |
+
The model is shown data for first 5 days i.e. 720 observations, that are sampled every hour. The temperature after 72 (12 hours * 6 observation per hour) observation will be used as a label.
|
33 |
|
34 |
## Training and evaluation data
|
35 |
|