# What is Time Series Analysis in Machine Learning?

What is a time series?

A collection of data recorded over a period of time (weekly, monthly, quarterly, etc.)

An analysis of history, it can be used by management to make current decisions and plans based on long-term forecasting

Usually assumes past pattern to continue into the future

Time Series Analysis

- Time Series: Ordered sequence of equally spaced values over time

- Time Series Analysis: Accounts for the **internal structure **of observations taken over time

Trend

Seasonality

Cycles

Random

- Goals

To identify the internal structure of the time series

To forecast future events

- Example: Based on sales history, what will next December sales be?

Businesses perform sales forecasting to look ahead in order to plan their investments, launch new products, decide when to close or withdraw products, etc. The sales forecasting process is a critical one for most businesses. Part of the sales forecasting process is to examine the past. How well did we do in the last few months or what were our sales in the same time period for the last few years? Time Series Analysis provides a scientific methodology for sales forecasting. **Time** **Series Analysis **is the analysis of sequential data across equally spaced units of time. Time Series is a basic research methodology in which data for one or more variables are collected for many observations at different time periods. The main objectives in Time Series Analysis are:

To understand the underlying structure of the time series by breaking it down to its components.

Fit a mathematical model and then proceed to forecast the future

The time periods are usually regularly spaced and the observations may be either univariate or multivariate. **Univariate** time series are those where only one variable is measured over time, whereas multivariate time series are those, where multiple variables are measured simultaneously.

Components of a Time Series

Secular Trend â€“ the smooth long term direction of a time series

Cyclical Variation â€“ the rise and fall of a time series over periods longer than one year

Seasonal Variation â€“ Patterns of change in a time series within a year which tends to repeat each year

Irregular Variation â€“ unpredictable but identifiable

Cyclical Variation â€“ Sample Chart

Seasonal Variation â€“ Sample Chart

Seasonal Variation

One of the components of a time series

Seasonal variations are fluctuations that coincide with certain seasons and are repeated year after year

Understanding seasonal fluctuations

**help plan for sufficient goods and materials**on hand to meet varying seasonal demandAnalysis of seasonal fluctuations over a period of years

**help in evaluating current sales**

The Moving Average Method

Useful in smoothing time series to see its trend

Basic method used in measuring seasonal fluctuation

Applicable when time series follows fairly linear trend that have definite rhythmic pattern.

Box-Jenkins Method: What is it?

Models historical behavior to forecast the future

- Applies **ARMA** (Autoregressive Moving Averages)

Input: Time Series

*- Accounting for **Trends** and **Seasonality** components*

Output: Expected future value of the time series

What is ARIMA Model?

- This acronym is descriptive, capturing the key aspects of the model itself:

**AR**:*Autoregression*. A model that uses the dependent relationship between an observation and some number of lagged observations.**I**:*Integrated*. The use of differencing of raw observations (e.g. subtracting an observation from an observation at the previous time step) in order to make the time series stationary.**MA**:*Moving Average*. A model that uses the dependency between an observation and a residual error from a moving average model applied to lagged observations.

- A standard notation is used of ARIMA(p,d,q) where the parameters are substituted with integer values to quickly indicate the specific ARIMA model being used.

- The parameters of the ARIMA model are defined as follows:

**p**: The number of lag observations included in the model, also called the lag order.**d**: The number of times that the raw observations are differenced, also called the degree of differencing.**q**: The size of the moving average window, also called the order of moving average.

Use Cases

**Forecast:**

Next month's sales

Tomorrow's stock price

Hourly power demand

Modeling a Time Series

Let's model the time series as

Yt =Tt +St +Rt, t=1,...,n.

- Tt: Trend term

Air travel steadily

**increased**over the last few years

- St: The seasonal term

Air travel

**fluctuates**in a regular pattern over the course of a year

**- Rt**: Random component

To be modeled with ARMA

Stationary Sequences

- the random component is a *stationary** sequence *

Constant mean

Constant variance

Autocorrelation does not change over time

- Constant correlation of a variable with itself at different times

- In practice, to obtain a stationary sequence, the data must be:

De-trended

Seasonally adjusted

De-trending

- In this example, we see a linear **trend**, so we fit a linear model

Tt = mÂ·t + b

- The **de-trended** series is then

Y1t = Yt â€“ Tt

Trend in a time series is a slow, gradual change in some property of the series over the whole interval under investigation.

De-trending is a pre-processing step to prepare time series for analysis by methods that assume stationarity.

A simple linear trend can be removed by subtracting a least-squares-fit straight line. In the example shown we fit a linear model and obtain the difference. The graph shown next is a de-trended time series.

More complicated trends might require different procedures such a fitting a non-linear model such as a quadratic or a exponential model.

Use a ** Linear Trend Model **if the first differences are more or less constant [ (y2-y1) = (y3-y2) = â€¦â€¦. = (yn-yn-1) ]

Use a ** Quadratic Trend Model **if the second differences are more or less constant. [ (y3-y2) â€“ (y2-y1) = â€¦â€¦â€¦= (yn-yn-1)-(yn-1-yn-2) ]

Use an ** Exponential Trend Model **if the percentage differences are more or constant. [ ((y2-y1) /y1 ) * 100% = â€¦â€¦.((yn-yn-1)/yn-1 ) * 100%

*To get help in Time Series Forecasting you can contact us or send your project requirement details at:*

realcode4you@gmail.com