Time Series: ARIMA, LSTM, Prophet & Stationarity
Forecasting the future from the past
Interview Question
"Explain the concept of stationarity in time series. What is ARIMA and how does it handle non-stationary data? Compare ARIMA with LSTM for time series forecasting."
Difficulty: Hard | Frequently asked at Amazon, Uber, Google
Theoretical Foundation
Stationarity
A time series is stationary if its statistical properties don't change over time:
- Constant mean:
- Constant variance:
- Constant autocovariance:
Why stationarity matters: Most time series models assume stationarity.
Making Non-Stationary Data Stationary
- Differencing: (first order)
- Log transform: Stabilizes variance
- Seasonal differencing:
ARIMA (AutoRegressive Integrated Moving Average)
Components:
- AR(p): AutoRegressive - depends on past values
- I(d): Integrated - differencing order
- MA(q): Moving Average - depends on past errors
Model:
Parameters:
- : Number of AR terms
- : Number of differences
- : Number of MA terms
ARIMA vs LSTM Comparison
| Aspect | ARIMA | LSTM |
|---|---|---|
| Assumptions | Linear, stationary | None (learns patterns) |
| Features | Univariate only | Multivariate |
| Non-linear | No | Yes |
| Data Required | Less | More |
| Interpretability | High | Low |
| Computation | Fast | Slow |
| Long-term Dependencies | Limited | Captures well |
Facebook Prophet
Model: Additive decomposition:
- : Trend (piecewise linear or logistic)
- : Seasonality (Fourier series)
- : Holidays and events
Properties:
- Handles missing data
- Robust to outliers
- Interpretable components
- Automatic changepoint detection
βΉοΈ
Key Insight: ARIMA works well for short-term forecasts of stationary linear processes. LSTM excels when there are complex non-linear patterns and sufficient data. Prophet is great for business time series with known seasonal patterns.
Code Implementation
Real-World Applications
Amazon: Demand Forecasting
- Inventory Management: Predicting product demand
- Supply Chain: Forecasting logistics requirements
- Pricing: Dynamic pricing based on demand patterns
Uber: Ride Demand Prediction
- ETA Prediction: Forecasting arrival times
- Surge Pricing: Predicting demand spikes
- Driver Allocation: Matching supply with demand
π‘
Amazon Interview Tip: Be prepared to discuss handling multiple seasonalities (daily, weekly, yearly) and the importance of feature engineering in time series.
Common Follow-Up Questions
Q1: How do you choose p, d, q for ARIMA? Use ACF/PACF plots, information criteria (AIC/BIC), or auto-ARIMA. d is determined by differencing until stationary.
Q2: When does LSTM outperform ARIMA? When there are complex non-linear patterns, multiple input features, or long-term dependencies that ARIMA cannot capture.
Q3: How do you handle missing values in time series? Forward fill, interpolation, or model-based imputation. Some models (Prophet) handle missing data natively.
Q4: What is the difference between time series regression and forecasting? Forecasting predicts future values using only past values. Time series regression uses external features (weather, events) to predict the target.