Skip to content

Model Selection for Trading

Overview

Choosing the right machine learning model for trading is critical. Unlike other domains, financial data is noisy, non-stationary, and has low signal-to-noise ratios. Model selection must balance predictive power, interpretability, and robustness.

Difficulty advanced

Model Taxonomy

Supervised Learning

Model Use Case Pros Cons
Linear/Logistic Regression Baseline, feature importance Interpretable, fast Assumes linearity
Random Forest Feature selection, non-linear Robust, handles non-linearity Slow inference, overfits
Gradient Boosting (XGBoost, LightGBM) Most tabular data tasks High accuracy, handles missing Complex tuning, overfits
SVM Classification, regime detection Good in high dimensions Slow, hard to tune
Neural Networks (MLP) Complex patterns Universal approximator Data hungry, black box
k-Nearest Neighbors Simple pattern matching Non-parametric Curse of dimensionality

Time Series Models

Model Use Case Pros Cons
ARIMA/ARIMAX Baseline forecasting Interpretable Linear, stationary
GARCH Volatility forecasting Captures clustering Only volatility
LSTM/GRU Sequential prediction Captures long-term deps Hard to train, slow
Transformer Multi-horizon prediction Parallelizable, attention Data hungry, complex
Temporal Fusion Transformer Forecasting with covariates Handles static/dynamic features Very complex
N-BEATS Pure forecasting State-of-the-art for univariate Univariate only

Reinforcement Learning

Model Use Case Pros Cons
Q-Learning / DQN Discrete action spaces Simple, proven Unstable training
PPO Continuous actions Stable, sample efficient Hyperparameter sensitive
SAC Continuous control Sample efficient, stable Complex
A2C/A3C Multi-environment Parallelizable Complex

Selection Framework

Step 1: Problem Definition

1. Prediction Type:
   - Classification: Direction up/down (binary or multi-class)
   - Regression: Price return, volatility, volume
   - Ranking: Relative performance across assets
   - Sequence: Regime identification

2. Time Horizon:
   - Intraday: Seconds to hours
   - Short-term: Days to weeks
   - Medium-term: Weeks to months
   - Long-term: Months to years

3. Data Structure:
   - Tabular: Features × observations
   - Sequential: Time series
   - Graph: Cross-asset relationships
   - Text: News, social media

Step 2: Baseline Models

Always start with simple baselines: a constant predictor (next return = 0), a momentum predictor (sign of trailing return), and OLS on a handful of well-understood features (volatility, spread, prior return). If a fancy model can't beat these out-of-sample, the fancy model isn't the edge — the features are. Used in the feature-engineering phase, before any hyperparameter tuning.

Key Considerations for Trading

1. Overfitting Prevention

1. Out-of-sample testing: Never test on training data
2. Walk-forward validation: Rolling retraining
3. Purged cross-validation: Remove overlapping data
4. Feature importance: Use only meaningful features
5. Regularization: L1/L2 penalties, dropout
6. Ensemble methods: Reduce variance
7. Simplicity first: Prefer simpler models

2. Stationarity

Financial data is non-stationary. Solutions:
- Use returns instead of prices
- Rolling z-scores for features
- Differencing, detrending
- Regime-aware models
- Regular retraining

3. Class Imbalance

Direction prediction is often 50/50, but:
- Extreme moves are rare
- Crisis periods are rare
- Use SMOTE, class weights, or focal loss

4. Latency Requirements

Intraday trading:
- Model inference < 1ms
- Prefer: Linear models, small trees
- Avoid: Large ensembles, deep networks

Daily/weekly trading:
- Inference < 1s acceptable
- Can use: XGBoost, neural networks

Purged Cross-Validation

Standard cross-validation leaks information in time series. Use purged CV: drop training samples whose label window overlaps any test sample, then embargo a buffer of bars after each test fold to defeat serial correlation. Without purging, a model "learns" the label by looking at adjacent overlapping observations and the out-of-sample Sharpe is fiction. Used in the validation phase — every hyperparameter score must come from purged folds.

Model Checklist

  • [ ] Baseline models established
  • [ ] Purged cross-validation used (not random splits)
  • [ ] Features tested for stationarity
  • [ ] Overfitting checked (train vs. test performance)
  • [ ] Feature importance analyzed
  • [ ] Latency requirements met
  • [ ] Model retraining schedule defined
  • [ ] Out-of-sample performance economically significant
  • [ ] Transaction costs included in evaluation
  • [ ] Model explainability sufficient for risk management

References

  1. Lopez de Prado, M. (2018). Advances in Financial Machine Learning. Wiley.
  2. Dixon, M., Klabjan, D., & Bang, J.H. (2020). Machine Learning for Algorithmic Trading (2nd ed.). Packt.
  3. Gu, S., Kelly, B., & Xiu, D. (2020). "Empirical Asset Pricing via Machine Learning." Review of Financial Studies, 33(5), 2223-2273.