Optiver Trading Competition

Overview

This project performs feature engineering and modeling on order book time series data to evaluate the effectiveness of Technical Analysis features in light of predicting the target

The main tasks include:

Exploratory data analysis
Feature engineering
- Technical indicators (RSI, MACD, Bollinger Bands)
- Time-weighted order book features
Feature selection
- Correlation analysis
- Filter-based (ANOVA)
- Wrapper-based (RFE)
Modeling
- XGBoost
- Hyperparameter tuning
- Cross-validation

Data

The train.csv dataset contains the following key features:

stock_id: Stock identifier
bid/ask_price: Bid and ask prices in the order book
bid/ask_size: Bid and ask sizes in the order book
target: The target variable to predict

Installation

This project requires Python 3 and the following libraries:

Pandas
NumPy
Scikit-learn
XGBoost
Optuna

These can be installed with pip or conda.

Results

We evaluated the impact of adding technical analysis (TA) features to our orderbook model across different numbers of stocks: 10, 50, and 100.

The key hypotheses tested were:

TA features reduce MAE by at least 10% compared to no TA features.
TA features reduce MAE compared to no TA features.

Hypothesis 1 Results

H0: TA features reduce MAE by at least 10%
H1: TA features reduce MAE by <10%

# Stocks	Result
10	✘
50	✘
100	✓

For 10 and 50 stocks, the >10% MAE improvement null hypothesis is rejected
For 100 stocks, the null >10% MAE improvement hypothesis holds

Hypothesis 2 Results

H0: TA features reduce MAE
H1: TA features do not reduce MAE

# Stocks	Result
10	✓
50	✓
100	✓

For all stock counts, TA features lead to lower MAE
The alternative hypotheses are rejected

Conclusion: Adding TA features reduces model MAE compared to no TA features for all stock counts. However, the >10% MAE improvement only holds for 100 stocks based on the Welch test results.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Data Wrangling		Data Wrangling
Result		Result
Visualization		Visualization
artifacts		artifacts
.DS_Store		.DS_Store
.gitignore		.gitignore
CSDS313 - Final Project Presentation.pdf		CSDS313 - Final Project Presentation.pdf
Feature Generation Technical Analysis.py		Feature Generation Technical Analysis.py
README.md		README.md
csds313_final.py		csds313_final.py
csds313_final_kiet.py		csds313_final_kiet.py
explain-the-data-lightgbm-baseline.ipynb		explain-the-data-lightgbm-baseline.ipynb
forwardout.txt		forwardout.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optiver Trading Competition

Overview

Data

Contents

Installation

Results

Hypothesis 1 Results

Hypothesis 2 Results

About

Releases

Packages

Languages

KNguyen37/optiver-trading-competition

Folders and files

Latest commit

History

Repository files navigation

Optiver Trading Competition

Overview

Data

Contents

Installation

Results

Hypothesis 1 Results

Hypothesis 2 Results

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages