Off the top of my head, here is my initial impression after using it for a day.
1. filtering for setups. Its fruitless to try and predict the markets all time time. Much, much belter to filter for specific setups. To do this, we need the ability to apply the filtering indicator to training, testing, and live trading.
2. Continuous contracts are needed but a PITA to create manually. Some tooling would help here. Especially being able to set the roll ratio.
3. My setups happen on a dozen instruments, I would like to train on all those instruments to make one model. Not just one futures contract, but many at once.
4. The data flow is undocumented. Are you using market replay data, historical data? Is it from my local computer or from the cloud? Is it tick data? I tried plugging in an indicator that uses 1m data. The intermediate data file showed that indicator looking totally different from Market Replay. I cannot trouble shoot this without understanding the read path for the data. But as it is, I cannot trust the model at all if the indicator inputs are wrong. Mostly I do need full level 1 tick data, and do not need level two at all.
5. Which data is used to form the "normal" or "do not take a trade now" background data set? This seems undocumented. Its very important, obviously.
6. Regression usually has so much more statistical power than classifiers. Why not allow/use regression techniques alongside classifiers?
7. How is the input covariate normalization done? This seems undocumented. Are you keeping a rolling volatility estimate (like an ATR) for each indicator? Is it a fixed, constant for all time normalization factor?
8. Feature importance output would be nice to help with feature selection when you have a ton of parameter choices for your indicators.
9. Which results are out-of-sample? Which are from in-sample? How do those two differ? The labeling needs to be clearer.
I may think of other things, but there's a start.
1. filtering for setups. Its fruitless to try and predict the markets all time time. Much, much belter to filter for specific setups. To do this, we need the ability to apply the filtering indicator to training, testing, and live trading.
2. Continuous contracts are needed but a PITA to create manually. Some tooling would help here. Especially being able to set the roll ratio.
3. My setups happen on a dozen instruments, I would like to train on all those instruments to make one model. Not just one futures contract, but many at once.
4. The data flow is undocumented. Are you using market replay data, historical data? Is it from my local computer or from the cloud? Is it tick data? I tried plugging in an indicator that uses 1m data. The intermediate data file showed that indicator looking totally different from Market Replay. I cannot trouble shoot this without understanding the read path for the data. But as it is, I cannot trust the model at all if the indicator inputs are wrong. Mostly I do need full level 1 tick data, and do not need level two at all.
5. Which data is used to form the "normal" or "do not take a trade now" background data set? This seems undocumented. Its very important, obviously.
6. Regression usually has so much more statistical power than classifiers. Why not allow/use regression techniques alongside classifiers?
7. How is the input covariate normalization done? This seems undocumented. Are you keeping a rolling volatility estimate (like an ATR) for each indicator? Is it a fixed, constant for all time normalization factor?
8. Feature importance output would be nice to help with feature selection when you have a ton of parameter choices for your indicators.
9. Which results are out-of-sample? Which are from in-sample? How do those two differ? The labeling needs to be clearer.
I may think of other things, but there's a start.
0