What is systematic trading?

Most investors make decisions in the moment — reacting to headlines, following tips, or trading on gut feel. Systematic trading is the opposite: you define your rules in advance — when to enter, when to exit, how much to risk — and the system follows them precisely, every time, without emotion. Oyamori gives you access to proven systematic strategies so you can trade on data, not instinct.

Do I need to know how to code to use Oyamori?

No. You choose a strategy from the catalog, connect your brokerage account, set your risk parameters, and Oyamori handles the rest. The intelligence is built in — you bring the decision, the platform brings the execution. If you are a developer and want to go deeper, the CLI and API are there. But coding is never a requirement.

What is a trading edge?

An edge is a repeatable market condition where the odds are in your favor — not guaranteed, but statistically consistent over time. Think of it like a weather forecast: not certain, but informed. Serious investors do not guess; they find edges and execute them systematically. Oyamori's catalog is built on proven edges across different asset classes and market conditions, each grounded in rigorous quantitative research.

How does AI sentiment trading work?

News moves markets — but most investors only find out after the price has already moved. Oyamori's AI intelligence layer, Newsvibe, reads financial news in real time and scores it for urgency, sentiment strength, and likely market impact. When the news is strongly positive but the price has not reacted yet, that gap is your signal. Oyamori detects it automatically and acts on your behalf — before the crowd catches up.

Is my capital safe with Oyamori?

Your money never moves to Oyamori. You connect your own brokerage account via a read-and-trade API key — Oyamori only routes signals, it never has access to withdraw or transfer your funds. Your capital stays in your account, under your name, at all times. Revoke access in seconds from your Alpaca dashboard whenever you need to.

How is Oyamori different from a hedge fund or robo-advisor?

A hedge fund takes your money and makes all the decisions — you hand over control and hope for the best. A robo-advisor puts you in a generic portfolio that looks the same as everyone else's. Oyamori is different: your capital stays in your own account, you choose the strategy, you set the risk limits. The AI does the heavy lifting — monitoring signals, enforcing your rules, executing trades — but every decision traces back to what you set up. You stay in control. The platform amplifies it.

Backtesting a Trading Strategy: What the Results Actually Mean

Backtesting a trading strategy is the process of applying your rules to historical price data and measuring what would have happened. It answers one question: did this logic produce positive results in the past? Before you commit real capital, a backtest is the minimum due diligence a systematic trader must do. But the number that comes back from that process is frequently misread — and that misreading is expensive.

This article explains what backtesting actually tests, which metrics to trust, and how to avoid the trap that kills most strategy development work: overfitting.

What Backtesting Actually Tests

A backtest does not predict future performance. It measures whether your rules, applied to a specific historical dataset, would have generated profit after transaction costs. That's a narrower claim than most traders think.

The result is conditional on:

The date range you chose
The instruments you included
The transaction cost assumptions you made
The number of times you modified the rules to make the results look better

That last point is where most backtests fail in practice. Every time you adjust a parameter to improve the historical result, you are fitting your model to noise that will not repeat. The backtest looks better; the live performance does not follow.

Key Metrics That Matter

When you run a backtest, you receive a result table. Here is what each number actually measures, and which ones deserve the most weight.

1.42

Sharpe Ratio

-18.3%

Max Drawdown

54%

Win Rate

1.68

Profit Factor

312 trades

Sample Size

+$87

Expectancy per Trade

Sharpe Ratio — return relative to volatility. The higher, the better the risk-adjusted performance. A Sharpe above 1.0 is considered acceptable for systematic strategies; above 1.5 is strong; above 2.0 is exceptional and worth verifying carefully for data-fitting.

The Sharpe ratio formula applied to a strategy's trade returns:

S = \frac{\bar{R} - R_f}{\sigma_R}

Where $\bar{R}$ is the mean return per period, $R_f$ is the risk-free rate, and $\sigma_R$ is the standard deviation of returns. For a strategy backtest without a risk-free component, the simplified form uses zero as the baseline — you are simply dividing average return by return volatility.

Max Drawdown is the largest peak-to-trough decline in equity. A 30% drawdown means at some point you would have been down 30% from a prior high. Most traders cannot hold through that psychologically — even if the system eventually recovered. Max drawdown sets your position sizing ceiling. (See the Kelly Criterion guide for how drawdown feeds into sizing decisions.)

Win Rate alone means nothing. A strategy with 35% win rate and large winners versus small losers can dramatically outperform one with 65% win rate and a poor reward-to-risk ratio.

Profit Factor divides total gross wins by total gross losses. A value above 1.0 means the strategy made money historically. Above 1.5 is a reasonable threshold for systematic trading.

Expectancy is average profit per trade across all trades, both winners and losers. This is the number that matters most for live trading — it tells you what to expect on a per-trade basis.

Sample size matters more than any single metric. Under 100 trades, the statistical noise dominates. Under 50 trades, any result is nearly meaningless.

The In-Sample / Out-of-Sample Problem

Every backtest has a dataset. The most common mistake is to develop and validate on the same dataset. This is the in-sample problem.

	In-Sample
Description	Data used during strategy development \| Data never seen during development \| Rolling windows — train on earlier period, test on next period
Purpose	Fit rules to historical patterns \| Validate that rules generalize \| Simulate realistic deployment conditions
Risk	High — you can overfit unknowingly \| Low — results show true generalizability \| Low — closest simulation of live trading
When to Use	Initial exploration only \| After rules are fully locked \| Before going live

The discipline is to partition your data before you start. Reserve 30–40% as a holdout set that you do not look at during development. Only after you have finalized every parameter do you run a single test on the holdout. That result is your honest performance estimate.

If your out-of-sample result is dramatically worse than in-sample, the strategy is overfit.

Overfitting: How It Happens and How to Detect It

🚨 DANGER

Overfitting occurs when you have too many parameters relative to your sample size. A strategy with 6 parameters optimized on 80 trades has effectively memorized noise. It will fail live.

Overfitting is not always deliberate. It can happen through innocent parameter searching. You test 20 combinations of moving average periods, pick the best one, and report that result. But you have implicitly used all 20 combinations — you have just hidden the search.

Signs your backtest may be overfit:

Sharpe Ratio above 3.0 on the in-sample period but below 0.5 out-of-sample
Win rate above 70% on small samples (under 150 trades)
Strategy only works in a narrow date range
Performance drops sharply when transaction costs increase slightly
Rules have many conditionals that combine in unusual ways

The practical test: reduce the number of parameters. A robust strategy should work across a range of parameter values, not just the single optimized setting. If changing one moving average period from 20 to 22 destroys the results, the edge is in the parameter, not the logic.

Walk-Forward Testing

Walk-forward testing is the most rigorous pre-live validation method available without live trading. It works as follows:

flowchart LR A[Build Strategy Logic] --> B[Backtest In-Sample Window] B --> C[Validate Out-of-Sample Window] C --> D{Results Hold?} D -- No --> A D -- Yes --> E[Paper Trade 30-60 Days] E --> F[Live Deployment — Small Size] F --> G[Scale Up After 100 Live Trades]

In each walk-forward step, you train on a fixed lookback window, test on the next period, then advance both windows forward in time. You repeat this 5 to 10 times across the full dataset. If the strategy is robust, the out-of-sample results should be consistently positive — not perfect, but directionally aligned with in-sample results. Consistent degradation across windows indicates overfitting.

Minimum Sample Size

💡 TIP

Minimum threshold for statistical significance in a trading backtest is 200 trades. Below this level, the performance numbers carry too much uncertainty to draw conclusions. This is not about intuition — it is about the mathematics of sample statistics.

At 200 trades, a measured Sharpe Ratio of 1.0 still has a 95% confidence interval that includes values below zero. The standard error of a Sharpe estimate decreases with the square root of sample size. You need more trades to trust smaller Sharpe values.

Practical guidance:

Under 100 trades: Do not evaluate the strategy. Generate more history or more instruments.
100–200 trades: Results directional only. Proceed with extreme caution.
200–500 trades: Minimum viable. Begin out-of-sample validation.
500+ trades: Reliable enough to estimate real-world performance within a reasonable confidence band.

The Limits of Any Backtest

⚠️ WARNING

Even a well-designed backtest with strong out-of-sample results does not guarantee live performance. Market regimes shift. Liquidity conditions change. The edge you measured may exist only in the historical period you tested.

A backtest cannot account for:

Execution slippage in real order books
Position impact on thin instruments
Data errors and survivorship bias (backtesting only on stocks that still exist today)
Regime changes where your edge disappears

This is not a reason to skip backtesting — it is a reason to treat backtest results as a lower bound on what can go wrong, not an upper bound on what you will earn.

Frequently Asked Questions

What is a good Sharpe ratio for a trading strategy?

For systematic strategies, a Sharpe ratio above 1.0 is acceptable, above 1.5 is good, and above 2.0 is excellent. Values above 3.0 in-sample should trigger suspicion of overfitting rather than excitement. The important number is the out-of-sample Sharpe, not the in-sample one.

How do I avoid overfitting in backtesting?

Lock your rules before you look at the out-of-sample data. Limit parameter count — aim for fewer than 5 adjustable inputs per strategy. Test robustness by checking whether the strategy still works if you shift parameters by 10–20%. Use walk-forward testing rather than a single train/test split. The more times you touch the rules, the more your out-of-sample set is contaminated.

What sample size is enough for a reliable backtest?

200 trades is the minimum threshold for drawing any conclusions. At that level, a Sharpe of 1.5 is still uncertain enough that you should paper trade before committing capital. 500 trades gives you meaningful confidence. If your strategy trades infrequently — monthly or weekly signals — you may need 10 to 20 years of data to accumulate sufficient sample size, which introduces its own regime-change risks.

Key Takeaway

A backtest proves your rules worked on a specific dataset in the past — nothing more. To turn that into a deployable edge, you need an honest out-of-sample test, at least 200 trades in your sample, a walk-forward validation, and the discipline to not retrofit the rules after seeing the results. The Sharpe ratio and max drawdown matter most; win rate alone tells you almost nothing.

What Backtesting Actually Tests

Key Metrics That Matter

The In-Sample / Out-of-Sample Problem

Overfitting: How It Happens and How to Detect It

Walk-Forward Testing

Minimum Sample Size

The Limits of Any Backtest

Entry Confirmation Checklist: Why Waiting Beats the Perfect Price

Smart Money in Trading: Who the Big Money Really Is

ICT vs SMC Terminology: The Complete Cross-Reference Lexicon