Backtesting Swing Trading Strategies: A Step-by-Step Guide

This is an amateur website and It’s not a professional publication. Pages are written on an occasional basis and are free to read. Contents herein do not predict economic scenarios or financial outcomes and to the best knowledge of the author they represent the current consensus in technical and academic research and are presented for educational purpose only and under any circumstance they are not financial advice or solicitation to trade. Pages contain paid links. The whole content of this website is not intended for residents of Chile, Andorra, Italy, Spain, France, Germany, Turkey, Greenland or any individual under legal age.

Backtesting Swing Trading Strategies: A Step-by-Step Guide

1. Define Your Strategy with Precision

Before writing a single line of code or opening a spreadsheet, you must codify your swing trading strategy into a set of unambiguous, binary rules. Ambiguity is the silent killer of backtesting. A “buy when the stock looks strong” is not a rule; it is a feeling. Your rules must be specific enough that two different traders, given the same data, would execute the exact same trades.

Key components to define:

Entry Conditions: Specify the exact indicators, patterns, and price levels. For example: “Buy when the 10-day Exponential Moving Average (EMA) crosses above the 30-day EMA, AND the Relative Strength Index (RSI) is below 30 (oversold), AND the volume is at least 1.5 times the 20-day average volume.”
Exit Conditions: Define profit targets (e.g., “Sell 50% at a 2% gain, trail stop on the remainder at the 20-day EMA”), stop-losses (e.g., “Hard stop at 1.5x Average True Range (ATR) below entry”), and time-based exits (e.g., “Exit all positions after 10 trading days regardless of price”).
Position Sizing: Determine the exact percentage of capital risked per trade (e.g., “Risk 1% of total account equity per trade”).
Universe Selection: Specify which stocks or assets you will trade (e.g., “Only S&P 500 constituents with an average daily volume above 1 million shares”). Filter out illiquid, penny, or volatile garbage stocks to avoid unrealistic fills.

2. Gather High-Quality Data

Your backtest is only as good as the data you feed it. Using adjusted, survivorship-bias-free data is non-negotiable. Survivorship bias—where data only includes stocks that still exist today—massively inflates past performance because it ignores the many stocks that went bankrupt or were delisted.

Data essentials:

OHLCV Data: Open, High, Low, Close, and Volume for every trading day of your testing period.
Dividend & Split Adjustments: Raw prices are useless; use adjusted close prices to account for stock splits and dividends, ensuring your backtest reflects total return.
Time Period: At minimum, test over 5–10 years to capture multiple market regimes (bull, bear, sideways, high volatility, low volatility). For example, test from 2014 to 2024 to include the 2015 flash crash, the COVID-19 crash, and the 2022 rising-rate drawdown.
Data Sources: Reliable providers include EOD Historical Data, Polygon.io, or your brokerage’s API (e.g., TD Ameritrade, Interactive Brokers). Avoid free Yahoo Finance for professional work; data gaps and adjustments are common.

3. Choose Your Backtesting Platform

You have three primary paths, each with trade-offs:

Excel/Google Sheets: Best for beginners and extremely simple strategies (e.g., single moving average crossover). Pros: Total control and transparency. Cons: Slow, crashes with large datasets, no automated optimization, no advanced features like multi-asset portfolios.
Programming (Python/R): The industry standard for serious traders. Use Python libraries like backtrader, zipline, or vectorbt. Pros: Unlimited flexibility, fast execution, integrated portfolio analytics, walk-forward optimization, Monte Carlo simulation. Cons: Steep learning curve; you need coding skills.
Dedicated Software (TradeStation, NinjaTrader, MultiCharts): Built for backtesting with point-and-click strategy builders. Pros: Integrated brokerage execution, real-time data, extensive indicator libraries. Cons: Subscription costs, less flexible than code, proprietary scripting languages.

Recommended data structure in Python:

import pandas as pd
import backtrader as bt

data = bt.feeds.PandasData(dataname=df)
cerebro = bt.Cerebro()
cerebro.adddata(data)

4. Code or Drag-and-Drop Your Strategy

Translate your written rules into logic. If using Python with backtrader, a simple strategy class might look like:

class SwingBreakout(bt.Strategy):
    params = (('period', 20), ('atr_mult', 1.5),)

    def next(self):
        if not self.position:
            if self.data.close[0] > self.data.high[-self.params.period:].max():
                self.buy(exectype=bt.Order.Limit,
                         price=self.data.close[0])
                self.stop_price = self.data.close[0] - (self.params.atr_mult * bt.ind.ATR(self.data, period=14)[0])
        else:
            if self.data.close[0] < self.stop_price:
                self.close()

Key logic elements to include in your code:

No Look-Ahead Bias: Ensure every calculation uses only data available at the time of the decision. Never use self.data.close[0] to trigger a buy when the close is unknown until the session ends. Use self.data.close[-1] for previous close or open-only triggers.
Slippage & Commission: Add a realistic 0.1%–0.3% slippage (depending on liquidity) and brokerage commissions. Many platforms allow you to set a fixed percentage per trade.
Position Sizing Logic: Code your cash management—e.g., self.sizer = bt.sizers.PercentSizer(percents=2) to risk 2% of capital.

5. Run the Initial Backtest

Execute the backtest over your full historical period. This first run is not your final result; it is your baseline. Focus on generating the following metrics:

Total Return: Obvious, but compare to buy-and-hold (S&P 500) over the same period.
CAGR (Compound Annual Growth Rate): The annualized growth rate.
Maximum Drawdown: The largest peak-to-trough decline. For a swing strategy, a drawdown above 30% is generally unacceptable for a systematic approach.
Sharpe Ratio: Risk-adjusted return. Above 1.0 is decent; above 2.0 is excellent. Use daily returns for calculation.
Win Rate: Percentage of profitable trades. Swing strategies often have lower win rates (40-50%) but higher reward-to-risk ratios.
Profit Factor: Gross profit divided by gross loss. Above 1.5 is good; above 2.0 is strong.
Average Trade Duration: Important for swing trading. Ideally 3–15 days.

Example output from a 10-year test of a 20-day breakout strategy:

Total Return: 340%
CAGR: 12.8%
Max Drawdown: -22.4%
Sharpe Ratio: 1.21
Win Rate: 43.2%
Profit Factor: 1.85
Avg Trade Duration: 7.3 days

6. Analyze the Equity Curve and Trade Log

Numbers lie; the equity curve rarely does. Plot the cumulative equity of your strategy over time. Look for:

Long periods of flat or declining equity (drawdown). Are there specific market regimes where the strategy fails? For example, does it crash during low-volatility trends?
Clusters of losses. Do losses come in waves? This indicates the strategy is not robust across different market conditions.
Runs of large winners skewing results. One trade accounting for 20%+ of total profit is a red flag; the strategy may be overly dependent on a single outlier event.

Scrutinize the trade log. Check for unrealistic entries and exits. Did the strategy “buy” at the exact low of a candlestick? In reality, you’d likely get a fill a few cents higher. Did it exit at the exact high? Impossible. Adjust your code to model fills at the open price of the next bar (OCO) to simulate real-world execution.

7. Walk-Forward Optimization (WFO)

Standard optimization (testing many parameter combinations on the same data) overfits instantly. WFO mimics real-time trading by optimizing only on past data and testing on unseen future data.

How to implement WFO for swing trading:

In-sample (IS) window: Choose 2–3 years of data to optimize parameters (e.g., 20-day lookback, 1.5x ATR stop).
Out-of-sample (OOS) window: The next 6–12 months of data. Run the strategy using only the optimized parameters from the IS window.
Roll forward: Shift the IS window forward by the OOS period, re-optimize, test again. Repeat for the entire dataset.

Minimum acceptable WFO results: The OOS performance should be at least 70% of IS performance in terms of Sharpe ratio and CAGR. If your OOS drawdown is double your IS drawdown, the strategy is fitted to noise, not signal.

8. Sensitivity Analysis and Monte Carlo Simulation

Even a robust strategy has random variance. Monte Carlo simulation reshuffles the sequence of your trade returns (or randomly resamples them) to show the range of possible outcomes.

Steps:

Take your list of actual trade returns (e.g., +1.2%, -0.5%, +3.1%, etc.).
Randomly select returns (with replacement) to simulate 1,000 hypothetical trade sequences.
Calculate CAGR and max drawdown for each simulation.
Examine the distribution. If 10% of simulations show a max drawdown exceeding 40%, your strategy is too risky for a typical swing trading account. If 50% of simulations show a negative CAGR, your edge is weak.

Tools: numpy, scipy, or platforms like TradingView’s Strategy Tester (basic). For Python, use pandas to resample with replacement.

9. Reality Check with Slippage and Liquidity Constraints

Swing trading strategies often target mid-cap or large-cap stocks, but even those can have slippage during fast moves. Adjust your backtest to:

Apply a fixed slippage per trade: Use 0.05% * 2 (entry + exit) for liquid large caps (e.g., Apple, Microsoft). Use 0.2% * 2 for mid-caps or high-volatility names.
Model partial fills: If your strategy buys 1,000 shares but volume at that moment was only 500, your actual fill will be worse. Use volume-weighted average price (VWAP) or limit-order modeling in advanced platforms.
Beware of “ghost liquidity”: In backtests, the strategy often assumes it can trade at any price with infinite liquidity. In reality, a 1,000-share order in a stock with 200,000 shares/day will have significant slippage. Limit your trade size relative to volume (e.g., never trade more than 5% of 20-day average volume).

10. Overfitting Detection: The “Out-of-Backtest” Test

The final and most rigorous step is to test on data that you have never seen—literally unobserved by you.

How to do it:

Split your historical data into three chunks: training (60%), validation (20%), and testing (20%).
Use the training and validation sets for WFO and optimization.
Hold back the final 20% (e.g., most recent 2 years) entirely. Do not look at it, optimize on it, or adjust your strategy based on it.
Run the finished, untouched strategy on this final 20% exactly once.
If the performance metrics (especially Sharpe ratio, profit factor, and max drawdown) are similar to the validation set, your strategy passes. If it fails badly (e.g., Sharpe drops from 1.5 to 0.3), the strategy is overfit to noise and will likely fail in live trading.

Bonus tip: Record your exact parameter set before running this final test. If you change even one number after seeing the result, you invalidate the test.

11. Account for Portfolio Realities (Multiple Positions)

Most swing traders run multiple positions simultaneously. A single-stock backtest ignores correlation effects.

Portfolio-level backtesting adjustments:

Set a maximum number of concurrent positions (e.g., 5–10).
Rank signals by strength (e.g., highest breakout percentage) or randomize to avoid bias.
Apply a capital allocation rule: equal weight, risk-parity, or position size based on ATR.
Run the portfolio backtest: Compare the portfolio equity curve to your single-stock curve. You will likely see lower maximum drawdown but also lower CAGR due to dilution.

12. Documentation and Audit Trail

Treat your backtest as a scientific experiment. Maintain a detailed log:

Date of test
Dataset used (source, time period, universes)
Exact parameter values (e.g., lookback=20, ATR_mult=1.5, stop_type=trailing)
Slippage and commission assumptions
Software version (e.g., backtrader 1.9.76, Python 3.9)
Results (all metrics from step 5)
Observations (e.g., “Strategy failed during low-VIX periods below 12”)

Without documentation, you will repeat mistakes and never know why a strategy worked (or didn’t).

13. Emotional and Psychological Variance

Backtests do not capture the emotional agony of a 15% drawdown or the boredom of a flat year. A strategy that looks amazing on paper can be abandoned in practice.

Analyze your own psychological tolerance:

If your backtest shows a max drawdown of 20%, can you realistically execute through a 15% drawdown? Most retail traders chase losses and deviate from the plan.
Does the strategy produce months with zero trades? Swing traders often feel compelled to trade “something” even when conditions are poor.
Simulate the equity curve with a 2x weekly check-in. Would you have stayed the course?

14. The Final Validation: Paper Trade

Only after passing all statistical and robustness tests should you risk real capital. Run the strategy in a paper trading account (using real-time data) for at least 3–6 months. This tests:

Execution fidelity: Does the brokerage platform execute fills as your backtest assumed?
Data latency: Are you seeing signals early enough to react?
Psychological adherence: Do you close a losing trade when the backtest says to, even when it hurts?

Compare paper trade P&L to backtest P&L. A 20–30% degradation due to slippage, commissions, and human error is normal. More than 50% degradation means your backtest assumptions were too optimistic.

15. Iterate, Don’t Expect Perfection

The best swing trading strategies are not perfect; they are robust. After backtesting, you will likely refine entry/exit conditions, adjust position sizing, or add a market regime filter (e.g., “Only trade when VIX is above 15 and below 30”). Document each iteration and its rationale. A strategy that has survived dozens of rigorous backtests, walk-forward analyses, and Monte Carlo simulations is far more likely to survive the market’s next ambush.

Common pitfall: Parameter curve fitting. If you change your stop-loss from 1.5 ATR to 1.45 ATR and the Sharpe ratio jumps from 1.2 to 1.6, you are overfitting. Robust parameters show stable performance across a range of values (e.g., 1.3–1.7 ATR all produce Sharpe above 1.0).

Backtesting Swing Trading Strategies: A Step-by-Step Guide

Backtesting Swing Trading Strategies: A Step-by-Step Guide

Swing Trading Chart Patterns: Head and Shoulders, Flags, and More

Day Trading Indicators: Moving Averages, RSI, and VWAP

Best Trend Following Books to Master the Strategy

Backtesting Swing Trading Strategies: A Step-by-Step Guide

Backtesting Swing Trading Strategies: A Step-by-Step Guide

Swing Trading Chart Patterns: Head and Shoulders, Flags, and More

Day Trading Indicators: Moving Averages, RSI, and VWAP

Best Trend Following Books to Master the Strategy

Discover more from DNS Research