How to Test Market Efficiency: Strategies That Actually Work

Quick Summary
Learn how to rigorously test market efficiency using event studies, portfolio analysis, and investor group studies — with real data and risk-adjusted returns.
In This Article
Why Market Efficiency Testing Is the Foundation of Active Investing
Every active investor — whether running a $10 million equity portfolio or managing a $50 billion hedge fund — is making one core bet: that markets are inefficient enough to exploit. But making that bet stick requires more than a hunch. Testing market efficiency is a disciplined, quantitative process, and most retail investors get it wrong before they even start.
The framework for testing market efficiency breaks down into three distinct methodologies: event studies, portfolio studies, and investor group studies. Each answers a different question. Together, they form the empirical backbone of any serious case for — or against — active investing. Understanding how these tests work, where they break down, and what they actually reveal about markets is essential knowledge for any ambitious investor or finance professional.
The Joint Hypothesis Problem Every Investor Ignores
Before diving into methodology, there is a foundational problem that undermines almost every market efficiency test ever conducted: the joint hypothesis problem.
When you test whether a strategy beats the market, you are simultaneously testing two things:
- Whether the market is truly mispricing assets
- Whether the model you are using to calculate expected returns is correct
If your test shows a strategy earns excess returns, you cannot definitively say markets are inefficient. It is equally possible your risk model is wrong. This is not a theoretical nuance — it has real consequences. Dozens of apparent market anomalies have later been attributed to flawed risk models rather than genuine pricing inefficiencies.
This is why the choice of benchmark matters enormously. Comparing raw returns to the S&P 500 is the most common approach, but it is also the most flawed. A strategy that returns 18% annually when the S&P 500 returns 12% looks brilliant — until you discover it carries three times the systematic risk. Risk-adjusted measures tell a very different story.
The main risk-adjusted benchmarks used in market efficiency testing include:
- Sharpe Ratio: Return divided by standard deviation of returns, compared against a passive benchmark
- Jensen's Alpha: Actual return minus the expected return derived from CAPM (risk-free rate + beta × equity risk premium)
- Treynor Index: Excess return over the risk-free rate, divided by beta
- Information Ratio: Excess returns relative to tracking error against an index
Each of these forces you to ask: given what I risked, did I actually earn more than I should have? That is the only question worth answering.
Event Studies: Measuring How Markets React to New Information
An event study tests whether a specific piece of information — an earnings announcement, a Fed rate decision, an M&A filing — triggers a price reaction that investors could have exploited for profit.
The methodology follows a precise sequence:
- Define the exact event — not broadly, but precisely. If you are studying the effect of options listings on stock prices, the relevant date is the announcement that options will be listed, not the date they actually begin trading. That distinction alone can make or break your results.
- Collect returns around the event window — typically a range of days before and after. A common window is 10 trading days before to 10 trading days after (21 days total), though this depends on the event.
- Adjust for market movements and stock-specific risk — using beta to calculate expected returns for each day in the window. Subtract expected returns from actual returns to isolate the event's impact.
- Average across all companies in the sample — and compute standard errors to assess statistical significance.
Two types of significance matter: statistical significance (is the result unlikely to be random?) and economic significance (is the effect large enough to profit from after transaction costs?).
Consider an older but instructive study on options listing announcements. When options begin trading on a stock, theory suggests it should improve price discovery and potentially attract new investor types. Across a sample of 100 stocks, the average excess return on the announcement day was negligible — and across the full 21-day window, the cumulative effect, while slightly positive, barely cleared statistical significance thresholds. The T-statistic on any individual day hovered around 1.3 to 1.9, well below the 2.0 threshold typically required for 95% confidence.
The takeaway is instructive: even when a result appears directionally correct, if it lacks both statistical and economic significance, it is not a tradeable edge. Later event studies — particularly around earnings announcements and M&A disclosures — show far stronger signals, but the methodology remains identical.
Portfolio Studies: Testing Whether Stock Characteristics Predict Returns
Portfolio studies ask a different question: can you consistently beat the market by selecting stocks that share a specific measurable characteristic?
Low price-to-earnings (P/E) ratios are among the most studied. The intuition is that cheap stocks — those trading at low multiples of earnings — may be systematically undervalued. But intuition is not evidence. A proper portfolio study requires:
- Selecting and defining the variable — P/E ratio, institutional ownership percentage, founder-CEO status, or any other quantifiable characteristic
- Ranking and sorting all eligible stocks — typically into five or ten groups (quintiles or deciles) from lowest to highest values
- Measuring returns across each portfolio over a forward-looking period — not the same period in which the hypothesis was developed. This holdout period requirement is critical and frequently violated.
- Adjusting for risk — using pre-period betas to calculate expected returns for each portfolio, then computing excess returns
- Accounting for real-world complications — delistings, bankruptcies, and acquisitions cannot be excluded from the sample. A stock that goes bankrupt delivers a -100% return and must stay in the data.
A study of New York Stock Exchange stocks from 1987 illustrates the process. Starting with roughly 1,500 companies with positive earnings (negative-earnings firms drop out, introducing a survivorship bias worth noting), stocks were divided into five P/E quintiles of approximately 300 stocks each. Over the subsequent five years, the lowest P/E quintile outperformed the highest by approximately 4.56% per year on a risk-adjusted basis. With that magnitude, the result is almost certainly statistically significant — though the practical value depends entirely on your transaction costs and tax situation.
One methodological refinement worth understanding: sorting stocks into discrete buckets loses information. Running a continuous regression of excess returns against P/E ratios — or against multiple variables simultaneously — preserves all variation in the data and allows you to control for confounding factors. Want low P/E stocks that also have high growth and low debt? A multivariate regression handles that in a single model. The caveat is that standard regressions assume linear relationships, which often do not hold in financial data. Careful specification and diagnostic testing are required before trusting the output.
Investor Group Studies: Do Hedge Funds and Analysts Actually Beat the Market?
The third form of market efficiency testing shifts focus from strategies to people. Investor group studies ask: do professional investors — hedge funds, mutual fund managers, equity analysts — systematically earn returns that exceed what a passive index investor would have made?
The methodology mirrors the earlier approaches but adds a critical practical concern: survivorship bias.
If you study only hedge funds that exist today and track their historical performance backwards, your sample is severely distorted. Funds that failed, were wound down, or were merged out of existence between 2010 and today are invisible in your data — but they would have been live investments at the time. A realistic test must include the full population of funds operating during the study period, including those that no longer exist.
Beyond survivorship bias, investor group studies must grapple with:
- Backfill bias — many databases only add a fund's historical returns when it joins, typically after a strong performance run
- Fee structures — reported gross returns look very different from net-of-fee returns, which is what investors actually receive
- Risk measurement difficulty — hedge funds often use leverage, derivatives, and illiquid assets that standard beta calculations do not capture well
The academic literature on mutual fund performance is fairly consistent: after fees, the average actively managed mutual fund underperforms its benchmark index over long periods. The evidence on hedge funds is more mixed, partly because of data quality issues and partly because the top-performing funds are often closed to new investors by the time the evidence is clear.
The Practical Limits of Market Efficiency Research
Even a methodologically perfect study faces constraints that limit its real-world utility:
Data mining risk: With enough variables and time periods, patterns will appear by chance. The academic literature is littered with anomalies that disappeared as soon as they became widely known — the act of publication essentially arbitraged them away.
Capacity constraints: A strategy that earns 4% in excess returns with a $5 million portfolio may generate negligible alpha when scaled to $500 million. Market impact erodes edge rapidly.
Free Weekly Newsletter
Enjoying this guide?
Get the best articles like this one delivered to your inbox every week. No spam.
Transaction costs and taxes: Academic studies often ignore or underestimate these. A strategy requiring monthly rebalancing of small-cap stocks looks very different before and after realistic friction costs.
Regime dependency: Many documented anomalies — value, momentum, small-cap premium — show strong period-dependency. What worked from 1963 to 1990 may not work from 1990 to 2025, and vice versa.
The disciplined response is not to abandon quantitative testing, but to hold the bar high: require large samples, long time horizons, out-of-sample validation, and economic plausibility before concluding that any strategy represents a genuine market inefficiency.
What Market Efficiency Testing Actually Tells Investors
The practical value of understanding how market efficiency is tested extends well beyond academic curiosity. For the ambitious investor or finance professional, it provides a critical filter:
- When someone claims a strategy beats the market, ask which risk-adjustment model they used and whether the excess returns survive after fees and transaction costs
- When reading fund marketing materials, ask whether the track record includes the full sample of comparable funds or just survivors
- When a back-test shows stunning results, ask whether the strategy parameters were chosen from the same data period being tested
- When a variable appears to predict returns, ask whether that relationship holds out-of-sample and across different market regimes
The S&P 500 returned approximately 10.7% per year on average over the past 50 years. Most active managers did not match it after fees. That is not because outperformance is impossible — it is because most tests of market-beating strategies fail under rigorous scrutiny. The ones that survive tend to involve either genuine information advantages, behavioural edges that markets are slow to arbitrage away, or risk exposures that most investors are unwilling to hold through the inevitable drawdowns.
Understanding market efficiency testing will not guarantee you find an edge. But it will stop you from fooling yourself into thinking you have one when you do not.
This article is for informational purposes only and does not constitute financial advice. Always consult a qualified financial professional before making investment decisions.
Frequently Asked Questions
What is the joint hypothesis problem in market efficiency testing?
The joint hypothesis problem means that any test of market efficiency simultaneously tests two things: whether the market is mispricing assets, and whether the risk model used to calculate expected returns is correct. If a strategy appears to earn excess returns, it could mean markets are inefficient — or it could mean the expected return model is wrong. This makes it impossible to definitively prove or disprove market efficiency from any single test.
What is the difference between statistical significance and economic significance in an event study?
Statistical significance tells you whether a result is unlikely to be due to random chance — typically measured by a T-statistic above 2.0 for 95% confidence. Economic significance tells you whether the effect is large enough to actually profit from after accounting for transaction costs, taxes, and execution friction. A result can be statistically significant but economically meaningless if the excess return is, say, 0.3% annually before costs.
Why is survivorship bias a major problem in hedge fund and mutual fund performance studies?
Survivorship bias occurs when performance databases only include funds that are still operating today, excluding funds that closed, failed, or merged. Since weaker-performing funds are disproportionately likely to close, any study that looks only at surviving funds will overstate the average performance of the group. A realistic assessment must include the full universe of funds that existed during the study period, regardless of whether they survived to the end.
Can low P/E stock strategies consistently beat the market?
Historical data — including studies covering U.S. equities from the late 1980s onward — suggests that low P/E portfolios have outperformed high P/E portfolios by approximately 3–5% per year on a risk-adjusted basis over long periods. However, this edge is not guaranteed to persist, varies significantly by time period and market regime, and requires careful accounting for transaction costs, rebalancing frequency, and the exclusion of negative-earnings stocks from the universe. Most researchers treat it as a persistent but cyclical anomaly rather than a guaranteed return premium.
Frequently Asked Questions
Why Market Efficiency Testing Is the Foundation of Active Investing
Every active investor — whether running a $10 million equity portfolio or managing a $50 billion hedge fund — is making one core bet: that markets are inefficient enough to exploit. But making that bet stick requires more than a hunch. Testing market efficiency is a disciplined, quantitative process, and most retail investors get it wrong before they even start.
The framework for testing market efficiency breaks down into three distinct methodologies: event studies, portfolio studies, and investor group studies. Each answers a different question. Together, they form the empirical backbone of any serious case for — or against — active investing. Understanding how these tests work, where they break down, and what they actually reveal about markets is essential knowledge for any ambitious investor or finance professional.
The Joint Hypothesis Problem Every Investor Ignores
Before diving into methodology, there is a foundational problem that undermines almost every market efficiency test ever conducted: the joint hypothesis problem.
When you test whether a strategy beats the market, you are simultaneously testing two things:
- Whether the market is truly mispricing assets
- Whether the model you are using to calculate expected returns is correct
If your test shows a strategy earns excess returns, you cannot definitively say markets are inefficient. It is equally possible your risk model is wrong. This is not a theoretical nuance — it has real consequences. Dozens of apparent market anomalies have later been attributed to flawed risk models rather than genuine pricing inefficiencies.
This is why the choice of benchmark matters enormously. Comparing raw returns to the S&P 500 is the most common approach, but it is also the most flawed. A strategy that returns 18% annually when the S&P 500 returns 12% looks brilliant — until you discover it carries three times the systematic risk. Risk-adjusted measures tell a very different story.
The main risk-adjusted benchmarks used in market efficiency testing include:
- Sharpe Ratio: Return divided by standard deviation of returns, compared against a passive benchmark
- Jensen's Alpha: Actual return minus the expected return derived from CAPM (risk-free rate + beta × equity risk premium)
- Treynor Index: Excess return over the risk-free rate, divided by beta
- Information Ratio: Excess returns relative to tracking error against an index
Each of these forces you to ask: given what I risked, did I actually earn more than I should have? That is the only question worth answering.
Event Studies: Measuring How Markets React to New Information
An event study tests whether a specific piece of information — an earnings announcement, a Fed rate decision, an M&A filing — triggers a price reaction that investors could have exploited for profit.
The methodology follows a precise sequence:
- Define the exact event — not broadly, but precisely. If you are studying the effect of options listings on stock prices, the relevant date is the announcement that options will be listed, not the date they actually begin trading. That distinction alone can make or break your results.
- Collect returns around the event window — typically a range of days before and after. A common window is 10 trading days before to 10 trading days after (21 days total), though this depends on the event.
- Adjust for market movements and stock-specific risk — using beta to calculate expected returns for each day in the window. Subtract expected returns from actual returns to isolate the event's impact.
- Average across all companies in the sample — and compute standard errors to assess statistical significance.
Two types of significance matter: statistical significance (is the result unlikely to be random?) and economic significance (is the effect large enough to profit from after transaction costs?).
Consider an older but instructive study on options listing announcements. When options begin trading on a stock, theory suggests it should improve price discovery and potentially attract new investor types. Across a sample of 100 stocks, the average excess return on the announcement day was negligible — and across the full 21-day window, the cumulative effect, while slightly positive, barely cleared statistical significance thresholds. The T-statistic on any individual day hovered around 1.3 to 1.9, well below the 2.0 threshold typically required for 95% confidence.
The takeaway is instructive: even when a result appears directionally correct, if it lacks both statistical and economic significance, it is not a tradeable edge. Later event studies — particularly around earnings announcements and M&A disclosures — show far stronger signals, but the methodology remains identical.
Portfolio Studies: Testing Whether Stock Characteristics Predict Returns
Portfolio studies ask a different question: can you consistently beat the market by selecting stocks that share a specific measurable characteristic?
Low price-to-earnings (P/E) ratios are among the most studied. The intuition is that cheap stocks — those trading at low multiples of earnings — may be systematically undervalued. But intuition is not evidence. A proper portfolio study requires:
- Selecting and defining the variable — P/E ratio, institutional ownership percentage, founder-CEO status, or any other quantifiable characteristic
- Ranking and sorting all eligible stocks — typically into five or ten groups (quintiles or deciles) from lowest to highest values
- Measuring returns across each portfolio over a forward-looking period — not the same period in which the hypothesis was developed. This holdout period requirement is critical and frequently violated.
- Adjusting for risk — using pre-period betas to calculate expected returns for each portfolio, then computing excess returns
- Accounting for real-world complications — delistings, bankruptcies, and acquisitions cannot be excluded from the sample. A stock that goes bankrupt delivers a -100% return and must stay in the data.
A study of New York Stock Exchange stocks from 1987 illustrates the process. Starting with roughly 1,500 companies with positive earnings (negative-earnings firms drop out, introducing a survivorship bias worth noting), stocks were divided into five P/E quintiles of approximately 300 stocks each. Over the subsequent five years, the lowest P/E quintile outperformed the highest by approximately 4.56% per year on a risk-adjusted basis. With that magnitude, the result is almost certainly statistically significant — though the practical value depends entirely on your transaction costs and tax situation.
One methodological refinement worth understanding: sorting stocks into discrete buckets loses information. Running a continuous regression of excess returns against P/E ratios — or against multiple variables simultaneously — preserves all variation in the data and allows you to control for confounding factors. Want low P/E stocks that also have high growth and low debt? A multivariate regression handles that in a single model. The caveat is that standard regressions assume linear relationships, which often do not hold in financial data. Careful specification and diagnostic testing are required before trusting the output.
Investor Group Studies: Do Hedge Funds and Analysts Actually Beat the Market?
The third form of market efficiency testing shifts focus from strategies to people. Investor group studies ask: do professional investors — hedge funds, mutual fund managers, equity analysts — systematically earn returns that exceed what a passive index investor would have made?
The methodology mirrors the earlier approaches but adds a critical practical concern: survivorship bias.
If you study only hedge funds that exist today and track their historical performance backwards, your sample is severely distorted. Funds that failed, were wound down, or were merged out of existence between 2010 and today are invisible in your data — but they would have been live investments at the time. A realistic test must include the full population of funds operating during the study period, including those that no longer exist.
Beyond survivorship bias, investor group studies must grapple with:
- Backfill bias — many databases only add a fund's historical returns when it joins, typically after a strong performance run
- Fee structures — reported gross returns look very different from net-of-fee returns, which is what investors actually receive
- Risk measurement difficulty — hedge funds often use leverage, derivatives, and illiquid assets that standard beta calculations do not capture well
The academic literature on mutual fund performance is fairly consistent: after fees, the average actively managed mutual fund underperforms its benchmark index over long periods. The evidence on hedge funds is more mixed, partly because of data quality issues and partly because the top-performing funds are often closed to new investors by the time the evidence is clear.
The Practical Limits of Market Efficiency Research
Even a methodologically perfect study faces constraints that limit its real-world utility:
Data mining risk: With enough variables and time periods, patterns will appear by chance. The academic literature is littered with anomalies that disappeared as soon as they became widely known — the act of publication essentially arbitraged them away.
Capacity constraints: A strategy that earns 4% in excess returns with a $5 million portfolio may generate negligible alpha when scaled to $500 million. Market impact erodes edge rapidly.
Transaction costs and taxes: Academic studies often ignore or underestimate these. A strategy requiring monthly rebalancing of small-cap stocks looks very different before and after realistic friction costs.
Regime dependency: Many documented anomalies — value, momentum, small-cap premium — show strong period-dependency. What worked from 1963 to 1990 may not work from 1990 to 2025, and vice versa.
The disciplined response is not to abandon quantitative testing, but to hold the bar high: require large samples, long time horizons, out-of-sample validation, and economic plausibility before concluding that any strategy represents a genuine market inefficiency.
What Market Efficiency Testing Actually Tells Investors
The practical value of understanding how market efficiency is tested extends well beyond academic curiosity. For the ambitious investor or finance professional, it provides a critical filter:
- When someone claims a strategy beats the market, ask which risk-adjustment model they used and whether the excess returns survive after fees and transaction costs
- When reading fund marketing materials, ask whether the track record includes the full sample of comparable funds or just survivors
- When a back-test shows stunning results, ask whether the strategy parameters were chosen from the same data period being tested
- When a variable appears to predict returns, ask whether that relationship holds out-of-sample and across different market regimes
The S&P 500 returned approximately 10.7% per year on average over the past 50 years. Most active managers did not match it after fees. That is not because outperformance is impossible — it is because most tests of market-beating strategies fail under rigorous scrutiny. The ones that survive tend to involve either genuine information advantages, behavioural edges that markets are slow to arbitrage away, or risk exposures that most investors are unwilling to hold through the inevitable drawdowns.
Understanding market efficiency testing will not guarantee you find an edge. But it will stop you from fooling yourself into thinking you have one when you do not.
This article is for informational purposes only and does not constitute financial advice. Always consult a qualified financial professional before making investment decisions.
Frequently Asked Questions
What is the joint hypothesis problem in market efficiency testing?
The joint hypothesis problem means that any test of market efficiency simultaneously tests two things: whether the market is mispricing assets, and whether the risk model used to calculate expected returns is correct. If a strategy appears to earn excess returns, it could mean markets are inefficient — or it could mean the expected return model is wrong. This makes it impossible to definitively prove or disprove market efficiency from any single test.
What is the difference between statistical significance and economic significance in an event study?
Statistical significance tells you whether a result is unlikely to be due to random chance — typically measured by a T-statistic above 2.0 for 95% confidence. Economic significance tells you whether the effect is large enough to actually profit from after accounting for transaction costs, taxes, and execution friction. A result can be statistically significant but economically meaningless if the excess return is, say, 0.3% annually before costs.
Why is survivorship bias a major problem in hedge fund and mutual fund performance studies?
Survivorship bias occurs when performance databases only include funds that are still operating today, excluding funds that closed, failed, or merged. Since weaker-performing funds are disproportionately likely to close, any study that looks only at surviving funds will overstate the average performance of the group. A realistic assessment must include the full universe of funds that existed during the study period, regardless of whether they survived to the end.
Can low P/E stock strategies consistently beat the market?
Historical data — including studies covering U.S. equities from the late 1980s onward — suggests that low P/E portfolios have outperformed high P/E portfolios by approximately 3–5% per year on a risk-adjusted basis over long periods. However, this edge is not guaranteed to persist, varies significantly by time period and market regime, and requires careful accounting for transaction costs, rebalancing frequency, and the exclusion of negative-earnings stocks from the universe. Most researchers treat it as a persistent but cyclical anomaly rather than a guaranteed return premium.
About Zeebrain Editorial
Zeebrain publishes independent analysis of markets, investing, personal finance, and business. We disclose affiliate relationships, never accept payment for coverage, and fact-check all claims against primary sources. Read our editorial policy →
Disclaimer: Content on Zeebrain is for informational and educational purposes only and does not constitute financial advice or a recommendation to buy or sell any security. Always conduct your own research and consult a qualified financial adviser before making investment decisions. Past performance is not indicative of future results.
More from Business & Money
Related Guides
Keep exploring this topic
Investment Philosophy: Why Every Investor Needs One
Business & Money · investment philosophy · active investing
Active vs Passive Investing: Why Active Managers Keep Losing
Business & Money · passive investing · active investing
Federal Reserve interest rates: Impact on businesses and investments
Business & Money
Google & SpaceX: The Space Data Centre Deal That Changes Everything
Business & Money
Explore More Categories
Keep browsing by topic and build depth around the subjects you care about most.


