An experiment in what an agentic-AI research framework can do — published in full so you can check its work, not a tip service to follow. Generic research and education, not advice, not a personal recommendation, not FCA-authorised. Hypothetical book, not real money. Capital at risk.
Portfolio Lab · Research Lab
Portfolio Lab — transparent, low-cost portfolios with real out-of-sample performance, net of all costs. View the portfolios →
Evidence note. Our own data and analysis of this signal — the graded verdict evolves as more data accrues. Research and education about a process, not investment advice.

← All signal studies

Quality / Gross Profitability — is it still true?

OVERLAY ONLY Quality
C+SIGNAL GRADE
OVERLAY ONLY

Real over 60 years (t=3.2) — but no single decade is significant, the 2010s were flat, and the live long-only quality ETF LOST to the S&P by 0.7%/yr over 13 years. We use it to exclude junk and up-weight cheap-and-quality — never as standalone alpha, never a reason to overpay.

Effect size
+3.1%/yr long-short, full sample
Significance
only the pooled 60-yr sample clears t=2; no decade does
Regime-robust
near-flat in the 2010s (t=0.8)
Benchmark-robust
live long-only QUAL underperformed the S&P
Durability
a genuine low-turnover, long-horizon factor
Our-universe fit
cheap, long-only, low-turnover — a natural overlay
How we tested it
Data
Ken French RMW profitability factor (monthly) + live QUAL vs SPY total returns
Sample
1963–2025 factor; 13 years of the live long-only ETF
Benchmark
the market (SPY) — does the long-only leg beat it?
Method
annualised premium + t-stats by decade; live after-fee ETF comparison

The short version

Buying high-quality, profitable companies has beaten the market over the very long run — but only barely, and not reliably: in the last decade the edge was flat, and a real-world "quality" fund actually lagged the market over 13 years. So quality is worth using to avoid junk and to lean toward cheap-and-good companies — but it's not a money-maker on its own, and never a reason to overpay.

Abstract

We tested whether the profitability/quality premium still predicts returns — especially recently, and in a form we can actually use (long-only). Using Ken French's own profitability factor (RMW), 1963–2025, pulled ourselves, plus the live QUAL ETF: the premium is positive on average (full-sample +3.1%/yr, t=3.2) but is not statistically significant in any single decade, was near-flat in the 2010s (+1.4%/yr, t=0.8), and the live long-only QUAL ETF underperformed the S&P 500 by ~0.7pp/yr over 13 years. Conclusion: we keep quality, but only as a junk-exclusion screen and a "cheap + quality" up-weight / thesis-durability check — not as a standalone return signal, and never as a reason to overpay.

1. The claim (and why it matters to us)

High gross-profits-to-assets (Novy-Marx) and the broader "quality" bundle (profitable, stable, low-leverage, high-payout) are said to predict higher risk-adjusted returns. If true, it's an attractive signal for us: low-turnover (annual accounting data), long-side-viable, cheap to apply, and a natural drawdown cushion for a retail book.

2. What the literature claims — and our scepticism

  • Novy-Marx (2013, JFE): gross-profits-to-assets long-short ≈ 0.31%/mo, ~same power as value; quality+value combined has a much higher Sharpe.
  • Asness/Frazzini/Pedersen "Quality Minus Junk" (2019): information ratio >1, positive in 23/24 countries. (AQR sells quality strategies — an incentive worth naming.)
  • Fama-French (2015): profitability factor RMW ≈ 0.25–0.34%/mo; Hou-Xue-Zhang (2020): GPA survives the replication crisis at 0.38%/mo, t=2.62 (one of the few that does).
  • Our scepticism: all of these are long-short, mostly US, mostly samples ending 2010–2016, and McLean-Pontiff (2016) put average post-publication decay at ~50%. None of them answer "does the long-only leg still beat the market now?"

3. Our own analysis ← the heart of the report

Data we pulled: Ken French Data Library, F-F_Research_Data_5_Factors_2x3 (monthly RMW), and F-F_Momentum_Factor, 1963→May 2025; plus yfinance total returns (dividends reinvested) for QUAL and SPY. Method: mean monthly return, annualised (×12), t = mean ÷ (std/√n), bucketed by period. Fully reproducible from the Dartmouth zip.

RMW — Robust-minus-Weak profitability (long-short), our pull:

Period Premium / yr t-stat
1963–1989 +1.9% 1.8
1990–1999 +2.3% 1.2
2000–2009 +8.3% 1.9
2010–2019 +1.4% 0.8
2020–2025 +3.8% 1.0
Full sample +3.1% 3.2

Live long-only check: QUAL (iShares MSCI USA Quality) vs SPY, 2013-07 → 2026-05 (12.9y): QUAL +13.63%/yr vs SPY +14.29%/yr over the same window → −0.66pp/yr (quality underperformed). Context (same pull): Value (HML) was negative in the 2010s (−2.4%/yr) and post-2010 overall; momentum post-2010 +3.6%/yr (t=1.2, also sub-significant), though the MTUM ETF beat SPY by +1.45pp/yr.

4. Findings

  • The profitability premium is real over the full 60 years (t=3.2) but the significance comes from poolingno individual decade clears t=2.
  • It was weakest in the most recent full decade (2010s: +1.4%/yr, t=0.8 — effectively flat).
  • The live, long-only, after-fee version (QUAL) did not beat the market over 13 years.
  • Quality held up better than value (which was negative in the 2010s) but neither was a significant standalone winner recently.

5. Conclusion — keep, but as an overlay (and why)

We keep quality, demoted to a confirming overlay, because the evidence does not support it as standalone alpha at our scale, recently, long-only. Its real, defensible uses: 1. Junk-exclusion — drop low/negative gross-profits-to-assets, high leverage, unstable earnings. Avoiding junk is the most reliable long-side use of the factor. 2. "Cheap + quality" up-weight — the single best-evidenced combination (Novy-Marx); quality and value are genuinely independent families (see ../families_and_independence.md), so this is real confluence, not double-counting. 3. Durability check / drawdown cushion — this is the bridge to the house view that a trend (or long-run average) is only trustworthy if backed by quality. Quality's job here is to confirm a thesis will hold, not to time an entry.

6. Limitations & open questions

  • Factor data (a diversified long-short portfolio) is not single-name selection skill.
  • QUAL carries a fee (~0.15%) and MSCI's specific quality construction — not a pure GPA sort.
  • We have not yet run a single-name gross-profits-to-assets sort on our HL-tradeable universe, or tested the quality+cheap combination directly on names we can hold.
  • RMW is US; UK/HL applicability untested by us.

7. How we would extend this

Build a single-name gross-profits-to-assets sort on our HL universe via the verification harness; test quality-AND-cheap directly (does the combination beat either alone, recently, long-only, net of cost?). That converts "we trust the overlay" into "we measured the overlay on our own names."

Sources

  • Pulled ourselves: Ken French Data Library — F-F 5-Factor (RMW), HML, Momentum (monthly, 1963→2025); yfinance total returns QUAL, SPY, MTUM.
  • Cited: Novy-Marx (2013, JFE 108(1)); Asness/Frazzini/Pedersen "Quality Minus Junk" (2019, RAS 24); Fama-French (2015, JFE 116(1)); Hou-Xue-Zhang "Replicating Anomalies" (2020, RFS 33); McLean-Pontiff (2016, J. Finance).

Changelog

  • v1.0 (2026-05-31) — initial study; Ken French RMW by decade + QUAL/SPY live check.

Common questions

Do quality stocks beat the market?

Over 60 years the profitability factor is real (about +3.1%/yr, t=3.2), but the strength comes from pooling — no single decade is statistically significant, the 2010s were flat, and the live long-only quality ETF actually trailed the S&P by ~0.7%/yr over 13 years. Quality is a sound junk-filter and a 'cheap + quality' up-weight, not standalone alpha.

Is the quality factor still working?

Weakly. It held up better than value (negative in the 2010s), but neither was a significant standalone winner recently. We keep quality as a confirming overlay, not a reason to buy on its own.

Our own data and analysis — sources and dates below. Numbers are labelled gross/net and by sample in the text. Research and education, not advice.