Evidence-based investing research
Value Investing Strategy (Strategy Overview)
Allocations for June 2026 (Final)
Cash TLT LQD SPY
Momentum Investing Strategy (Strategy Overview)
Allocations for June 2026 (Final)
1st ETF 2nd ETF 3rd ETF

Investing Expertise

Can analysts, experts and gurus really give you an investing/trading edge? Should you track the advice of as many as possible? Are there ways to tell good ones from bad ones? Recent research indicates that the average “expert” has little to offer individual investors/traders. Finding exceptional advisers is no easier than identifying outperforming stocks. Indiscriminately seeking the output of as many experts as possible is a waste of time. Learning what makes a good expert accurate is worthwhile.

Asset Class ETF Picking by AI Panel

Is the evolving set of artificial intelligence (AI) platforms based on large language models interesting as asset class allocation advisors? Are they monolithic, or diverse? As a simple exploration, we pose to each of Grok, ChatGPT, Claude, Perplexity and Gemini the following two prompts regarding the 12 asset class exchange-traded funds (ETF) considered in the Simple Asset Class ETF Value Strategy (SACEVS) and the Simple Asset Class ETF Momentum Strategy (SACEMS):

  1. Using all training and real-time data available to you, please provide your unique view on whether investing in each of the following ETFs is favorable, neutral or unfavorable for the balance of 2026: SPY, IWM, QQQ, EFA, EEM, TLT, LQD, BIL, EMB, VNQ, GLD and DBC. Do not provide any explanations.
  2. Which three of your ETFs with favorable views work best together as a diversified set?

We then compare and contrast results from AI panel members. Using responses to the two prompts as posed in late May 2026, we find that: Keep Reading

AI Panel Inflation Projections

Is the evolving set of artificial intelligence (AI) platforms based on large language models interesting as economic forecasters? Are they monolithic, or diverse? As a simple exploration, we pose to each of Grok, ChatGPT, Claude, Perplexity and Gemini the following prompt regarding the 12-month trailing U.S. consumer inflation rate over the next 12 months:

Applying all available training data and real-time updates, please project the 12-month trailing U.S. consumer inflation rate for each of the next 12 months. Do not provide any explanations.

We then compare and contrast results from AI panel members. Using responses to the prompt as posed in early June 2026, we find that: Keep Reading

Reassessing Use of Machine Learning in Stock Portfolio Construction

Are the conclusions of recent studies that machine learning models can materially boost risk-adjusted stock portfolio performance reliable? In their May 2026 preliminary paper entitled “All Sizzle, No Steak! How Robust Are Financial Machine Learning Results Really?”, Kristian-Alexander Janisch, Johannes Dreyer, Fuad Mehraliyev and Kristian Sund retest elastic net and neural network models as applied to stock portfolio construction in recent seminal papers with alternative samples and extended sample periods. They also investigate the incremental value of macroeconomic variables compared to firm characteristics as model inputs. Using samples of macroeconomic variables, firm characteristics, technical indicators and stock returns from prior studies, all extended through 2023, they find that:

Keep Reading

U.S. Treasuries Selection by AI Panel

Is the evolving set of artificial intelligence (AI) platforms based on large language models interesting as U.S. Treasuries selection advisors? Are they monolithic, or diverse? As a simple exploration, we pose to each of Grok, ChatGPT, Claude, Perplexity and Gemini the following prompt regarding the seven U.S. Treasuries exchange-traded funds (ETF) considered in “Treasuries ETFs Momentum Strategy Update/Extension”:

Using all training and real-time data available to you, please provide your unique view on whether investing in each of the following ETFs is favorable, neutral or unfavorable for the balance of 2026: BIL, SHY, VTIP, IEI, IEF, TIP and TLT. Do not provide any explanations.

We then compare and contrast results. Using responses to the prompt as posed in late May 2026, we find that: Keep Reading

Whale Wisdom

Does the informativeness of markets come from wisdom of the crowd or the wisdom of a few? If a few, who are they? In their April 2026 paper entitled “Beyond the Wisdom of the Crowd: Concentrated Informed Trading in Earnings Prediction Markets” Wan Chu Cheong and Ane Tamayo examine who makes Polymarket earnings prediction markets (whether firms will beat earnings forecasts) informative. Specifically, for each market, they:

  • Rank each wallet (trader) by the value of their net positions.
  • Compare the predictive accuracy of top-ranked traders (whales) to that of other traders.

To ensure market liquidity, they focus on 435 earnings prediction markets created since September 2025, which involve median 184 traders and $16,665 volume. They employ all 435 markets to assess prediction accuracy and 381 markets matched to stock prices for tests involving equity returns. Using wallet addresses, timestamps, trade directions (buy or sell) and dollar values, predicted outcomes (Yes or No) and prices for 309,574 Polymarket trades by 21,083 traders, actual firm earnings beats/misses and matched stock returns as available during September 2025 through February 2026, they find that:

Keep Reading

How Are AI-powered ETFs Doing?

How do exchange-traded-funds (ETF) that employ artificial intelligence (AI) to pick assets perform? To investigate, we consider ten such ETFs, eight of which are currently available:

We use SPDR S&P 500 ETF Trust (SPY) for comparison, though it is not conceptually matched to some of the ETFs. We focus on monthly return statistics, along with compound annual growth rates (CAGR) and maximum drawdowns (MaxDD). Using monthly total returns for the ten AI-powered ETFs and SPY as available through April 2026, we find that: Keep Reading

Prediction Markets Are Better than Humans as Earnings Analysts?

Are prediction markets better at forecasting firm earnings than professional analysts? In their April 2026 paper entitled “Beating the Earnings Game: Why Do Prediction Markets Outperform Professional Analysts?”, Daniel Rabetti, Jiaqi Shao and Che Zhang investigate whether and, if so, why a blockchain-based prediction market such as Polymarket outperforms professional analysts in forecasting U.S. stock earnings. The earnings predictions of this market are public and unchangeable contracts, taking the form:

“Will [Company] beat earnings for [Quarter] [Fiscal Year]?”

relative to analyst consensus as of contract creation date. Using data for 469 Polymarket firm-quarter earnings beat contracts, corresponding analyst earnings forecast data and associated daily stock prices during September 2025 through February 2026, they find that:

Keep Reading

Wisdom of a Few?

Does the empirical accuracy of prediction markets derive from crowd wisdom or an informed few? In their April 2026 paper entitled “Prediction Market Accuracy: Crowd Wisdom or Informed Minority?”, Roberto Cram, Yunhan Guo, Theis Jensen and Howard Kung investigate why prediction markets exhibit accuracy. Specifically, they compare the distribution of actual trade directions with a hypothetical distribution of random trades, and thereby classify traders as:

  • Market makers, who provide liquidity by posting limit orders.
  • Skilled traders, winners whose gains cannot be attributed to chance.
  • Other winners and other losers, who respectively earn positive and negative returns but whose performance is not statistically significant.
  • Persistent losers, who consistently and significantly lose.

Using the Polymarket universe of transactions and accounts with at least 10 trades across propositions created after the beginning of January 2023 and resolved by the end of December 2025, they find that:

Keep Reading

Managing AI Researchers

Can artificial intelligence (AI) agents based on a large language model (LLM) carry most of the load in strategic asset allocation? In their April 2026 paper entitled “The Self-Driving Portfolio: Agentic Architecture for Institutional Asset Management”, Andrew Ang, Nazym Azimbayev and Andrey Kim present a 6-step strategic asset allocation system in which:

  1. A macro agent identifies the economic regime (expansion, late-cycle, recession or recovery).
  2. Asset class agents each assigned one class run in parallel to estimate respective expected returns, expected volatilities and confidence levels.
  3. A covariance agent generates an asset class covariance matrix.
  4. Portfolio construction agents each independently employ Step 2 and 3 outputs to proposed a portfolio based on an assigned method (such as equal weight, inverse volatility, mean-variance optimization or risk parity), including:
    • A researcher agent to propose novel portfolio construction methods.
    • An adversarial agent to uncover unconventional allocation ideas.
  5. Multiple agents review all proposed portfolios and vote on them.
  6. A chief investment officer agent scores, selects and combines surviving proposed portfolios using an ensemble of seven combination methods. This agent then summarizes a final recommendation/reasoning/dissenting views.

They include a meta-agent that compares forecasted and realized returns and rewrites agent scripts to improve future performance. They specify each agent in this system via a description, a set of scripts, a collection of skills and a structured output. An Investment Policy Statement (specifying asset class universe, objective, tracking error) constrains the AI agents. Overall, this system compresses days or weeks of human work into minutes. Based on prior research and experience with LLM-based AI agents, they observe that: Keep Reading

Differences in AI and Human Financial Research

When assigned to perform the same empirical financial research, do the findings of human researchers and large language models (LLM) as a kind of artificial intelligence (AI) differ? If so, why? In their March 2026 paper entitled “AI ‘Errors'”, Wenqian Huang, Albert Menkveld and Shihao Yu compare outcomes for 158 AI model iterations (agents) to those from prior research for 164 independent human teams employing the same sample of 720 million equity index futures trades to test the same six hypotheses. They choose the GPT-5.2 LLM to construct AI agents, with variability in outcomes driven by its probabilistic decision-making. They further examine which types of research decisions drive any differences. Using outcomes from the AI and human researcher test runs, they find that: Keep Reading

Research Finder

Search 1,200+ research articles