Can Generative AI Really Predict the Stock Market?

Can Generative AI Really Predict the Stock Market?

By Daniela Pedroza29 min read Uncategorized

Key Takeaways

  • Advanced generative AI models can predict the market’s initial reaction to news surprisingly well – often above human forecasts and common technical signals.
  • The edge appears to fade as adoption rises: GPT-4 accuracy drops from about 62% (Q1 2024) to about 51% (Q4 2025), close to a coin-flip baseline.
  • AI looks most “tradable” where markets are least efficient: small caps, negative news, and complex information types (like insider transactions).
  • Transaction costs are the reality check: strong backtests can still fail in live trading once slippage, spreads, and constraints are included.

So… can it actually predict the market?

The short answer: it can often predict directional movement right after news hits, but that’s not the same thing as a plug-and-play trading strategy. This report evaluates directional accuracy across multiple models using 159,137 firm-headline-date observations (Oct 2021–May 2024). The charts below focus on what tends to get cited and linked: model accuracy, how it changes over time, and where performance is strongest.

1) Generative AI Market Prediction Accuracy

This table summarizes reported directional accuracy by model/method, alongside strategy-level risk metrics where available. It’s the clean “reference table” people typically cite when comparing AI approaches.

AI model / method Accuracy rate Strategy Sharpe ratio Hit rate
GPT-5 Thinking 74.2% 2.97 93.3%
Gemini 2.5 Pro 71.2% 2.63 88.8%
GPT-4 58–74% 2.97 93.3%
GPT-3.5 56.1% 1.66 Not reported
Claude Sonnet 4 46.2% Not reported 46.2%
Traditional AI (Average) 75% 1.29 67%
Human Analyst Forecasts 52–58% 0.85 55%
Random Guess Baseline 50% 0.00 50%
Table 1. Generative AI stock market prediction accuracy (reported results).

Here’s the same information in a cleaner visual. For methods reported as ranges (like GPT-4 and human analysts), the horizontal line shows the full range and the dot marks the midpoint.

The takeaway is separation: several AI approaches outperform common baselines on directional calls. But trading profit depends on execution – timing, liquidity, and costs.

2) AI Prediction Accuracy: 12-Month Trend

Does the edge persist – or fade?

This time-series chart is the reality check. It suggests that as AI tools become widely used, the predictive edge can compress. GPT-4 accuracy declines from roughly 62% (Q1 2024) to roughly 51% (Q4 2025), near a 50% baseline.

Quarterly Accuracy Trend — 2024–2025

Accuracy (%) by quarter. Values: Q1’24 62, Q2’24 59, Q3’24 57, Q4’24 56, Q1’25 54, Q2’25 53, Q3’25 52, Q4’25 51.

Quarterly Accuracy Trend — 2024-2025 Accuracy (%) Quarter 45 50 55 60 65 Q1 2024 Q2 2024 Q3 2024 Q4 2024 Q1 2025 Q2 2025 Q3 2025 Q4 2025
Figure. Quarterly accuracy trend (2024–2025). Accuracy declines from ~62% (Q1 2024) to ~51% (Q4 2025).

That pattern makes intuitive sense: if a signal becomes widely available, it can get priced in faster – turning an advantage into “just another input.”

Accelerating adoption eliminates information advantage: The most significant drops occurred during 2024, when ChatGPT reached peak adoption among retail and institutional traders. As AI-generated insights became universally accessible, markets incorporated this information into prices almost instantly, transforming what was once a valuable signal into merely another factor already reflected in market efficiency.

3) AI vs traditional signals (what it beats – and what it doesn’t)

Prediction method Accuracy rate Sharpe ratio Average return
GPT-4 (Overnight News) 93.3% 2.97 34 bps/day
GPT-4 (Intraday News) 88.8% 2.63 50 bps/day
Sentiment Analysis (RavenPack) 65.0% 1.12 18 bps/day
RSI Signals 54.0% 0.51 11 bps/day
Moving Average Crossover 52.0% 0.42 8 bps/day
Analyst Recommendations 55-58% 0.85 15 bps/day
Buy & Hold (Market) 59.0% 0.32 6 bps/day
Table 2. AI vs traditional prediction methods (2024).

A practical way to read this research is as a benchmark test: how do AI-driven headline strategies compare to the tools traders already use (sentiment feeds, RSI, moving averages, analyst calls, and buy-and-hold)?

Accuracy by Prediction Method (2024)

Accuracy rate (%) across AI, sentiment, technical indicators, analyst calls, and buy-and-hold.

Accuracy by Prediction Method (2024) 0 20 40 60 80 100 Accuracy rate (%) Buy & Hold (Market) 59.0% Analyst Recommendations 55.0% RSI Signals 54.0% Moving Average Crossover 52.0% Sentiment Analysis (RavenPack) 65.0% GPT-4 (Intraday News) 88.8% GPT-4 (Overnight News) 93.3%
Figure. Accuracy by prediction method (2024). GPT-4 headline strategies outperform traditional technical indicators and analyst recommendations on directional accuracy.

Accuracy alone isn’t enough, so the next chart shows risk-adjusted performance (Sharpe ratio). This is where transaction costs and real-world constraints usually show up as the difference between “promising” and “tradeable.”

Sharpe Ratio by Prediction Method (2024)

Risk-adjusted performance comparison across AI, sentiment, technical indicators, and benchmarks.

Sharpe Ratio by Prediction Method (2024) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Sharpe ratio Buy & Hold (Market) 0.32 Analyst Recommendations 0.85 RSI Signals 0.51 Moving Average Crossover 0.42 Sentiment Analysis (RavenPack) 1.12 GPT-4 (Intraday News) 2.63 GPT-4 (Overnight News) 2.97
Figure. Sharpe ratio by prediction method (2024). GPT-4 headline-driven strategies deliver substantially higher risk-adjusted returns than traditional indicators and passive benchmarks.

4) AI Accuracy by Market ConditionWhere AI performs best

Performance isn’t uniform. The report shows stronger drift opportunities in places where markets are harder to arbitrage – especially small caps, negative news, and complex information types.

For context, here’s initial-reaction accuracy across the same segments. Notice how some categories can have modest initial accuracy but still show meaningful drift – which suggests slower price discovery rather than immediate overreaction.

Segment / info type Initial reaction accuracy Drift prediction (bps/day) Sharpe ratio
Large-cap (>$10B) 91% 14 1.82
Mid-cap ($2-10B) 92% 28 2.45
Small-cap (<$2B) 94% 48 3.76
Positive news 91% 8 0.78
Negative news 95% 26 2.01
Neutral news 87% 4 0.32
Earnings reports 96% 6 0.95
Clinical trial results 94% 12 1.42
Insider transactions 89% 42 3.15
Conference presentations 87% 38 2.88
Table 3. Performance by segment and information type (2024).

Conclusion

Generative AI can predict the market’s first move after news better than many traditional baselines – at least in the tested windows. But the edge is not guaranteed: the time-series trend suggests compression as adoption rises, and transaction costs can erase much of the paper advantage. In practice, these tools tend to shine as decision support – summarizing news, stress-testing theses, and surfacing risk – more than as standalone trade signals.

FAQ

Does high accuracy mean you can profitably trade it? Not necessarily. Accuracy doesn’t include transaction costs, slippage, shorting constraints, or how fast the market prices in the news.

Why does accuracy decline over time in the trend chart? As more market participants use similar AI tools, the market can incorporate that information faster – reducing any informational advantage.

Where is AI most useful according to these results? Small caps, negative-news setups, and complex information categories – where underreaction appears more likely.

How should a typical investor use these tools? As an analysis layer: summarizing news, mapping scenarios, comparing catalysts, and spotting risks – rather than treating the output as a trade instruction.

Sources

Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models (2025)

AI-Based Stock Trading: Which Gen AI Tool Is Better (2025)

2025’s Highest Profit Factor: The Top 3 AI Trading Agents (2025)

Does generative AI facilitate investor Trading? Early evidence from ChatGPT outages (2025)