Can Generative AI Really Predict the Stock Market?

Key Takeaways

Advanced generative AI models can predict the market’s initial reaction to news surprisingly well – often above human forecasts and common technical signals.
The edge appears to fade as adoption rises: GPT-4 accuracy drops from about 62% (Q1 2024) to about 51% (Q4 2025), close to a coin-flip baseline.
AI looks most “tradable” where markets are least efficient: small caps, negative news, and complex information types (like insider transactions).
Transaction costs are the reality check: strong backtests can still fail in live trading once slippage, spreads, and constraints are included.

So… can it actually predict the market?

The short answer: it can often predict directional movement right after news hits, but that’s not the same thing as a plug-and-play trading strategy. This report evaluates directional accuracy across multiple models using 159,137 firm-headline-date observations (Oct 2021–May 2024). The charts below focus on what tends to get cited and linked: model accuracy, how it changes over time, and where performance is strongest.

1) Generative AI Market Prediction Accuracy

This table summarizes reported directional accuracy by model/method, alongside strategy-level risk metrics where available. It’s the clean “reference table” people typically cite when comparing AI approaches.

AI model / method	Accuracy rate	Strategy Sharpe ratio	Hit rate
GPT-5 Thinking	74.2%	2.97	93.3%
Gemini 2.5 Pro	71.2%	2.63	88.8%
GPT-4	58–74%	2.97	93.3%
GPT-3.5	56.1%	1.66	Not reported
Claude Sonnet 4	46.2%	Not reported	46.2%
Traditional AI (Average)	75%	1.29	67%
Human Analyst Forecasts	52–58%	0.85	55%
Random Guess Baseline	50%	0.00	50%

Table 1. Generative AI stock market prediction accuracy (reported results).

Here’s the same information in a cleaner visual. For methods reported as ranges (like GPT-4 and human analysts), the horizontal line shows the full range and the dot marks the midpoint.

The takeaway is separation: several AI approaches outperform common baselines on directional calls. But trading profit depends on execution – timing, liquidity, and costs.

2) AI Prediction Accuracy: 12-Month Trend

Does the edge persist – or fade?

This time-series chart is the reality check. It suggests that as AI tools become widely used, the predictive edge can compress. GPT-4 accuracy declines from roughly 62% (Q1 2024) to roughly 51% (Q4 2025), near a 50% baseline.

Quarterly Accuracy Trend — 2024–2025

Accuracy (%) by quarter. Values: Q1’24 62, Q2’24 59, Q3’24 57, Q4’24 56, Q1’25 54, Q2’25 53, Q3’25 52, Q4’25 51.

Figure. Quarterly accuracy trend (2024–2025). Accuracy declines from ~62% (Q1 2024) to ~51% (Q4 2025).

That pattern makes intuitive sense: if a signal becomes widely available, it can get priced in faster – turning an advantage into “just another input.”

Accelerating adoption eliminates information advantage: The most significant drops occurred during 2024, when ChatGPT reached peak adoption among retail and institutional traders. As AI-generated insights became universally accessible, markets incorporated this information into prices almost instantly, transforming what was once a valuable signal into merely another factor already reflected in market efficiency.

3) AI vs traditional signals (what it beats – and what it doesn’t)

Prediction method	Accuracy rate	Sharpe ratio	Average return
GPT-4 (Overnight News)	93.3%	2.97	34 bps/day
GPT-4 (Intraday News)	88.8%	2.63	50 bps/day
Sentiment Analysis (RavenPack)	65.0%	1.12	18 bps/day
RSI Signals	54.0%	0.51	11 bps/day
Moving Average Crossover	52.0%	0.42	8 bps/day
Analyst Recommendations	55-58%	0.85	15 bps/day
Buy & Hold (Market)	59.0%	0.32	6 bps/day

Table 2. AI vs traditional prediction methods (2024).

A practical way to read this research is as a benchmark test: how do AI-driven headline strategies compare to the tools traders already use (sentiment feeds, RSI, moving averages, analyst calls, and buy-and-hold)?

Accuracy by Prediction Method (2024)

Accuracy rate (%) across AI, sentiment, technical indicators, analyst calls, and buy-and-hold.

Figure. Accuracy by prediction method (2024). GPT-4 headline strategies outperform traditional technical indicators and analyst recommendations on directional accuracy.

Accuracy alone isn’t enough, so the next chart shows risk-adjusted performance (Sharpe ratio). This is where transaction costs and real-world constraints usually show up as the difference between “promising” and “tradeable.”

Sharpe Ratio by Prediction Method (2024)

Risk-adjusted performance comparison across AI, sentiment, technical indicators, and benchmarks.

Figure. Sharpe ratio by prediction method (2024). GPT-4 headline-driven strategies deliver substantially higher risk-adjusted returns than traditional indicators and passive benchmarks.

4) AI Accuracy by Market Condition – Where AI performs best

Performance isn’t uniform. The report shows stronger drift opportunities in places where markets are harder to arbitrage – especially small caps, negative news, and complex information types.

For context, here’s initial-reaction accuracy across the same segments. Notice how some categories can have modest initial accuracy but still show meaningful drift – which suggests slower price discovery rather than immediate overreaction.

Segment / info type	Initial reaction accuracy	Drift prediction (bps/day)	Sharpe ratio
Large-cap (>$10B)	91%	14	1.82
Mid-cap ($2-10B)	92%	28	2.45
Small-cap (<$2B)	94%	48	3.76
Positive news	91%	8	0.78
Negative news	95%	26	2.01
Neutral news	87%	4	0.32
Earnings reports	96%	6	0.95
Clinical trial results	94%	12	1.42
Insider transactions	89%	42	3.15
Conference presentations	87%	38	2.88

Table 3. Performance by segment and information type (2024).

Conclusion

Generative AI can predict the market’s first move after news better than many traditional baselines – at least in the tested windows. But the edge is not guaranteed: the time-series trend suggests compression as adoption rises, and transaction costs can erase much of the paper advantage. In practice, these tools tend to shine as decision support – summarizing news, stress-testing theses, and surfacing risk – more than as standalone trade signals.

FAQ

Does high accuracy mean you can profitably trade it? Not necessarily. Accuracy doesn’t include transaction costs, slippage, shorting constraints, or how fast the market prices in the news.

Why does accuracy decline over time in the trend chart? As more market participants use similar AI tools, the market can incorporate that information faster – reducing any informational advantage.

Where is AI most useful according to these results? Small caps, negative-news setups, and complex information categories – where underreaction appears more likely.

How should a typical investor use these tools? As an analysis layer: summarizing news, mapping scenarios, comparing catalysts, and spotting risks – rather than treating the output as a trade instruction.

Sources

Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models (2025)

AI-Based Stock Trading: Which Gen AI Tool Is Better (2025)

2025’s Highest Profit Factor: The Top 3 AI Trading Agents (2025)

Does generative AI facilitate investor Trading? Early evidence from ChatGPT outages (2025)