Portfolio Correlation Matrix
VTI + QQQ + VOO isn’t diversification — it’s concentration with extra steps. See how correlated your holdings really are.
Asset Returns
3/10 assetsEnter monthly returns (%) for each asset. Use commas or new lines to separate values. All assets should cover the same time period.
Enter percentage values (e.g. "5" for 5%, "-3" for -3%)
Enter percentage values (e.g. "5" for 5%, "-3" for -3%)
Enter percentage values (e.g. "5" for 5%, "-3" for -3%)
r = Σ((xi−x̄)(yi−ȳ)) / √(Σ(xi−x̄)² × Σ(yi−ȳ)²)
Portfolio Correlation: The Complete Guide
Everything you need to know about correlation in investing, why it matters for diversification, and how to build a truly diversified portfolio.
Correlation in investing measures how two assets move relative to each other over time. It is expressed as a coefficient between -1.0 and +1.0, where +1.0 means two assets move perfectly in lockstep, 0 means no linear relationship, and -1.0 means they move in exactly opposite directions.
The standard measure is the Pearson correlation coefficient, calculated from historical return series:
r = Σ((xi−x̄)(yi−ȳ)) / √(Σ(xi−x̄)² × Σ(yi−ȳ)²)
Why it matters: Correlation is the foundation of modern portfolio theory (MPT), introduced by Harry Markowitz in 1952. The key insight is that a portfolio's total risk is not simply the weighted average of individual asset risks — it depends critically on how those assets correlate with each other.
- Low or negative correlations — When one asset zigs and the other zags, portfolio volatility drops without necessarily reducing expected returns. This is the "free lunch" of diversification.
- High positive correlations — When assets move together (correlation > 0.8), combining them provides little diversification benefit. You're just adding more of the same risk under a different ticker symbol.
- Portfolio variance formula — For two assets, portfolio variance = w²₁σ²₁ + w²₂σ²₂ + 2w₁w₂σ₁σ₂ρ₁₂. The ρ (correlation) term determines whether combining assets reduces or barely changes total risk.
Without understanding correlation, investors often build portfolios that feel diversified (five different funds!) but are actually concentrated in the same risk factor (U.S. large-cap equities). A correlation matrix is the tool that makes this hidden concentration visible.
This is one of the most common mistakes in retail investing. VTI (Vanguard Total Stock Market), QQQ (Nasdaq-100), and VOO (Vanguard S&P 500) sound different but are heavily overlapping. Their historical pairwise correlations typically exceed 0.95, meaning they move almost identically.
Why the overlap is so extreme:
- VOO tracks the S&P 500, which represents about 80% of the total U.S. stock market by capitalization. VTI tracks the entire U.S. market, so roughly 80% of VTIis VOO. The remaining 20% (mid-cap and small-cap stocks) also correlate highly with large-caps over most periods.
- QQQ tracks the Nasdaq-100, which is dominated by mega-cap tech companies (Apple, Microsoft, Nvidia, Amazon, Meta, Alphabet). These same companies are the largest holdings in both VOO and VTI due to market-cap weighting. Apple alone can represent 7%+ in all three ETFs.
- Market-cap weighting concentrates risk — In all three ETFs, the top 10 holdings account for 30–50% of the fund. Since those top 10 are largely the same companies, buying all three ETFs just means you own the same mega-caps three times over.
True diversification requires adding asset classes that have structurally different return drivers: international equities (VXUS), bonds (BND, TLT), commodities (GLD, DBC), REITs, or alternatives. A 60/40 stock-bond portfolio, while simple, is far more diversified than a VTI/QQQ/VOO combination because stock-bond correlations have historically been near zero or negative.
Correlation and R-squared are closely related but answer different questions, and confusing them can lead to misinterpretation of portfolio risk.
Correlation (r)
- Measures the direction and strength of the linear relationship between two assets.
- Ranges from -1 to +1. A correlation of -0.5 and +0.5 have the same strength but opposite directions.
- Tells you how two assets move together — do they go up and down at the same time (positive), opposite times (negative), or independently (near zero)?
R-squared (r²)
- The square of the correlation coefficient. Ranges from 0 to 1 (no direction information).
- Measures the proportion of variance in one asset's returns that is explained by the other asset's returns.
- An R-squared of 0.81 (from a correlation of 0.9 or -0.9) means 81% of one asset's return variability is "explained" by the other.
Practical example: If two assets have a correlation of -0.7, R-squared is 0.49. The correlation tells you they tend to move in opposite directions (great for hedging). R-squared tells you that 49% of one's variance is explained by the other. Both are useful, but for portfolio construction, correlation is more informative because it preserves the directional information that matters for diversification.
When to use each: Use correlation for building diversified portfolios (you need to know the direction). Use R-squared when evaluating how closely a fund tracks its benchmark (e.g., an index fund with R-squared of 0.99 tracks the index almost perfectly).
Most investors dramatically overestimate how diversified their portfolios are. Here are the most common correlation traps:
- Owning multiple U.S. large-cap ETFs — As discussed, VTI, VOO, SPY, IVV, and QQQ all share the same mega-cap tech companies. Holding three of these is essentially a triple-leveraged bet on U.S. large-cap equities, not diversification. Correlations between these pairs typically exceed 0.95.
- Sector ETFs within the same market — XLK (Technology), XLC (Communication Services), and XLY (Consumer Discretionary) have high correlations because they share top holdings. Amazon is in XLY, Meta and Alphabet are in XLC, and Microsoft and Apple are in XLK. At the index level, these "different sectors" are driven by the same handful of companies.
- Growth fund stacking — Combining ARKK, QQQ, and a growth mutual fund often produces correlations above 0.85. Growth funds tend to be concentrated in the same themes (AI, SaaS, semiconductors), so they crash and rally together.
- Ignoring factor exposure — A "value" fund and a "dividend" fund often have high correlation because both are tilted toward the same factor (cheap, mature companies). Similarly, "quality" and "low volatility" strategies tend to hold overlapping names.
- International fund overlap — An "international" fund (VXUS) and an "emerging markets" fund (VWO) are not independent — VWO is a subset of VXUS. Their correlation is typically 0.85+.
- Confusing number of holdings with diversification — VTI holds 4,000+ stocks, but market-cap weighting means the top 10 stocks drive most of the returns. Holding 4,000 stocks that all correlate 0.6+ with each other provides far less diversification than holding 10 assets with an average correlation of 0.2.
The fix: Always check the correlation matrix before adding a new position. If its correlation with your existing holdings is above 0.7, it's adding concentration, not diversification.
Negative correlation means two assets tend to move in opposite directions. When one goes up, the other tends to go down. This is the most powerful tool for reducing portfolio volatility and is the basis of hedging strategies.
Key examples of negative correlations:
- Stocks vs. long-term Treasury bonds — Historically, the S&P 500 and 20+ year Treasury bonds (TLT) have had a correlation near -0.3 to -0.5, especially during equity sell-offs. When stocks crash, investors flee to safety, driving Treasury prices up. This is why the classic 60/40 portfolio works — bonds cushion the blow during stock market downturns.
- Gold vs. the U.S. dollar — Gold tends to rise when the dollar weakens, creating a natural hedge for dollar-denominated portfolios. The correlation is typically around -0.3 to -0.5.
- VIX products vs. equities — The VIX (volatility index) spikes when stocks plummet, creating a strong negative correlation. However, VIX products have significant carry costs that erode returns over time, making them expensive hedges.
The math of negative correlation: Consider two assets, each with 15% expected return and 20% volatility. If their correlation is +0.9, the portfolio has 19.5% volatility — barely any reduction. But if their correlation is -0.5, the portfolio volatility drops to 10% — half the individual risk with the same expected return. That's the power of negative correlation.
Practical limits: Truly negative correlations are rare in practice among return-generating assets. Most asset classes have slightly positive or near-zero correlations during normal markets. The negative correlations that do exist (like stocks vs. bonds) tend to be strongest during crisis periods — exactly when you need them most, but not guaranteed to persist. The 2022 experience, when both stocks and bonds fell simultaneously, reminded investors that even reliable negative correlations can break down when inflation drives both assets lower.
Yes, and this is one of the most important — and frustrating — realities of portfolio management. Correlations are not static. They tend to increase sharply during market crises, precisely when diversification matters most.
The "correlation spike" phenomenon:
- Normal markets — Asset class correlations tend to be moderate and relatively stable. International equities might show a 0.6 correlation with U.S. stocks, and high-yield bonds might show a 0.5 correlation.
- Crisis markets — During events like the 2008 financial crisis, COVID crash (March 2020), or the 2022 rate shock, correlations across almost all risk assets spiked toward 1.0. Stocks, corporate bonds, real estate, commodities, and even some "alternative" investments all fell together. Only true safe havens (U.S. Treasuries, cash) maintained their diversification benefit.
- 2022 was a wake-up call — For the first time in decades, both stocks and long-term bonds fell significantly in the same year. The stock-bond correlation, which had been negative for 20+ years, turned positive as inflation and rising rates hurt both asset classes simultaneously. This broke the 60/40 portfolio assumption.
Why it happens: During crises, investors sell whatever they can (not just what they should). Margin calls force liquidation across all positions. Panic is indiscriminate. The "flight to safety" means everything except the safest assets becomes positively correlated. This is sometimes called contagion or correlation breakdown.
What to do about it:
- Use stress-test correlations, not just average correlations. Look at how your portfolio would perform using crisis-period correlations (2008, 2020, 2022).
- Hold truly uncorrelated assets — Cash and short-term Treasuries maintain their diversification benefit even in extreme stress. Gold has also held up in many (but not all) equity crises.
- Don't rely on historical correlations exclusively. A correlation of 0.3 in normal markets might become 0.8 in a crisis. Design your portfolio to survive the higher number, not the lower one.
A correlation matrix is a diagnostic tool — it shows you where concentration risk exists. Here's a systematic approach to improving diversification based on your matrix results:
Step 1: Identify redundant pairs
- Any pair with correlation above 0.8 is essentially duplicative. You're paying two expense ratios for the same exposure. Consider keeping the one with lower fees and better tax efficiency.
- If your entire matrix is "hot" (mostly red/high correlation), your portfolio is concentrated in a single risk factor, regardless of how many tickers you hold.
Step 2: Add structurally different asset classes
- Fixed income — Intermediate-term bonds (BND, AGG) or Treasury bonds (TLT, GOVT) have historically low or negative correlation with equities.
- International equities — VXUS, IXUS, or EFA provide geographic diversification. While correlations with U.S. stocks have increased over decades (globalization), they still provide meaningful diversification, especially emerging markets.
- Real assets — Gold (GLD, IAU), commodities (DBC, GSG), and REITs (VNQ) have different return drivers than stocks and bonds. Gold in particular tends to shine during inflationary periods when stocks and bonds both struggle.
- TIPS (Treasury Inflation-Protected Securities) — Low correlation with nominal bonds and moderate correlation with equities, providing a unique inflation hedge.
Step 3: Target an average pairwise correlation below 0.4
This is the sweet spot for meaningful diversification. It does not mean every pair must be below 0.4 — some pairs (like two equity funds) will naturally be higher. But theaverage across all pairs should ideally stay below 0.4 to achieve genuine portfolio-level risk reduction.
Step 4: Rebalance periodically
Over time, the best-performing assets grow to dominate the portfolio, increasing concentration and correlation. Annual or quarterly rebalancing back to target weights mechanically maintains the diversification benefit you designed into the portfolio.
The reliability of a correlation estimate depends heavily on the number of data points (return periods) you use. Too few periods and the estimate is noisy and unreliable; too many and you may be including outdated data that no longer reflects the current relationship between assets.
General guidelines:
- Minimum: 12 monthly returns (1 year) — This is the bare minimum for a correlation estimate to have any statistical meaning. However, one year of data captures only one market regime, so the estimate may not generalize.
- Recommended: 36–60 monthly returns (3–5 years) — This is the standard range used by most portfolio analytics tools. Three to five years typically captures a mix of up and down markets, giving a more balanced picture.
- Full cycle: 60–120 monthly returns (5–10 years) — If you want to capture how assets correlate across a full market cycle (bull market, bear market, recovery), you need at least 5 years and ideally 10. This is most appropriate for long-term strategic asset allocation.
The tradeoff: Longer time series are more stable but may include periods where the economic regime was fundamentally different (e.g., zero interest rates vs. 5% rates). Shorter time series are more reflective of current conditions but are noisier and may capture temporary dislocations rather than true structural relationships.
Pro tip: If you can, calculate correlations over multiple windows (1-year rolling, 3-year rolling, 5-year total) and compare. If the correlation is consistently high across all windows, you have a robust relationship. If it varies widely, the relationship is unstable and you should be more cautious in relying on it for portfolio construction.
Ready to value the individual stocks in your diversified portfolio?