project_03 · 2024 · 2 months · Solo

Wage Distribution Analysis

A statistical comparison of wage distributions over time.

Stack: Python · Pandas

U.S. Household Income Distribution Analysis (2010-2023)

Project Overview

This project analyzes the evolution of household income distribution in the United States from 2010 to 2023 using data from the U.S. Census Bureau's American Community Survey. The analysis examines how the proportion of households in different income brackets has changed over time, revealing significant shifts in the American economic landscape.

Key Finding: The data shows a clear upward mobility trend, with higher income brackets ($75,000+) gaining share of households while lower income brackets lose share.

Research Questions

This exploratory data analysis addresses several key questions:

  1. How has household income distribution changed from 2010 to 2023?
  2. Which income groups experienced the most significant changes?
  3. What does this tell us about economic mobility and wage growth?
  4. How has the median household income bracket shifted over time?
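For question 4, the bracket containing the median household can be read off the cumulative shares: it is the first bracket at which the running total reaches 50%. The sketch below is one way to do that, not necessarily how the notebook does it. The function name `median_bracket` is hypothetical, the labels assume the ten S1901 income brackets, and the percentages are made up for illustration.

```python
import pandas as pd

def median_bracket(shares: pd.Series) -> str:
    """Return the label of the bracket that contains the median household.

    `shares` must be ordered from the lowest to the highest bracket and
    hold percentages of households (summing to roughly 100).
    """
    cumulative = shares.cumsum()
    # First bracket whose cumulative share reaches half of the total.
    return cumulative[cumulative >= shares.sum() / 2].index[0]

# Made-up shares for a single year, for illustration only.
example = pd.Series(
    [6.0, 4.0, 8.0, 9.0, 12.0, 17.0, 13.0, 16.0, 7.0, 8.0],
    index=[
        "Less than $10,000",
        "$10,000 to $14,999",
        "$15,000 to $24,999",
        "$25,000 to $34,999",
        "$35,000 to $49,999",
        "$50,000 to $74,999",
        "$75,000 to $99,999",
        "$100,000 to $149,999",
        "$150,000 to $199,999",
        "$200,000 or more",
    ],
)

print(median_bracket(example))
```

Applying the same function to each year's row would give the median bracket per year, which can then be compared across the study period.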

Dataset

The analysis uses U.S. Census Bureau American Community Survey (ACS) 1-Year Estimates data (Table S1901) spanning 2010-2023. The dataset contains:

Income Brackets Analyzed

Methodology

Data Processing

  1. Data Integration: Combined 13 years of ACS data files into a unified dataset
  2. Data Cleaning:
     - Removed formatting symbols (%, ±, commas) from numeric fields
     - Converted percentage strings to numeric values
     - Standardized column names for consistency
  3. Data Validation: Filtered out invalid or missing values to ensure analysis integrity
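The processing steps above can be sketched roughly as follows. This is a minimal illustration rather than the notebook's actual code: the helper names (`year_from_filename`, `clean_numeric`), the column names, the `-example.csv` file suffix, and every value in `raw` are assumptions made for the example.

```python
import re
import pandas as pd

def year_from_filename(name: str) -> int:
    """Pull the survey year out of a name like 'ACSST1Y2010.S1901-...csv'."""
    match = re.search(r"ACSST1Y(\d{4})", name)
    if match is None:
        raise ValueError(f"no ACS 1-year tag in {name!r}")
    return int(match.group(1))

def clean_numeric(values: pd.Series) -> pd.Series:
    """Strip %, ±, commas and whitespace, then convert to float.

    Anything that still fails to parse (for example '(X)') becomes NaN.
    """
    stripped = values.astype(str).str.replace(r"[%±,\s]", "", regex=True)
    return pd.to_numeric(stripped, errors="coerce")

# Hypothetical raw rows standing in for two yearly files (values are made up).
raw = pd.DataFrame({
    "source_file": ["ACSST1Y2010.S1901-example.csv"] * 3
                   + ["ACSST1Y2011.S1901-example.csv"] * 3,
    "Income Bracket": ["Less than $10,000", "$10,000 to $14,999", "Total"] * 2,
    "Households Estimate": ["7.6%", "5.8%", "1,234,567", "7.8%", "(X)", "1,250,000"],
})

# Standardize column names, attach the year, and convert strings to numbers.
tidy = raw.rename(columns={"Income Bracket": "bracket",
                           "Households Estimate": "value"})
tidy["year"] = tidy["source_file"].map(year_from_filename)
tidy["value"] = clean_numeric(tidy["value"])

# Validation: drop rows whose value could not be parsed.
tidy = tidy.dropna(subset=["value"]).reset_index(drop=True)
print(tidy[["year", "bracket", "value"]])
```

Using `errors="coerce"` turns unparseable cells into NaN, so the validation step reduces to a single `dropna` call.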

Analysis Approach

Visualization Strategy

The project evolved through multiple visualization approaches:

  1. Initial Approach: Ridgeline plots (faceted bar charts by year)
  2. Final Approach: Multi-line time series plots for better trend identification

Key Findings

Specific Changes (2010-2023)

The analysis reveals that higher-income households have become a larger share of all households, while the share of lower-income households has declined, suggesting overall economic improvement during this period.
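As an illustration of how such changes can be quantified, the sketch below computes the change in percentage points per bracket between the first and last year, plus the combined share of the brackets above $75,000. The table layout (years as rows, brackets as columns) is an assumption, only four brackets are shown, and all numbers are made up rather than taken from the analysis.

```python
import pandas as pd

# Made-up shares (percent of households); rows are years, columns are brackets.
wide = pd.DataFrame(
    {
        "$35,000 to $49,999": [14.0, 11.0],
        "$50,000 to $74,999": [18.0, 16.0],
        "$75,000 to $99,999": [12.0, 13.0],
        "$100,000 to $149,999": [12.0, 17.0],
    },
    index=pd.Index([2010, 2023], name="year"),
)

# Change in percentage points between the first and last year, per bracket.
pp_change = wide.iloc[-1] - wide.iloc[0]

# Combined share of the brackets at or above $75,000, per year.
HIGH_BRACKETS = ["$75,000 to $99,999", "$100,000 to $149,999"]
high_share = wide[HIGH_BRACKETS].sum(axis=1)

print(pp_change)
print(high_share)
```

Subtracting shares gives a difference in percentage points, which is not the same as a percent change relative to the starting share.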

Results

The analysis produced two key visualizations that illustrate the changing household income distribution over the 13-year study period:

Initial Visualization: Ridgeline Plot

The first approach used a ridgeline-style visualization showing the distribution for each year separately. While this provided a comprehensive view of all years, it made trend identification challenging as viewers had to compare across multiple subplots.

Ridgeline Plot of Household Income Distribution 2010-2023

Figure 1: Ridgeline plot showing household income distribution by year (2010-2023). Each row represents a different year, with bars showing the percentage of households in each income bracket.

Final Visualization: Time Series Analysis

The refined approach used a multi-line time series plot that clearly demonstrates the diverging trends between income brackets. This visualization makes the upward mobility pattern immediately apparent.

Time Series of Household Income Distribution

Figure 2: Time series plot showing the evolution of household income distribution from 2010-2023. Higher income brackets (warmer colors) show increasing trends, while lower income brackets (cooler colors) show declining trends.
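A plot like Figure 2 could be produced along these lines. This is a sketch under assumptions, not the notebook's code: it supposes a long table with `year`, `bracket`, and `share` columns, uses matplotlib for drawing, and the names `to_wide` and `plot_trends` and the sample values are invented for the example.

```python
import pandas as pd

def to_wide(tidy: pd.DataFrame) -> pd.DataFrame:
    """Pivot long (year, bracket, share) rows into a year x bracket table."""
    return tidy.pivot(index="year", columns="bracket", values="share").sort_index()

def plot_trends(wide: pd.DataFrame):
    """Draw one line per income bracket and return the matplotlib Axes."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for drawing

    fig, ax = plt.subplots(figsize=(10, 6))
    for bracket in wide.columns:
        ax.plot(wide.index, wide[bracket], marker="o", label=bracket)
    ax.set_xlabel("Year")
    ax.set_ylabel("Share of households (%)")
    ax.set_title("Household income distribution over time")
    ax.legend(title="Income bracket", fontsize="small")
    return ax

# Made-up long-format data, for illustration only.
tidy = pd.DataFrame({
    "year": [2010, 2010, 2023, 2023],
    "bracket": ["$35,000 to $49,999", "$100,000 to $149,999"] * 2,
    "share": [14.0, 12.0, 11.0, 17.0],
})
wide = to_wide(tidy)
print(wide)
```

Calling `plot_trends(wide)` then draws one line per bracket on a shared axis, so diverging trends can be compared directly instead of across per-year subplots.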

Key Observations from Visualizations

  1. Clear Divergence: The time series plot reveals a distinct separation between income brackets above and below $75,000
  2. Consistent Trends: The trends are remarkably consistent across the entire study period, suggesting structural economic changes rather than temporary fluctuations
  3. Magnitude of Change: The visualizations show changes of approximately 2 to 4 percentage points across different income brackets
  4. Crossover Points: Some income brackets intersect over time, indicating shifts in the relative size of different economic segments
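Crossover points like those in the last observation can also be located programmatically instead of by eye. The sketch below flags the years in which one bracket's share moves from above to below another's, or the reverse. The function name `crossover_years` is hypothetical and the two series hold made-up values.

```python
import numpy as np
import pandas as pd

def crossover_years(a: pd.Series, b: pd.Series) -> list:
    """Years in which the sign of (a - b) differs from the previous year."""
    sign = np.sign(a - b)
    sign = sign[sign != 0]          # ignore exact ties
    flipped = sign != sign.shift()  # the first row compares against NaN
    # Drop the first entry, which is always flagged by the NaN comparison.
    return sign.index[flipped.to_numpy()][1:].tolist()

years = [2010, 2015, 2019, 2023]
# Made-up shares (percent of households) for two brackets.
lower = pd.Series([14.0, 13.0, 12.0, 11.0], index=years)
higher = pd.Series([12.0, 12.5, 13.0, 15.0], index=years)

print(crossover_years(lower, higher))
```

The returned year is the first observation after the crossing, since the exact crossing point falls somewhere between two annual estimates.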

Technical Implementation

Tools and Libraries

Project Structure

wage_distribution/
├── main.ipynb              # Primary analysis notebook
├── readme.md              # Project documentation
└── Data/                  # Census data files (2010-2023)
    ├── ACSST1Y2010.S1901-*.csv
    ├── ACSST1Y2011.S1901-*.csv
    └── ...

Limitations and Considerations

Future Enhancements

Data Source

U.S. Census Bureau, American Community Survey 1-Year Estimates, Table S1901: "Income in the Past 12 Months (in 2023 Inflation-Adjusted Dollars)"

URL: https://data.census.gov/table/ACSST5Y2020.S1901?q=S1901

Author

John Curran

This project serves as a demonstration of exploratory data analysis methodology, following a narrative approach that documents the analytical thought process from initial data exploration to final insights.

© 2026 · case study updated 2026-05-25