Car Price Regression Analysis Summary Report

1. Introduction

This report presents a comprehensive analysis of car price prediction models using various regression approaches. The analysis examines multiple model specifications to identify the most effective predictors of car prices and gain insights into the factors that influence vehicle valuation.

The dataset contains information on car prices with several potential predictors:

2. Model Performance Summary

Several models were estimated to find the best approach for predicting car prices. The table below summarizes their performance across key metrics.

Model R-squared Adj. R-squared RMSE MAE Normality (p-value) AIC
Original 0.9484 0.9484 707.11 666.69 < 0.001 159618.4
Domain Knowledge 0.8705 0.8705 1119.88 860.43 < 0.001 168814.3
Economy Segment 0.7119 0.7114 786.88 604.93 < 0.001 53390.3
Mid-range Segment 0.3420 0.3409 665.15 546.41 < 0.001 53864.8
Luxury Segment 0.6243 0.6236 931.31 747.95 < 0.001 54502.5
Robust Regression N/A N/A 1145.31 835.56 < 0.001 N/A

Key Insights on Model Performance:

R-squared Comparison
Figure 1: Comparison of R-squared values across models
Error Metrics Comparison
Figure 2: RMSE and MAE comparison across models
QQ Plot Comparison
Figure 3: QQ plot comparison showing residual normality across models

3. Key Determinants of Car Prices

The analysis identified several key factors that significantly influence car prices:

3.1 Age/Year Effect

Vehicle age is one of the strongest predictors of price:

3.2 Mileage Impact

Mileage is another critical factor in car valuation:

3.3 Engine Size Premium

Engine size positively affects car prices:

3.4 Fuel Type Effects

Fuel type creates notable price premiums:

Coefficient Comparison
Figure 4: Comparison of key coefficient estimates across models

4. Key Insights and Implications

4.1 Market Segmentation

The analysis reveals distinct pricing dynamics across market segments:

Implication: Different marketing and pricing strategies should be employed for different market segments. Economy cars should emphasize low age and mileage, while luxury vehicles can highlight engine performance and alternative fuel options.

4.2 Depreciation Patterns

The analysis confirms that car depreciation follows non-linear patterns:

Implication: For investment purposes, luxury vehicles may retain value better over time. For consumers seeking value, mid-range vehicles with moderate mileage may offer the best price-to-value ratio.

4.3 Alternative Fuel Premium

Electric and hybrid vehicles maintain significant price premiums:

Implication: The strong premium for alternative fuel vehicles indicates consumer willingness to pay for green technology. Manufacturers should continue investing in alternative powertrains, while used car sellers should highlight these features in marketing.

4.4 Methodological Insights

The analysis demonstrates that:

Implication: Effective car price modeling requires both statistical rigor and domain expertise. Simple domain-informed transformations can be more effective than complex statistical approaches.

5. Conclusion

This comprehensive analysis of car price determinants reveals several key findings:

  1. Major Value Drivers: Vehicle age, mileage, engine size, and fuel type collectively explain a large portion of car price variation (approximately 94.8% ).
  2. Market Segmentation: The car market shows distinct pricing dynamics across segments, with different factors carrying different weights in each segment. This suggests targeted pricing strategies are appropriate.
  3. Alternative Fuel Value: Electric and hybrid vehicles maintain significant price premiums, reflecting consumer valuation of eco-friendly technology and potential fuel savings.
  4. Modeling Approach: While the original model performs well statistically (lowest AIC), the domain-knowledge model and segment-specific models offer superior interpretability and insights into market dynamics.

For practical applications, we recommend using the segment-specific models for targeted price predictions within market segments, and the domain knowledge model for general market analysis and interpretation of factors affecting car prices.