Home

Awesome

Date: 27th January 2021

Adoption of solar energy in residential in Virginia state

Report is available and can be downloaded (“Report_solar_adopt_EN”)<br/><br/>

1. References

[1] Data source: Stanford’s Deepsolar dataset, dec.2018 http://web.stanford.edu/group/deepsolar/home

[2] Python notebook / data analysis: https://www.kaggle.com/andromedasagan/implementation-of-solar-energy-in-the-us<br/><br/>

2. Context

The source of data is Stanford University’s DeepSolar project, a deep learning framework that analyzed satellite images to detect solar panels throughout the country. The data collected are the size, type (residential/non-residential) of the power systems distributed in the 48 states in the U.S. The associated socioeconomic data for these locations were recorded over several years.

My ambition in this work is to build a socio-economic analysis of the last mile to understand what are the profiles within a homogeneous group of households that is adopting solar energy. It focuses on the state of Virginia.

This work is based on a first chapter of data analysis [2] that highlights key trends and correlations in the deployment of solar power based on the full Deepsolar dataset.<br/><br/>

3. Goal

Explanations about the way I proceed:

PIC1<br/><br/>

4. Data exploration

We will measure the adoption of solar systems through the variable solar_panel_area_per_capita. We will draw up a matrix of correlations of deemed and less deemed factors over the target. Certain factors are deemed to be decisive in the choice for households to equip with solar systems. These factors become evident when the following observations are made:

Two missing factors that will unfortunately not be studied here, as they have not been collected in the present dataset:

Explanations about the way I proceed:

Short description of the extracted dataset

Dataset contains socio-economic and environmental data.

5. Missing values

In the dataset, there is a problem with very large floating-point numbers for which INF values are returned. To solve the problem, the max values are filtered and then discarded.

Explanations about the way I proceed:

PIC2

6. Vizualisations

Putting the factors face to face on graphs will allow us to see how what the locality has in household profiles contributes to adoption.

PIC6<br/><br/>

According to the Solar_panel_area_per_capita histogram, the group of vast majority of installed solar systems is located where Solar_panel_area_per_capita is below 0.03 m2/capita.

PIC7

Let's look at the ranges of median age, income and education level of 60% of the records of this sample, between the 20% and 80% percentiles.

We note the very meaningful characteristics of the group of households whose Solar_panel_area_per_capita is below 0.03 m2/capita for the vast majority of installed solar systems. For 60% of the records in this sample:

7. Correlational analysis

PIC8

There is a very strong correlation between average_household_income and number_of_years_of_education. We can easily explain this correlation. In general, education opens doors to higher-paying jobs.

Age_median is much less correlated with average_household_income.

It would have been interesting to see how these factors correlate with other factors that reflect the "green" mentality or the motivation to do savings.<br/><br/>

PIC9

8. Forecasting solar adoption

8.1 Preliminary modeling

PIC10

PIC11

PIC12

Good fitting on the training data. The test score could be much higher.

PIC13

A limited number of descriptives variables can acceptably explain the adoption of solar energy: the predominant are the average_household_income of the locality, age_median and number_of_years_of_education factors.

The level of education (number_of_years_of_education) factor may be influencing the adoption because of partly an intrinsec factor which is the level of income again (average_household_income). We have seen that these factors are highly correlated.

Age_median has also a contribution to the adoption.<br/><br/>

8.2 Fine-tuning the parameters of the model

PIC14

Correct fitting on the training data, the test score is acceptable. The model shows some signs of overfitting, i.e. weaknesses in its generalizability.

PIC15

PIC16<br/><br/>

8.3 Conclusions

The fine-tuned model confirms that the average_household_income of the locality, number_of_years_of_education, age_median and daily_solar_radiation factors can acceptably explain the adoption of solar energy in Virginia state.

The average_household_income factor has the most important contribution.

The level of education (number_of_years_of_education) is also an important factor in which we can find intrinsically the level of income (average_household_income), as seen before in the correlation matrix.

Concerning the contribution of age_median, to explain this we should rather look at what advancement in age and career brings: perhaps the rationality of the choices of household members, the stability of its income and the ability to make investments in order to project savings in the upcoming years.

This ranking could be made more realistic by taking into account adoption factors here absent from the analysis such as the ecological motivation of households or the intensity of financial incentives provided for the purchase of a solar system. Unfortunately, we did not find data illustrating these considerations in the DeepSolar dataset. This would need to be improved.