دانلود مقاله ISI انگلیسی شماره 22304
ترجمه فارسی عنوان مقاله

داده کاوی و روش های آماری برای توصیف عملکرد فتوولتائیک از ماژول غشای نازک

عنوان انگلیسی
Data mining and statistical techniques for characterizing the performance of thin-film photovoltaic modules
کد مقاله سال انتشار تعداد صفحات مقاله انگلیسی
22304 2013 10 صفحه PDF
منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Expert Systems with Applications, Volume 40, Issue 17, 1 December 2013, Pages 7141–7150

ترجمه کلمات کلیدی
توزیع طیفی خورشیدی - ماژول های فتوولتائیک - نسبت عملکرد - داده کاوی - مدل های آماری
کلمات کلیدی انگلیسی
Solar spectral distribution, Photovoltaic modules, Performance ratio, Data mining, Statistical models
پیش نمایش مقاله
پیش نمایش مقاله  داده کاوی و روش های آماری برای توصیف عملکرد فتوولتائیک از ماژول غشای نازک

چکیده انگلیسی

A method for characterizing the performance ratio of thin-film photovoltaic modules based on the use of data mining and statistical techniques is developed. In general, this parameter changes when modules are working in outdoor conditions depending on irradiance, temperature, air mass and solar spectral irradiance distribution. The problem is that it is usually difficult to know how to include solar spectral irradiance information when estimating the performance of photovoltaic modules. We propose five different solar spectral irradiance distributions that summarize all the different distributions observed in Malaga. Using the probability distribution functions of these curves and a statistical test, we first checked when two spectral distributions measured can be considered to have the same contribution of energy per wavelength. Hence, using this test and the k-means data mining technique, all the measured spectra, more than two hundred and fifty thousand, are clustered in only five different groups. All the spectra in each cluster can be considered as equal and the k-means technique estimates one centroid for each cluster that corresponds to the cumulative probability distribution function that is the most similar to the rest of the samples in the cluster. The results obtained proves that 99.98% of the functions can be considered equal to the centroid of its cluster. With these five types of functions, we have explained the changes in the performance ratio measured for thin-film photovoltaic modules of different technologies.

مقدمه انگلیسی

The increase in the performance of solar photovoltaic modules is a pressing issue in order to improve solar photovoltaic market share and make this industry more competitive. There are many elements where this efficiency can be achieved. The elements to be studied to better establish module performance are the key parameters given by the standard test conditions (STC) used to characterize and correct modules as defined by some international standards such as IEC 60904-3 (IEC, 1989) or IEC 61215 (IEC, 1993). The typical standard variables are temperature (25°), irradiance (1000 W/m2) and AM 1.5 spectrum. This paper is focused on the study and characterization of the spectrum (solar irradiance spectral distribution) and its influence on thin-film photovoltaic devices. The performance of these devices has a greater dependence on the spectrum than photovoltaic modules built using conventional technologies such as mono and policristalin-Si. Two standards for the AM 1.5 spectrum were defined for terrestrial use by the photovoltaic industry and the ASTM (American Society for Testing and Materials) (ASTM, 2012). The importance of this parameter lies in the fact that photovoltaic modules have a spectral response, (Martín and Ruiz, 1999), which will only allow a part of the whole solar spectral irradiance received on their surface ‘to be seen’. Spectral response is a characteristic of every cell technology. Depending on this factor a module will be able to provide more or less energy for one quantity of irradiation received. To study how solar spectral irradiance affects different cell technologies and to explain the different module performances observed, it is first necessary to have an indepth knowledge of the solar spectrum in any outdoor conditions where photovoltaic modules will be operating. Previous papers used different parameters to characterize solar spectrum. These include the Spectral Factor (SF), (Fabero and Chenlo, 1991), Mismatch Factor (M, MM or MMF), (Poissant et al., 2003), Usefull Fraction (UF), (Gottschalg et al., 2003), Photovoltaically Active Fractions (PAFs), (Berman et al., 1999), and the Average Photon Energy (APE), (Gottschalg et al., 2003). One of the reasons why solar spectrum is not among the typical variables is that measuring solar spectral irradiance involves using very expensive devices. We propose to analyze the APE as this parameter uses a single number to characterize a whole spectrum what makes easier for it to be included and treated as another meteorological variable, once calculated. The uniqueness of the relationship between the solar spectral irradiance distribution and the APE is proven, (Minemoto et al., 2009); however, these authors have proven this correspondence by dividing the values of spectral irradiance into several 50 nm bands and using these integrated spectral irradiance values. We propose a different approach, based on the use of a statistical test that enables us to use all the values of the spectrum. Moreover, the authors suggest that the results obtained should be verified with data from locations with different climatic conditions. In this study we are going to characterize the spectra at the location of Malaga, Andalusia (Spain) with a Mediterranean climate whose typical features are high humidity and mild and warm temperatures. Solar spectral irradiances have therefore been recorded at the laboratory facilities of the Solar Photovoltaic Energy group at the University of Malaga during a period of over one year. By using more than one year of data a complete range of spectrum under real working conditions is obtained and a possible seasonal effect is isolated. Once the spectrum is defined a more precise prediction of the energy produced by modules can be performed. This will result in better tools for the photovoltaic industry to size up photovoltaic plants and their integration into the power grid. One of the objectives of this paper is to determine the different types of solar spectral irradiance distribution curves and how these different curves are related to the different estimated APE values. Once these relationships are established, further goal of this paper is to analyze how the performance ratio (PR) for different photovoltaic modules is explained using the relationship found. PR values for these modules are estimated using data collected outdoors for over one year. We propose to use both statistical and data mining techniques that allows us to handle a large quantity of data in an automatic way to analyse and characterize the different APE values in our laboratory. On the one hand, we have used the well known Kolmogorov–Smirnov two sample test to check whether or not two distributions are the same (homogeneity test). On the other hand, we have used the k-means data mining technique to cluster all the estimated APEs in several clusters where all the measurements included have the same cumulative probability distribution of solar spectral irradiance. Many different areas have used clustering including text mining, statistical learning and pattern recognition, (Duda et al., 2001, Hastie et al., 2001 and Jain et al., 1999). In previous papers, spectra were gathered using the APE value but data mining techniques will here be used to characterize the spectra in a few types, aimed at trying to put into the same group spectra with different values of APE but with the same relative distribution of the irradiance for each wavelength. Once the different solar spectral irradiance measurements are characterized taking into account the similarities observed, this information has been used to analyze the performance ratio of different modules for many different outdoor conditions. In the second section, a detailed description of the data set is presented. In the third section the values of performance ratio for thin-film modules of different technologies are depicted and analyzed depending on several parameters. The fourth section is dedicated to analyzing the relationship between solar spectral irradiance and APE value using statistical techniques and to characterizing the different types of spectra irradiance measured using the k-means data mining technique. The fifth section is dedicated to presenting the obtained results and discussing these results for their use in the characterization of performance ratio of thin-film photovoltaic modules. Finally, the last section summarizes the conclusions of the work.

نتیجه گیری انگلیسی

We have analyzed the influence of the solar spectral irradiance distribution on the performance ratio of thin-film photovoltaic modules of different technologies. To carry out this analysis, we first characterize the cumulative probability distribution function of each spectrum using its APE value. After analyzing more than two hundred and fifty thousand measurements of solar spectral irradiation distributions, we can conclude that all these distribution can be represented using only 5 different distributions related to the APE values. We use statistical and data mining techniques to state this fact. Secondly, we prove that all the spectra that have the same APE value also have the same distribution shape. We here use the Kolmogorov–Smirnov two sample test that allows us to compare whether or not two cumulative probability distributions are the same. After perform this, we find that there are only five different types of distributions using the k-means data mining technique. Thanks to this technique it has been possible to reduce over two hundred and fifty thousand measured spectra to only five. Moreover, these five spectra are selected using the k-means algorithm; in fact, they are the centroids of each cluster. Almost all the spectra, more than 99.98% included in each cluster are the same to the centroid of the cluster using statistical criteria. Using these five selected distributions, the performance ratio values estimated for photovoltaic modules built with different technologies are analyzed. A detailed analysis of the curves of performance ratio obtained when they are used as independent variables the temperature and the APE value for each outdoor conditions allows us to conclude that the changes in the performance ratio of modules can be better explained if it is considered the spectrum associated to each APE value. All these results have been obtained using data from Malaga. It would be of great interest to perform them for other locations with different climatic conditions.