• Title/Summary/Keyword: Correlated error

Search Result 347, Processing Time 0.025 seconds

STANDARDIZATION STUDY FOR THE KOREAN VERSION OF THE LURIA-NEBRASKA NEUROPSYCHOLOGICAL BATTERY FOR CHILDREN II : EVALUATION OF THE VALIDITY & CLINICAL UTILITY OF THE KOREAN VERSION OF LNNB-C (한국판 아동용 Luria-Nebraska 신경심리 검사의 표준화 연구 II : 타당도 및 임상적 유용성 검증)

  • Shin, Min-Sup;Hong, Kang-E
    • Journal of the Korean Academy of Child and Adolescent Psychiatry
    • /
    • v.5 no.1
    • /
    • pp.70-82
    • /
    • 1994
  • Present study was to evaluate the validity and the clinical utility of the Korean version of Luria-Nebraska Neuropsychological Battery for Children(LNNB-C) in various groups including normal, brain damaged attention deficit hyperactivity disordered(ADHD), and psychiatrically disordered. The Korean version of LNNB-C and BGT were administered to clinical groups consisted of 51 patients(19 brain damaged, 16 ADHD. and 16 psychiatric controls), and to normal group composed of 147 children between the age of 8 and It Also KEDI-WISC was administered D clinical groups as a part of comprehensive psychological assessment There were significant differences between the brain damaged and the normals on all scales of LNNB-C, and between the normals and the ADHD on 11 clinical scales and 3 summary scales, which indicate the clinical validity for the scales of the Korean version of LNNB-C. The significant differences between the ADHD and the brain damaged on 3 summary scales were found, suggesting that the summary scales might play an important role id discriminating between two groups. Multiple discriminant analysis showed that the Korean version of LNNB-C significantly discriminates 3 groups - normals, ADHD, and brain damaged. Percentages of correct classification were ranged from 62.5% in the ADHD to 98.6Ta in the normals. For further evaluating the discriminant validity of the LNNB-C, the discriminant power of each items were calculated, and 131 of the 147 items discriminated significantly between the brain damaged and the normals. The scales of LNNB-C significantly correlated with the error scores of BGT and the most of scales of KEDI-WISC. These results put together : strongly support the concurrent and the discriminant validity of the Korean version of LNNB-C in diagnosing brain damage. The limitations of present study and several issues for the luther study were discussed.

  • PDF

Optimal Selection of Classifier Ensemble Using Genetic Algorithms (유전자 알고리즘을 이용한 분류자 앙상블의 최적 선택)

  • Kim, Myung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.4
    • /
    • pp.99-112
    • /
    • 2010
  • Ensemble learning is a method for improving the performance of classification and prediction algorithms. It is a method for finding a highly accurateclassifier on the training set by constructing and combining an ensemble of weak classifiers, each of which needs only to be moderately accurate on the training set. Ensemble learning has received considerable attention from machine learning and artificial intelligence fields because of its remarkable performance improvement and flexible integration with the traditional learning algorithms such as decision tree (DT), neural networks (NN), and SVM, etc. In those researches, all of DT ensemble studies have demonstrated impressive improvements in the generalization behavior of DT, while NN and SVM ensemble studies have not shown remarkable performance as shown in DT ensembles. Recently, several works have reported that the performance of ensemble can be degraded where multiple classifiers of an ensemble are highly correlated with, and thereby result in multicollinearity problem, which leads to performance degradation of the ensemble. They have also proposed the differentiated learning strategies to cope with performance degradation problem. Hansen and Salamon (1990) insisted that it is necessary and sufficient for the performance enhancement of an ensemble that the ensemble should contain diverse classifiers. Breiman (1996) explored that ensemble learning can increase the performance of unstable learning algorithms, but does not show remarkable performance improvement on stable learning algorithms. Unstable learning algorithms such as decision tree learners are sensitive to the change of the training data, and thus small changes in the training data can yield large changes in the generated classifiers. Therefore, ensemble with unstable learning algorithms can guarantee some diversity among the classifiers. To the contrary, stable learning algorithms such as NN and SVM generate similar classifiers in spite of small changes of the training data, and thus the correlation among the resulting classifiers is very high. This high correlation results in multicollinearity problem, which leads to performance degradation of the ensemble. Kim,s work (2009) showedthe performance comparison in bankruptcy prediction on Korea firms using tradition prediction algorithms such as NN, DT, and SVM. It reports that stable learning algorithms such as NN and SVM have higher predictability than the unstable DT. Meanwhile, with respect to their ensemble learning, DT ensemble shows the more improved performance than NN and SVM ensemble. Further analysis with variance inflation factor (VIF) analysis empirically proves that performance degradation of ensemble is due to multicollinearity problem. It also proposes that optimization of ensemble is needed to cope with such a problem. This paper proposes a hybrid system for coverage optimization of NN ensemble (CO-NN) in order to improve the performance of NN ensemble. Coverage optimization is a technique of choosing a sub-ensemble from an original ensemble to guarantee the diversity of classifiers in coverage optimization process. CO-NN uses GA which has been widely used for various optimization problems to deal with the coverage optimization problem. The GA chromosomes for the coverage optimization are encoded into binary strings, each bit of which indicates individual classifier. The fitness function is defined as maximization of error reduction and a constraint of variance inflation factor (VIF), which is one of the generally used methods to measure multicollinearity, is added to insure the diversity of classifiers by removing high correlation among the classifiers. We use Microsoft Excel and the GAs software package called Evolver. Experiments on company failure prediction have shown that CO-NN is effectively applied in the stable performance enhancement of NNensembles through the choice of classifiers by considering the correlations of the ensemble. The classifiers which have the potential multicollinearity problem are removed by the coverage optimization process of CO-NN and thereby CO-NN has shown higher performance than a single NN classifier and NN ensemble at 1% significance level, and DT ensemble at 5% significance level. However, there remain further research issues. First, decision optimization process to find optimal combination function should be considered in further research. Secondly, various learning strategies to deal with data noise should be introduced in more advanced further researches in the future.

The study of quantitative analytical method for pH and moisture of Hanji record paper using non-destructive FT-NIR spectroscopy (비파괴 분석 방법인 푸리에 변환 근적외선 분광 분석을 이용한 한지 기록물의 산성도 및 함수율 정량 분석 연구)

  • Shin, Yong-Min;Park, Soung-Be;Lee, Chang-Yong;Kim, Chan-Bong;Lee, Seong-Uk;Cho, Won-Bo;Kim, Hyo-Jin
    • Analytical Science and Technology
    • /
    • v.25 no.2
    • /
    • pp.121-126
    • /
    • 2012
  • It is essential to evaluate the quality of Hanji record paper without damaging the record paper by previous destructive methods. The samples were Hanji record paper produced in the 1900s. Near-infrared (NIR) spectrometer was used as a non destructive method for evaluating the quality of record papers. Fourier transform (FT) spectrometer was used with 12,500 to 4,000 $cm^{-1}$ wavenumber range for quantitative analysis and it has high accuracy and good signal-to-noise ratio. The acidity and moisture content of Hanji record paper were measured by integrating sphere as diffuse reflectance type. The acidity (pH) of chemical factors as a quality evaluated factor of Hanji was correlated to NIR spectrum. The NIR spectrum was pretreated to obtain the coefficients of optimum correlation. Multiplicative scatter correction (MSC) and First derivative of Savitzky-Golay were used as pretreated methods. The coefficients of optimum correlation were calculated by PLSR (partial least square regression). The correlation coefficients ($R^2$) of acidity had 0.92 on NIR spectra without pretreatment. Also the standard error of prediction (SEP) of pH was 0.24. And then the NIR spectra with pretreatment would have better correlation coefficient ($R^2$ = 0.98) and 0.19 as SEP on pH. For moisture contents, the linearity correlation without pretreatment was higher than the case with pretreatment (MSC, $1^{st}$ derivative). As the best result, the $R^2$ was 0.99 and SEP was 0.45. This indicates that it is highly proper to evaluate the quality of Hanji record papers speedily with integrated sphere and FT NIR analyzer as a non-destructive method.

The Relationship between Internet Search Volumes and Stock Price Changes: An Empirical Study on KOSDAQ Market (개별 기업에 대한 인터넷 검색량과 주가변동성의 관계: 국내 코스닥시장에서의 산업별 실증분석)

  • Jeon, Saemi;Chung, Yeojin;Lee, Dongyoup
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.81-96
    • /
    • 2016
  • As the internet has become widespread and easy to access everywhere, it is common for people to search information via online search engines such as Google and Naver in everyday life. Recent studies have used online search volume of specific keyword as a measure of the internet users' attention in order to predict disease outbreaks such as flu and cancer, an unemployment rate, and an index of a nation's economic condition, and etc. For stock traders, web search is also one of major information resources to obtain data about individual stock items. Therefore, search volume of a stock item can reflect the amount of investors' attention on it. The investor attention has been regarded as a crucial factor influencing on stock price but it has been measured by indirect proxies such as market capitalization, trading volume, advertising expense, and etc. It has been theoretically and empirically proved that an increase of investors' attention on a stock item brings temporary increase of the stock price and the price recovers in the long run. Recent development of internet environment enables to measure the investor attention directly by the internet search volume of individual stock item, which has been used to show the attention-induced price pressure. Previous studies focus mainly on Dow Jones and NASDAQ market in the United States. In this paper, we investigate the relationship between the individual investors' attention measured by the internet search volumes and stock price changes of individual stock items in the KOSDAQ market in Korea, where the proportion of the trades by individual investors are about 90% of the total. In addition, we examine the difference between industries in the influence of investors' attention on stock return. The internet search volume of stocks were gathered from "Naver Trend" service weekly between January 2007 and June 2015. The regression model with the error term with AR(1) covariance structure is used to analyze the data since the weekly prices in a stock item are systematically correlated. The market capitalization, trading volume, the increment of trading volume, and the month in which each trade occurs are included in the model as control variables. The fitted model shows that an abnormal increase of search volume of a stock item has a positive influence on the stock return and the amount of the influence varies among the industry. The stock items in IT software, construction, and distribution industries have shown to be more influenced by the abnormally large internet search volume than the average across the industries. On the other hand, the stock items in IT hardware, manufacturing, entertainment, finance, and communication industries are less influenced by the abnormal search volume than the average. In order to verify price pressure caused by investors' attention in KOSDAQ, the stock return of the current week is modelled using the abnormal search volume observed one to four weeks ahead. On average, the abnormally large increment of the search volume increased the stock return of the current week and one week later, and it decreased the stock return in two and three weeks later. There is no significant relationship with the stock return after 4 weeks. This relationship differs among the industries. An abnormal search volume brings particularly severe price reversal on the stocks in the IT software industry, which are often to be targets of irrational investments by individual investors. An abnormal search volume caused less severe price reversal on the stocks in the manufacturing and IT hardware industries than on average across the industries. The price reversal was not observed in the communication, finance, entertainment, and transportation industries, which are known to be influenced largely by macro-economic factors such as oil price and currency exchange rate. The result of this study can be utilized to construct an intelligent trading system based on the big data gathered from web search engines, social network services, and internet communities. Particularly, the difference of price reversal effect between industries may provide useful information to make a portfolio and build an investment strategy.

Pseudo Image Composition and Sensor Models Analysis of SPOT Satellite Imagery for Inaccessible Area (비접근 지역에 대한 SPOT 위성영상의 Pseudo영상 구성 및 센서모델 분석)

  • 방기인;조우석
    • Korean Journal of Remote Sensing
    • /
    • v.17 no.1
    • /
    • pp.33-44
    • /
    • 2001
  • The paper presents several satellite models and satellite image decomposition methods for inaccessible area where ground control points can hardly acquired in conventional ways. First, 10 different satellite sensor models, which were extended from collinearity condition equations, were developed and then behavior of each sensor model was investigated. Secondly, satellite images were decomposed and also pseudo images were generated. The satellite sensor model extended from collinearity equations was represented by the six exterior orientation parameters in $1^{st}$, $2^{nd}$ and $3^{rd}$ order function of satellite image row. Among them, the rotational angle parameters such as $\omega$(omega) and $\Phi$(phi) correlated highly with positional parameters could be assigned to constant values. For inaccessible area, satellite images were decomposed, which means that two consecutive images were combined as one image, The combined image consists of one satellite image with ground control points and the other without ground control points. In addition, a pseudo image which is an imaginary image, was prepared from one satellite image with ground control points and the other without ground control points. In other words, the pseudo image is an arbitrary image bridging two consecutive images. For the experiments, SPOT satellite images exposed to the similar area in different pass were used. Conclusively, it was found that 10 different satellite sensor models and 5 different decomposed methods delivered different levels of accuracy. Among them, the satellite camera model with 1st order function of image row for positional orientation parameters and rotational angle parameter of kappa, and constant rotational angle parameter omega and phi provided the best 60m maximum error at check point with pseudo images arrangement.

A Method of Reproducing the CCT of Natural Light using the Minimum Spectral Power Distribution for each Light Source of LED Lighting (LED 조명의 광원별 최소 분광분포를 사용하여 자연광 색온도를 재현하는 방법)

  • Yang-Soo Kim;Seung-Taek Oh;Jae-Hyun Lim
    • Journal of Internet Computing and Services
    • /
    • v.24 no.2
    • /
    • pp.19-26
    • /
    • 2023
  • Humans have adapted and evolved to natural light. However, as humans stay in indoor longer in modern times, the problem of biorhythm disturbance has been induced. To solve this problem, research is being conducted on lighting that reproduces the correlated color temperature(CCT) of natural light that varies from sunrise to sunset. In order to reproduce the CCT of natural light, multiple LED light sources with different CCTs are used to produce lighting, and then a control index DB is constructed by measuring and collecting the light characteristics of the combination of input currents for each light source in hundreds to thousands of steps, and then using it to control the lighting through the light characteristic matching method. The problem with this control method is that the more detailed the steps of the combination of input currents, the more time and economic costs are incurred. In this paper, an LED lighting control method that applies interpolation and combination calculation based on the minimum spectral power distribution information for each light source is proposed to reproduce the CCT of natural light. First, five minimum SPD information for each channel was measured and collected for the LED lighting, which consisted of light source channels with different CCTs and implemented input current control function of a 256-steps for each channel. Interpolation calculation was performed to generate SPD of 256 steps for each channel for the minimum SPD information, and SPD for all control combinations of LED lighting was generated through combination calculation of SPD for each channel. Illuminance and CCT were calculated through the generated SPD, a control index DB was constructed, and the CCT of natural light was reproduced through a matching technique. In the performance evaluation, the CCT for natural light was provided within the range of an average error rate of 0.18% while meeting the recommended indoor illumination standard.

The Variation of Natural Population of Pinus densiflora S. et Z. in Korea (III) -Genetic Variation of the Progeny Originated from Mt. Chu-wang, An-Myon Island and Mt. O-Dae Populations- (소나무 천연집단(天然集團)의 변이(變異)에 관(關)한 연구(硏究)(III) -주왕산(周王山), 안면도(安眠島), 오대산(五臺山) 소나무집단(集團)의 차대(次代)의 유전변이(遺傳變異)-)

  • Yim, Kyong Bin;Kwon, Ki Won
    • Journal of Korean Society of Forest Science
    • /
    • v.32 no.1
    • /
    • pp.36-63
    • /
    • 1976
  • The purpose of this study is to elucidate the genetic variation of the natural forest of Pinus densiflora. Three natural populations of the species, which are considered to be superior quality phenotypically, were selected. The locations and conditions of the populations are shown in table 1 and 2. The morphological traits of tree and needle and some other characteristics were presented already in our first report of this series in which population and family differences according to observed characteristics were statistically analyzed. Twenty trees were sampled from each populations, i.e., 60 trees in total. During the autumn of 1974, matured cones were collected from each tree and open-pollinated seeds were extracted in laboratory. Immediately after cone collection, in closed condition, the morphological characteristics were measured. Seed and seed-wing dimensions were also studied. In the spring of 1975, the seeds were sown in the experimental tree nursery located in Suweon. And in the April of 1976, the 1-0 seedlings were transplanted according to the predetermined experimental design, randomized block design with three replications. Because of cone setting condition. the number of family from which progenies were raised by populations were not equal. The numbers of family were 20 in population 1. 18 in population 2 and 15 in population 3. Then, each randomized block contained seedlings of 53 families from 3 populations. The present paper is mainly concerned with the variation of some characteristics of cone, seed, needle, growth performance of seedlings, and chlorophyll and monoterpene compositions of needles. The results obtained are summerized as follows. 1. The meteorological data obtained by averaging the records of 30 year period, observed from the nearest station to each location of populations, are shown in Fig. 3, 4, and 5. The distributional pattern of monthly precipitation are quite similar among locations. However, the precipitation density on population 2, Seosan area, during growing season is lower as compared to the other two populations. Population 1. Cheong-song area, and population 3, Pyong-chang area, are located in inland, but population 2 in the western seacoast. The differences on the average monthly air temperatures and the average monthly lowest temperatures among populations can hardly be found. 2. Available information on the each mother trees (families) studied, such as age, stem height, diameter at breast height, clear-bole-length, crown conditions and others are shown in table 6,7, and 8. 3. The measurements of fresh cone weight, length and the widest diameter of cone are given in Tab]e 9. All these traits arc concerned with the highly significant population differences and family differences within population. And the population difference was also found in the cone-index, that is, length-diameter ratio. 4. Seed-wing length and seed-wing width showed the population differences, and the family differences were also found in both characteristics. Not discussed in this paper, however, seed-wing colours and their shapes indicate the specificity which is inherent to individual trees as shown in photo 3 on page 50. The colour and shape are fully the expression of genetic make up of mother tree. The little variations on these traits are resulted from this reason. The significant differences among populations and among families were found in those characteristics, such as 1000-seed weight, seed length, seed width, and seed thickness as shown in table 11. As to all these dimensions, the values arc always larger in population 1 which is younger in age than that of the other two. The population differences evaluated by cone, seed and seed-wing sizes could partly be attributed to the growth vigorousity. 5. The values of correlation between the characteristics of cone and seed are presented in table 12. As shown, the positive correlations between cone diameter and seed-wing width were calculated in all populations studied. The correlation between seed-wing length and seed length was significantly positive in population 1 and 3 but not in population 2, that is, the r-value is so small as 0.002. in the latter. The correlation between cone length and seed-wing length was highly significant in population 1, but not in population 2. 6. Differences among progenies in growth performances, such as 1-0 and 1-1 seedling height and root collar diameter were highly singificant among populations as well as families within population(Table 13.) 7. The heritability values in narrow sense of population characteristics were estimated on the basis of variance components. The values based on seedling height at each age stage of 1-1 and 1-0 ranged from 0.146 to 0.288 and the values of root collar diameter from 0.060 to 0.130. (Table 14). These heritability values varied according to characteristics and seedling ages. Here what must be stated is that, for calculation of heritability values, the variance values of population was divided by the variance value of environment (error) and family and population. The present authors want to add the heritability values based on family level in the coming report. It might be considered that if the tree age is increased in furture, the heritability value is supposed to be altered or lowered. Examining the heritability values studied previously by many authors, in pine group at age of 7 to 15, the values of height growth ranged from 0.2 to 0.4 in general. The values we obtained are further below than these. 8. The correlation between seedling growth and seed characteristics were examined and the values resulted are shown in table 16. Contrary to our hypothetical premise of positive correlation between 1-0 seedling height and seed weight, non-significance on it was found. However, 1-0 seedling height correlated positively with seed length. And significant correlations between 1-0 and 1-1 seedling height are calculated. 9. The numbers of stomata row calculated separately by abaxial and adaxial side showed highly significant differences among populations, but not in serration density. On serration density, the differences among families within population were highly significant. (Table 17) A fact must be noted is that the correlation between stomata row on abaxial side and adaxial side was highly significant in all populations. Non-significances of correlation coefficient between progenies and parents regarding to stomata row on abaxial side were shown in all populations studied.(Table 18). 10. The contents of chhlorophyll b of the needle were a little more than that of chlorophyll a irrespective of the populations examined. The differences of chlorophyll a, b and a plus b contents were highly significant but not among families within populations as shown in table 20. The contents of chlorophyll a and b are presented by individual trees of each populations in table 21. 11. The occurrence of monoterpene components was examined by gas liquid chromatography (Shimazu, GC-1C type) to evaluate the population difference. There are some papers reporting the chemical geography of pines basing upon monoterpene composition. The number of populations studied here is not enough to state this problem. The kinds of monoterpene observed in needle were ${\alpha}$-pinene, camphene, ${\beta}$-pinene, myrcene, limonene, ${\beta}$-phellandrene and terpinolene plus two unknowns. In analysis of monoterpene composition, the number of sample trees varied with population, I.e., 18 families for population 1, 15 for population 2 and 11 for population3. (Table 22, 23 and 24). The histograms(Fig. 6) of 7 components of monoterpene by population show noticeably higher percentages of ${\alpha}$-pinene irrespective of population and ${\beta}$-phellandrene in the next order. The minor Pinus densiflora monoterpene composition of camphene, myrcene, limonene and terpinolene made up less than 10 percent of the portion in general. The average coefficients of variation of ${\alpha}$-pinene and ${\beta}$-phellandrene were 11 percent. On the contrary to this, the average coefficients of variation of camphene, limonene and terpinolene varied from 20 to 30 percent. And the significant differences between populaiton were observed only in myrcene and ${\beta}$-phellandrene. (Table 25).

  • PDF