• Title/Summary/Keyword: Selection System

Search Result 5,381, Processing Time 0.033 seconds

Compatibility of Double Cropping of Winter Wheat - Summer Grain Crops in Paddy Field of Southern Korea (남부지역 논의 밀 이모작에서 하계 곡실작물 도입의 적합성)

  • Seo, Jong-Ho;Hwang, Chung-Dong;Oh, Seong-Hwan
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.66 no.1
    • /
    • pp.18-28
    • /
    • 2021
  • The growth period and productivity of cropping system of winter wheat-rice, winter wheat-bean and winter wheat-grain corn for 4 years from 2015 to 2018 were compared at the experimental field of National Institute of Crop Science in Miryang city. The harvest period of winter wheat was in mid-June, and summer crops were sown (transplanted) in late June. In transplanting of rice in late June, there was no difficulty in securing the heading of panicle and the yield of rice, but there was a lot of trouble in sowing wheat in proper time because the harvest time of rice was delayed to early November due to late maturity of rice, particularly in the mid-late maturing cultivar. There was no problem in soybean planting after winter wheat because the proper period of soybean planting is late-June. In addition, there was no problem in winter wheat sowng after soybean because the maturity period of soybean was mid-October. Selection of grain maize in double cropping with winter wheat in terms of growing periods, was desirable because grain maize had the fastest maturity among summer crops. In double cropping of winter wheat-summer crops, wheats combined with soybean and grain maize showed stable yields during three years, but there was a risk of yield declines in the wheat combined with rice in heavy rainfall year. It was possible to secure high yields in three summer crops as yields of rice, soybean, and corn were 600, 350, and 800 kg/10a, respectively. Summer crops with medium maturity was recommended because of no significant difference in yield between medium maturity and medium-late maturity cultivar. Soil physical properties were improved in soils cultivated with soybean and grain maize. Therefore, It was thought that double cropping systems of winter wheat with soybean and grain maize were superior to that of winter wheat with rice in terms of connecting period between winter wheat - summer crops and improvement of soil physical properties, and total income, particularly in soybean.

A Study on the Application of Other Effective Area-based Conservation Measures(OECMs) for Natural Heritage - Focusing on the Old Big Trees of Natural Monument and Dangsan Ritual - (자연유산의 '기타 효과적인 지역기반 보전수단(OECMs)' 등재기준 적용 연구 - 천연기념물 노거수와 당산제를 중심으로 -)

  • Jun, Da-Seul;Shin, Hyun-Sil
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.40 no.3
    • /
    • pp.1-9
    • /
    • 2022
  • This study compared and reviewed the recognition determinants by applying the OECMs criteria, focusing on old big trees, plant of natural monument that are natural heritage under the national heritage system of the Cultural Heritage Administration, and the results are as follows. First, among the protected areas designated and managed by government agencies according to each protection purpose, it is necessary to actively introduce new conservation measures, OECMs, to fulfill the Biodiversity strategy for 2030 while the land area is already saturated. Second, the OECMs are geographically defined areas(CBD, 2018), not currently recognized as a protected areas, governed and managed in a way that achieves positived sustained and effective contribution to in situ conservation of biodiversity. Since the selection of term, the scope of application criteria, and the context of interpretation are inevitably different, it is necessary to separately legislate and establish related laws of the OECMs suitable for each country's situation. Third, as a result of reviewing the OECMs criteria for plant of natural monument, the final 58 potential resources were recognized. Important elements among the OECMs criteria are that buffer zones should be spaced apart from designated zones to secure a certain area, and that economic activities through commercial production should not occur and meet biodiversity standards. Among the potential candidates, 23 areas were analyzed to be geographically isolated and independent, such as Forest of Oriental Arborvitae in Do-dong, Daegu, and forest types such as Carstor Aralia of Gungchon-ri, Samcheok and Forest of Common Camellias in Maryang-ri, Seocheon. As a result of reviewing the application of OECMs criteria for plant of natural monument, it was confirmed that the functions as a traditional uses were specialized among the values of biodiversity, and ecosystem services and cultural and spiritual values were inherited through Korea's unique culture of old big trees and Dangsan ritual. In terms of biodiversity criteria, it can be used as an important factor in connecting human and natural ecosystem networks without the discovery of new species.

Target candidate fish species selection method based on ecological survey for hazardous chemical substance analysis (유해화학물질 분석을 위한 생태조사 기반의 타깃 후보어종 선정법)

  • Ji Yoon Kim;Sang-Hyeon Jin;Min Jae Cho;Hyeji Choi;Kwang-Guk An
    • Korean Journal of Environmental Biology
    • /
    • v.41 no.2
    • /
    • pp.109-125
    • /
    • 2023
  • This study was conducted to select target fish species as baseline research for accumulation analysis of major hazardous chemicals entering the aquatic ecosystem in Korea and to analyze the impact on fish community. The test bed was selected from a sewage treatment plant, which could directly confirm the impact of the inflow of harmful chemicals, and the Geum River estuary where harmful chemicals introduced into the water system were concentrated. A multivariable metric model was developed to select target candidate fish species for hazardous chemical analysis. Details consisted of seven metrics: (1) commercially useful metric, (2) top-carnivorous species metric, (3) pollution fish indicator metric, (4) tolerance fish metric, (5) common abundant metric, (6) sampling availability (collectability) metric, and (7) widely distributed fish metric. Based on seven metric models for candidate fish species, eight species were selected as target candidates. The co-occurring dominant fish with target candidates was tolerant (50%), indicating that the highest abundance of tolerant species could be used as a water pollution indicator. A multi-metric fish-based model analysis for aquatic ecosystem health evaluation showed that the ecosystem health was diagnosed as "bad conditions". Physicochemical water quality variables also influenced fish feeding and tolerance guild in the testbed. Eight water quality parameters appeared high at the T1 site, indicating a large impact of discharging water from the sewage treatment plant. T2 site showed massive algal bloom, with chlorophyll concentration about 15 times higher compared to the reference site.

A Study on Human Rights in North Korea in terms of Haewon-sangsaeng (해원상생 관점에서의 북한인권문제 고찰)

  • Kim Young-jin
    • Journal of the Daesoon Academy of Sciences
    • /
    • v.43
    • /
    • pp.67-102
    • /
    • 2022
  • The purpose of this study is to analyze the human rights found in the North Korean Constitution and their core problem by focusing on elements of human rights suggested by Daesoon Jinrihoe's doctrine of Haewon-sangsaeng (解冤相生 the Resolution of Grievances for Mutual Beneficence). Haewon-sangsaeng is seemingly the only natural law that could resolve human resentment lingering from the Mutual Contention of the Former World while leading humans work for the betterment of one another. Haewon-sangsaeng, as a natural law, includes the right to life, the right to autonomous decision-making, and duty to act according to human dignity (physical freedom, the freedom of conscience, freedom of religion, freedom of speech, freedom of press, etc.), the right to equal treatment in one's social environment, and the right to ensure the highest level of health through treatment. The North Korean Constitution does not have a character as an institutional device to guarantee natural human rights, the fundamental principle of the Constitution, and stipulates the right of revolutionary warriors to defend dictators and dictatorships. The right to life is specified so that an individual's life belongs to the life of the group according to their socio-political theory of life. Rights to freedom are stipulated to prioritize group interests over individual interests in accordance with the principle of collectivism. The right to equality and the right to health justify discrimination through class discrimination. The right to life provided to North Koreans is not guaranteed due to the death penalty system found within the North Korean Criminal Code and the Criminal Code Supplementary Provisions. The North Korean regime deprives North Koreans of their right to die with dignity through public executions. The North Korean regime places due process under the direction of the Korea Worker's Party, recognizes religion as superstition or opium, and the Korea Worker's Party acknowledge the freedoms of bodily autonomy, religion, media, or press. North Koreans are classified according to their status, and their rights to equality are not guaranteed because they are forced to live a pre-modern lifestyle according to the patriarchal order. In addition, health rights are not guaranteed due biased availability selection and accessibility in the medical field as well as the frequent shortages of free treatments.

Long-Term Survival Analysis of Unicompartmental Knee Arthroplasty (슬관절 부분 치환술의 장기 생존 분석)

  • Park, Cheol Hee;Lee, Ho Jin;Son, Hyuck Sung;Bae, Dae Kyung;Song, Sang Jun
    • Journal of the Korean Orthopaedic Association
    • /
    • v.54 no.5
    • /
    • pp.427-434
    • /
    • 2019
  • Purpose: This study evaluated the long term clinical and radiographic results and the survival rates of unicompartmental knee arthroplasty (UKA). In addition, the factors affecting the survival of the procedure were analyzed and the survival curve was compared according to the affecting factors. Materials and Methods: Ninety-nine cases of UKA performed between December 1982 and January 1996 were involved: 10 cases with Modular II, 44 cases with Microloc, and 45 cases with Allegretto prostheses. The mean follow-up period was 16.5 years. Clinically, the hospital for special surgery (HSS) scoring system and the range of motion (ROM) were evaluated. Radiographically, the femorotibial angle (FTA) was measured. The survival rate was analyzed using the Kaplan-Meier method. Cox regression analysis was used to identify the factors affecting the survival according to age, sex, body mass index, preoperative diagnosis, and type of implant. The Kaplan-Meier survival curves were compared according to the factors affecting the survival of UKA. Results: The overall average HSS score and ROM was 57.7 and 134.3° preoperatively, 92.7 and 138.4° at 1 year postoperatively, and 79.1 and 138.4° at the last follow-up (p<0.001, respectively). The overall average FTA was varus 0.8° preoperatively, valgus 4.1° at postoperative 2 weeks, and valgus 3.0° at the last follow-up. The overall 5-, 10-, 15- and 20-year survival rates were 91.8%, 82.9%, 71.0%, and 67.0%, respectively. The factors affecting the survival were the age and type of implant. The risk of the failure decreased with age (hazard ratio=0.933). The Microloc group was more hazardous than the other prostheses (hazard ratio=0.202, 0.430, respectively). The survival curve in the patients below 60 years of age was significantly lower than those of the patients over 60 years of age (p=0.003); the survival curve of the Microloc group was lower compared to the Modular II and Allegretto groups (p=0.025). Conclusion: The long-term clinical and radiographic results and survival of UKA using old fixed bearing prostheses were satisfactory. The selection of appropriate patient and prosthesis will be important for the long term survival of the UKA procedure.

Demand for Priorities for Preventing Occupational Diseases among Farmers (농업인들의 업무상질환 예방을 위한 우선순위에 대한 요구도)

  • Ae-Rim Seo;Ji-Youn Kim;Bokyoung Kim;Gyeong-Ye Lee;Kyungsu Kim;Ki-Soo Park
    • Journal of agricultural medicine and community health
    • /
    • v.48 no.4
    • /
    • pp.239-250
    • /
    • 2023
  • Objective: This study was a preliminary study for the prevention programs for farmers' occupational diseases. It selected the priorities recognized by farmers, such as occupational diseases, and also identifies the effectiveness and feasibility of prevention programs among diseases recognized by farmers. Therefore, we plan to use it as basis data for future farmer safety and health programs. Method: The subjects of the study were farmers living in the region, selected through a snowball recruitment method, and a total of 671 people were targeted. The priority selection method was the Basic Priority Rating System (BPRS) method, and among the occupational diseases, programs to prevent musculoskeletal diseases, cardiovascular and respiratory diseases, and pesticide poisoning were surveyed on the effectiveness and feasibility of farmers. Results: Among occupational diseases, the highest priority was musculo-skeletal disease, followed by respiratory disease and pesticide poisoning. Among the programs for musculoskeletal disease, 'use of agricultural work convenience equipment and auxiliary tools' had the highest perceived effectiveness and feasibility. Among the five programs for pesticide poisoning, 'equipment of protective equipment such as pesticide protective clothing/glove' had the highest effectiveness at 67.4%, and 'compliance with pesticide use instructions' had the highest level of feasibility at 64.3%. Among the four programs to prevent respiratory diseases, 'wearing a dust mask or gas mask' was the highest at 65.5% in terms of both effectiveness and feasibility. Conclusion: When carrying out safety and health programs for farmers, the priorities recognized by farmers should be taken into consideration, and the program contents should also be developed taking into account the size of effect and feasibility recognized by farmers.

A Study on the Location of Retail Trade in Kwangju-si and Its Inhabitants와 Effcient Utilization (광주시 소매업의 입지와 주민의 효율적 이용에 관한 연구)

  • ;Jeon, Kyung-sook
    • Journal of the Korean Geographical Society
    • /
    • v.30 no.1
    • /
    • pp.68-92
    • /
    • 1995
  • Recentry the structure of the retail trade have been chanaed with its environmantal changes. Some studies may be necessary on the changing process of environment and fundamental structure analyses of the retail trade. This study analyzes the location of retail trades, inhabitants' behavior in retail tredes and their desirable utilization scheme of them in Kwangju-si. Some study methods, contents and coming-out results are as follows: 1. Retail trades can be classified into independent stores, chain-stores (supermarket, voluntary chain and frenchiise system and convenience store), department stores, cooperative associations, traditional, markets mail-order marketing, automatic vending and others by service levels, selling-items, prices, managements, methods of retailing and store or nonstore type. 2. In Kwangju, the environment of retail trades is related to the consumers of population structure: chanes in consumers pattern, trends toward agings and nuclear family, increase of leisur: time and female advances to society. Rapid structural shift in retail trade has also been occurred due to these social changes. Traditionl and premodern markets until 1970s altere to supermarkets or department stores in 1980s, and various types, large enterprises and foreign capitals came into being in 1990s. 3. The locational characteristics of retail trades are resulted from the spatial analysis of the total population distribution, and from the calculation of segregation index in the light of potential demand. The densely-populated areas occurs in newly-built apartment housing complex which is distributed with a ring-shaped pattern around the old urban core. The numbers and rates of the aged over sixty in Kwangsan-gu and the circumference area of Mt.Moodeung, are larger and higher where rural elements are remarkable. A relation between population distribution and retail trade are analysed by the index of population per shop. The index of the population number per shop is lower in urban center, as a whole, being more convenient for consumers. In newly-formed apartment complex areas, on the other, the index more than 1,000 per shop, meeting not the demands for consumers. Because both the younger and the aged are numerous in these areas, the retail trade pattern pertinent to both are needed. Urban fringes including Kwangsan-gu and the vicinity of Mt.Moodeung have some problems owing to the most of population number per shop (more than 1, 500) and the most extensive as well. 4. The regional characteristic of retail trade is analyzed through the location quotient of shops by locational patterns and centerality index. Chungkum-dong is the highest-order central place in CBD. It is the core of retail trades, which has higher-ordered specialty store including three big department stores, supermarkets and large stores. Taegum-dong, Chungsu-dong, Taeui-dong, and Numun-dong that are neiahbored to Chungkum-dong fall on the second group. They have a central commercial section where large chain stores, specialty shopping streets, narrow-line retailing shops (furniture, amusement service, and gallary), supermarkets and daily markets are located. The third group is formed on the axis of state roads linking to Naju-kun, Changseong-kun, Tamyang-kun, Hwasun-kun and forme-Songjeong-eup. It is related to newly, rising apartment housing complex along a trunk road, and characterized by markets and specialty stores. The fourth group has neibourhood-shopping centers including older residential area and Songjeong-eup area with independent stores and supermarkets as main retailing functions. The last group contains inner residential area and outer part of a city including Songjeong-eup. Outer part of miscellaneous shops being occasionally found is rural rather than urban (Fig. 7). 5. The residents' behaviors using retail trade are analyzed by factors of goods and facilities. Department stores are very high level in preference for higher-order shopping-goods such as clothes for full dress in view of both diversity and quality of goods(28.9%). But they have severe traffic congestions, and high competitions for market ranges caused by their sma . 64.0% of respondents make combined purpose trips together with banking and shopping. 6. For more efficiency of retail-trading, it is necessary to induce spatial distribution policy with regard to opportunity frequency of goods selection by central place, frontier regions and age groups. Also we must consider to analyze competition among different types of retail trade and analyze the consumption behaviors of working females and younger-aged groups, in aspects of time and space. Service improvement and the rationalization of management should be accomplished in such as cooperative location (situation) must be under consideration in relations to other functions such as finance, leisure & sports, and culture centers. Various service systems such as installment, credit card and peremium ticket, new used by enterprises, must also be carried service improvement. The rationalization and professionalization in for the commercial goods are bsically requested.

  • PDF

Corporate Default Prediction Model Using Deep Learning Time Series Algorithm, RNN and LSTM (딥러닝 시계열 알고리즘 적용한 기업부도예측모형 유용성 검증)

  • Cha, Sungjae;Kang, Jungseok
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.1-32
    • /
    • 2018
  • In addition to stakeholders including managers, employees, creditors, and investors of bankrupt companies, corporate defaults have a ripple effect on the local and national economy. Before the Asian financial crisis, the Korean government only analyzed SMEs and tried to improve the forecasting power of a default prediction model, rather than developing various corporate default models. As a result, even large corporations called 'chaebol enterprises' become bankrupt. Even after that, the analysis of past corporate defaults has been focused on specific variables, and when the government restructured immediately after the global financial crisis, they only focused on certain main variables such as 'debt ratio'. A multifaceted study of corporate default prediction models is essential to ensure diverse interests, to avoid situations like the 'Lehman Brothers Case' of the global financial crisis, to avoid total collapse in a single moment. The key variables used in corporate defaults vary over time. This is confirmed by Beaver (1967, 1968) and Altman's (1968) analysis that Deakins'(1972) study shows that the major factors affecting corporate failure have changed. In Grice's (2001) study, the importance of predictive variables was also found through Zmijewski's (1984) and Ohlson's (1980) models. However, the studies that have been carried out in the past use static models. Most of them do not consider the changes that occur in the course of time. Therefore, in order to construct consistent prediction models, it is necessary to compensate the time-dependent bias by means of a time series analysis algorithm reflecting dynamic change. Based on the global financial crisis, which has had a significant impact on Korea, this study is conducted using 10 years of annual corporate data from 2000 to 2009. Data are divided into training data, validation data, and test data respectively, and are divided into 7, 2, and 1 years respectively. In order to construct a consistent bankruptcy model in the flow of time change, we first train a time series deep learning algorithm model using the data before the financial crisis (2000~2006). The parameter tuning of the existing model and the deep learning time series algorithm is conducted with validation data including the financial crisis period (2007~2008). As a result, we construct a model that shows similar pattern to the results of the learning data and shows excellent prediction power. After that, each bankruptcy prediction model is restructured by integrating the learning data and validation data again (2000 ~ 2008), applying the optimal parameters as in the previous validation. Finally, each corporate default prediction model is evaluated and compared using test data (2009) based on the trained models over nine years. Then, the usefulness of the corporate default prediction model based on the deep learning time series algorithm is proved. In addition, by adding the Lasso regression analysis to the existing methods (multiple discriminant analysis, logit model) which select the variables, it is proved that the deep learning time series algorithm model based on the three bundles of variables is useful for robust corporate default prediction. The definition of bankruptcy used is the same as that of Lee (2015). Independent variables include financial information such as financial ratios used in previous studies. Multivariate discriminant analysis, logit model, and Lasso regression model are used to select the optimal variable group. The influence of the Multivariate discriminant analysis model proposed by Altman (1968), the Logit model proposed by Ohlson (1980), the non-time series machine learning algorithms, and the deep learning time series algorithms are compared. In the case of corporate data, there are limitations of 'nonlinear variables', 'multi-collinearity' of variables, and 'lack of data'. While the logit model is nonlinear, the Lasso regression model solves the multi-collinearity problem, and the deep learning time series algorithm using the variable data generation method complements the lack of data. Big Data Technology, a leading technology in the future, is moving from simple human analysis, to automated AI analysis, and finally towards future intertwined AI applications. Although the study of the corporate default prediction model using the time series algorithm is still in its early stages, deep learning algorithm is much faster than regression analysis at corporate default prediction modeling. Also, it is more effective on prediction power. Through the Fourth Industrial Revolution, the current government and other overseas governments are working hard to integrate the system in everyday life of their nation and society. Yet the field of deep learning time series research for the financial industry is still insufficient. This is an initial study on deep learning time series algorithm analysis of corporate defaults. Therefore it is hoped that it will be used as a comparative analysis data for non-specialists who start a study combining financial data and deep learning time series algorithm.

The actual aspects of North Korea's 1950s Changgeuk through the Chunhyangjeon in the film Moranbong(1958) and the album Corée Moranbong(1960) (영화 <모란봉>(1958)과 음반 (1960) 수록 <춘향전>을 통해 본 1950년대 북한 창극의 실제적 양상)

  • Song, Mi-Kyoung
    • (The) Research of the performance art and culture
    • /
    • no.43
    • /
    • pp.5-46
    • /
    • 2021
  • The film Moranbong is the product of a trip to North Korea in 1958, when Armangati, Chris Marker, Claude Lantzmann, Francis Lemarck and Jean-Claude Bonardo left at the invitation of Joseon Film. However, for political reasons, the film was not immediately released, and it was not until 2010 that it was rediscovered and received attention. The movie consists of the narratives of Young-ran and Dong-il, set in the Korean War, that are folded into the narratives of Chunhyang and Mongryong in the classic Chunhyangjeon of Joseon. At this time, Joseon's classics are reproduced in the form of the drama Chunhyangjeon, which shares the time zone with the two main characters, and the two narratives are covered in a total of six scenes. There are two layers of middle-story frames in the movie, and if the same narrative is set in North Korea in the 1950s, there is an epic produced by the producers and actors of the Changgeuk Chunhyangjeon and the Changgeuk Chunhyangjeon as a complete work. In the outermost frame of the movie, Dong-il is the main character, but in the inner double frame, Young-ran, who is an actor growing up with the Changgeuk Chunhyangjeon and a character in the Changgeuk Chunhyangjeon, is the center. The following three OST albums are Corée Moranbong released in France in 1960, Musique de corée released in 1970, and 朝鮮の伝統音樂-唱劇 「春香伝」と伝統樂器- released in 1968 in Japan. While Corée Moranbong consists only of the music from the film Moranbong, the two subsequent albums included additional songs collected and recorded by Pyongyang National Broadcasting System. However, there is no information about the movie Moranbong on the album released in Japan. Under the circumstances, it is highly likely that the author of the record label or music commentary has not confirmed the existence of the movie Moranbong, and may have intentionally excluded related contents due to the background of the film's ban on its release. The results of analyzing the detailed scenes of the Changgeuk Chunhyangjeon, Farewell Song, Sipjang-ga, Chundangsigwa, Bakseokti and Prison Song in the movie Moranbong or OST album in the 1950s are as follows. First, the process of establishing the North Korean Changgeuk Chunhyangjeon in the 1950s was confirmed. The play, compiled in 1955 through the Joseon Changgeuk Collection, was settled in the form of a Changgeuk that can be performed in the late 1950s by the Changgeuk Chunhyangjeon between 1956 and 1958. Since the 1960s, Chunhyangjeon has no longer been performed as a traditional pansori-style Changgeuk, so the film Moranbong and the album Corée moranbong are almost the last records to capture the Changgeuk Chunhyangjeon and its music. Second, we confirmed the responses of the actors to the controversy over Takseong in the North Korean creative world in the 1950s. Until 1959, there was a voice of criticism surrounding Takseong and a voice of advocacy that it was also a national characteristic. Shin Woo-sun, who almost eliminated Takseong with clear and high-pitched phrases, air man who changed according to the situation, who chose Takseong but did not actively remove Takseong, Lim So-hyang, who tried to maintain his own tone while accepting some of modern vocalization. Although Cho Sang-sun and Lim So-hyang were also guaranteed roles to continue their voices, the selection/exclusion patterns in the movie Moranbong were linked to the Takseong removal guidelines required by North Korean musicians in the name of Dang and People in the 1950s. Second, Changgeuk actors' response to the controversy over the turbidity of the North Korean Changgeuk community in the 1950s was confirmed. Until 1959, there were voices of criticism and support surrounding Taksung in North Korea. Shin Woo-sun, who showed consistent performance in removing turbidity with clear, high-pitched vocal sounds, Gong Gi-nam, who did not actively remove turbidity depending on the situation, Cho Sang-sun, who accepted some of the vocalization required by the party, while maintaining his original tone. On the other hand, Cho Sang-seon and Lim So-hyang were guaranteed roles to continue their sounds, but the selection/exclusion patterns of Moranbong was independently linked to the guidelines for removing turbidity that the Gugak musicians who crossed to North Korea had been asked for.

Stock Price Prediction by Utilizing Category Neutral Terms: Text Mining Approach (카테고리 중립 단어 활용을 통한 주가 예측 방안: 텍스트 마이닝 활용)

  • Lee, Minsik;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.123-138
    • /
    • 2017
  • Since the stock market is driven by the expectation of traders, studies have been conducted to predict stock price movements through analysis of various sources of text data. In order to predict stock price movements, research has been conducted not only on the relationship between text data and fluctuations in stock prices, but also on the trading stocks based on news articles and social media responses. Studies that predict the movements of stock prices have also applied classification algorithms with constructing term-document matrix in the same way as other text mining approaches. Because the document contains a lot of words, it is better to select words that contribute more for building a term-document matrix. Based on the frequency of words, words that show too little frequency or importance are removed. It also selects words according to their contribution by measuring the degree to which a word contributes to correctly classifying a document. The basic idea of constructing a term-document matrix was to collect all the documents to be analyzed and to select and use the words that have an influence on the classification. In this study, we analyze the documents for each individual item and select the words that are irrelevant for all categories as neutral words. We extract the words around the selected neutral word and use it to generate the term-document matrix. The neutral word itself starts with the idea that the stock movement is less related to the existence of the neutral words, and that the surrounding words of the neutral word are more likely to affect the stock price movements. And apply it to the algorithm that classifies the stock price fluctuations with the generated term-document matrix. In this study, we firstly removed stop words and selected neutral words for each stock. And we used a method to exclude words that are included in news articles for other stocks among the selected words. Through the online news portal, we collected four months of news articles on the top 10 market cap stocks. We split the news articles into 3 month news data as training data and apply the remaining one month news articles to the model to predict the stock price movements of the next day. We used SVM, Boosting and Random Forest for building models and predicting the movements of stock prices. The stock market opened for four months (2016/02/01 ~ 2016/05/31) for a total of 80 days, using the initial 60 days as a training set and the remaining 20 days as a test set. The proposed word - based algorithm in this study showed better classification performance than the word selection method based on sparsity. This study predicted stock price volatility by collecting and analyzing news articles of the top 10 stocks in market cap. We used the term - document matrix based classification model to estimate the stock price fluctuations and compared the performance of the existing sparse - based word extraction method and the suggested method of removing words from the term - document matrix. The suggested method differs from the word extraction method in that it uses not only the news articles for the corresponding stock but also other news items to determine the words to extract. In other words, it removed not only the words that appeared in all the increase and decrease but also the words that appeared common in the news for other stocks. When the prediction accuracy was compared, the suggested method showed higher accuracy. The limitation of this study is that the stock price prediction was set up to classify the rise and fall, and the experiment was conducted only for the top ten stocks. The 10 stocks used in the experiment do not represent the entire stock market. In addition, it is difficult to show the investment performance because stock price fluctuation and profit rate may be different. Therefore, it is necessary to study the research using more stocks and the yield prediction through trading simulation.