• Title/Summary/Keyword: CLASSIFICATION

Search Result 22,636, Processing Time 0.049 seconds

Recommender system using BERT sentiment analysis (BERT 기반 감성분석을 이용한 추천시스템)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.2
    • /
    • pp.1-15
    • /
    • 2021
  • If it is difficult for us to make decisions, we ask for advice from friends or people around us. When we decide to buy products online, we read anonymous reviews and buy them. With the advent of the Data-driven era, IT technology's development is spilling out many data from individuals to objects. Companies or individuals have accumulated, processed, and analyzed such a large amount of data that they can now make decisions or execute directly using data that used to depend on experts. Nowadays, the recommender system plays a vital role in determining the user's preferences to purchase goods and uses a recommender system to induce clicks on web services (Facebook, Amazon, Netflix, Youtube). For example, Youtube's recommender system, which is used by 1 billion people worldwide every month, includes videos that users like, "like" and videos they watched. Recommended system research is deeply linked to practical business. Therefore, many researchers are interested in building better solutions. Recommender systems use the information obtained from their users to generate recommendations because the development of the provided recommender systems requires information on items that are likely to be preferred by the user. We began to trust patterns and rules derived from data rather than empirical intuition through the recommender systems. The capacity and development of data have led machine learning to develop deep learning. However, such recommender systems are not all solutions. Proceeding with the recommender systems, there should be no scarcity in all data and a sufficient amount. Also, it requires detailed information about the individual. The recommender systems work correctly when these conditions operate. The recommender systems become a complex problem for both consumers and sellers when the interaction log is insufficient. Because the seller's perspective needs to make recommendations at a personal level to the consumer and receive appropriate recommendations with reliable data from the consumer's perspective. In this paper, to improve the accuracy problem for "appropriate recommendation" to consumers, the recommender systems are proposed in combination with context-based deep learning. This research is to combine user-based data to create hybrid Recommender Systems. The hybrid approach developed is not a collaborative type of Recommender Systems, but a collaborative extension that integrates user data with deep learning. Customer review data were used for the data set. Consumers buy products in online shopping malls and then evaluate product reviews. Rating reviews are based on reviews from buyers who have already purchased, giving users confidence before purchasing the product. However, the recommendation system mainly uses scores or ratings rather than reviews to suggest items purchased by many users. In fact, consumer reviews include product opinions and user sentiment that will be spent on evaluation. By incorporating these parts into the study, this paper aims to improve the recommendation system. This study is an algorithm used when individuals have difficulty in selecting an item. Consumer reviews and record patterns made it possible to rely on recommendations appropriately. The algorithm implements a recommendation system through collaborative filtering. This study's predictive accuracy is measured by Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). Netflix is strategically using the referral system in its programs through competitions that reduce RMSE every year, making fair use of predictive accuracy. Research on hybrid recommender systems combining the NLP approach for personalization recommender systems, deep learning base, etc. has been increasing. Among NLP studies, sentiment analysis began to take shape in the mid-2000s as user review data increased. Sentiment analysis is a text classification task based on machine learning. The machine learning-based sentiment analysis has a disadvantage in that it is difficult to identify the review's information expression because it is challenging to consider the text's characteristics. In this study, we propose a deep learning recommender system that utilizes BERT's sentiment analysis by minimizing the disadvantages of machine learning. This study offers a deep learning recommender system that uses BERT's sentiment analysis by reducing the disadvantages of machine learning. The comparison model was performed through a recommender system based on Naive-CF(collaborative filtering), SVD(singular value decomposition)-CF, MF(matrix factorization)-CF, BPR-MF(Bayesian personalized ranking matrix factorization)-CF, LSTM, CNN-LSTM, GRU(Gated Recurrent Units). As a result of the experiment, the recommender system based on BERT was the best.

A Study on the Characteristics of Paridae Nesting Material by Urban Green Area Type (도시녹지 유형별 박새과 둥지 재료 특성 연구)

  • Kim, Kyeong-Tae;Lee, Hyun-Jung;Kim, Whee-Moon;Kim, Seoung-Yeal;Song, Wonkyong
    • Korean Journal of Environment and Ecology
    • /
    • v.35 no.3
    • /
    • pp.256-264
    • /
    • 2021
  • Rapid urbanization around the world has negatively affected wildlife habitats, including birds. Wild birds settled in the city are adapting to the changed surroundings, and are typically known to make nests using materials that are easy to find around the city. This study was conducted for the purpose of analyzing the nesting materials on the Paridae using artificial bird nests installed in cities. In this study, the researchers established a total of 33 artificial bird nests in urban parks (22) and forests (11) in Cheonan-si, Chungcheongnam-do. Then we collected 4 artificial bird nests in urban parks (18.19%) and 5 in urban forests (45.46%) to compare the characteristics of bird nest materials by the nest, species, and urban green area types. Eight nests, excluding a nest abandoned by a pair of Paridae, were used for the material analysis. The collected nests were dried, and classified into natural materials (vegetable materials, animal materials, moss, and soil) and artificial materials (cotton, paper pieces, plastics, vinyl, and synthetic fibers), and then each nest was weighed. The classification result shows that the portion of moss (50.65%) was the highest in all nests, followed by soil (21.43%), artificial material (13.95%), vegetable material (5.78%), animal material (4.57%), and others (3.59%) in that order. Artificial materials were used in all nests in urban green areas. Moreover, although the Paridae used about 5.16% more vegetable material than the Parus varius, it was not significant (t=2.17, p=0.07). Plant materials and soil were most preferred in urban forests, and moss, animal, and artificial materials were widely used in that order in urban parks. There was a significant difference in the use of vegetable materials between urban parks and urban forests (t=3.07, p<0.05*). In the habitats like urbanized and dry areas, where artificial materials are highly accessible, artificial materials replaced some roles of natural materials. This study is a basic study for the analysis of the types of materials used in artificial bird nests to understand the habitat system of urban ecosystems. It can be used as the basic data for ecological studies and conservation of the Paridae species.

A Study on the Structural Reinforcement of the Modified Caisson Floating Dock (개조된 케이슨 플로팅 도크의 구조 보강에 대한 연구)

  • Kim, Hong-Jo;Seo, Kwang-Cheol;Park, Joo-Shin
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.27 no.1
    • /
    • pp.172-178
    • /
    • 2021
  • In the ship repair market, interest in maintenance and repair is steadily increasing due to the reinforcement of prevention of environmental pollution caused by ships and the reinforcement of safety standards for ship structures. By reflecting this effect, the number of requests for repairs by foreign shipping companies increases to repair shipbuilders in the Southwest Sea. However, because most of the repair shipbuilders in the southwestern area are small and medium-sized companies, it is difficult to lead to the integrated synergy effect of the repair shipbuilding companies. Moreover, the infrastructure is not integrated; hence, using the infrastructure jointly is a challenge, which acts as an obstacle to the activation of the repair shipbuilding industry. Floating docks are indispensable to operating the repair shipbuilding business; in addition, most of them are operated through renovation/repair after importing aging caisson docks from overseas. However, their service life is more than 30 years; additionally, there is no structure inspection standard. Therefore, it is vulnerable to the safety field. In this study, the finite element analysis program of ANSYS was used to evaluate the structural safety of the modified caisson dock and obtain additional structural reinforcement schemes to solve the derived problems. For the floating docks, there are classification regulations; however, concerning structural strength, the regulations are insufficient, and the applicability is inferior. These insufficient evaluation areas were supplemented through a detailed structural FE-analysis. The reinforcement plan was decided by reinforcing the pontoon deck and reinforcement of the side tank, considering the characteristics of the repair shipyard condition. The final plan was selected to reinforce the side wing tank through the structural analysis of the decision; in addition, the actual structure was fabricated to reflect the reinforcement plan. Our results can be used as reference data for improving the structural strength of similar facilities; we believe that the optimal solution can be found quickly if this method is used during renovation/repair.

Performance of Investment Strategy using Investor-specific Transaction Information and Machine Learning (투자자별 거래정보와 머신러닝을 활용한 투자전략의 성과)

  • Kim, Kyung Mock;Kim, Sun Woong;Choi, Heung Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.65-82
    • /
    • 2021
  • Stock market investors are generally split into foreign investors, institutional investors, and individual investors. Compared to individual investor groups, professional investor groups such as foreign investors have an advantage in information and financial power and, as a result, foreign investors are known to show good investment performance among market participants. The purpose of this study is to propose an investment strategy that combines investor-specific transaction information and machine learning, and to analyze the portfolio investment performance of the proposed model using actual stock price and investor-specific transaction data. The Korea Exchange offers daily information on the volume of purchase and sale of each investor to securities firms. We developed a data collection program in C# programming language using an API provided by Daishin Securities Cybosplus, and collected 151 out of 200 KOSPI stocks with daily opening price, closing price and investor-specific net purchase data from January 2, 2007 to July 31, 2017. The self-organizing map model is an artificial neural network that performs clustering by unsupervised learning and has been introduced by Teuvo Kohonen since 1984. We implement competition among intra-surface artificial neurons, and all connections are non-recursive artificial neural networks that go from bottom to top. It can also be expanded to multiple layers, although many fault layers are commonly used. Linear functions are used by active functions of artificial nerve cells, and learning rules use Instar rules as well as general competitive learning. The core of the backpropagation model is the model that performs classification by supervised learning as an artificial neural network. We grouped and transformed investor-specific transaction volume data to learn backpropagation models through the self-organizing map model of artificial neural networks. As a result of the estimation of verification data through training, the portfolios were rebalanced monthly. For performance analysis, a passive portfolio was designated and the KOSPI 200 and KOSPI index returns for proxies on market returns were also obtained. Performance analysis was conducted using the equally-weighted portfolio return, compound interest rate, annual return, Maximum Draw Down, standard deviation, and Sharpe Ratio. Buy and hold returns of the top 10 market capitalization stocks are designated as a benchmark. Buy and hold strategy is the best strategy under the efficient market hypothesis. The prediction rate of learning data using backpropagation model was significantly high at 96.61%, while the prediction rate of verification data was also relatively high in the results of the 57.1% verification data. The performance evaluation of self-organizing map grouping can be determined as a result of a backpropagation model. This is because if the grouping results of the self-organizing map model had been poor, the learning results of the backpropagation model would have been poor. In this way, the performance assessment of machine learning is judged to be better learned than previous studies. Our portfolio doubled the return on the benchmark and performed better than the market returns on the KOSPI and KOSPI 200 indexes. In contrast to the benchmark, the MDD and standard deviation for portfolio risk indicators also showed better results. The Sharpe Ratio performed higher than benchmarks and stock market indexes. Through this, we presented the direction of portfolio composition program using machine learning and investor-specific transaction information and showed that it can be used to develop programs for real stock investment. The return is the result of monthly portfolio composition and asset rebalancing to the same proportion. Better outcomes are predicted when forming a monthly portfolio if the system is enforced by rebalancing the suggested stocks continuously without selling and re-buying it. Therefore, real transactions appear to be relevant.

Spectral Band Selection for Detecting Fire Blight Disease in Pear Trees by Narrowband Hyperspectral Imagery (초분광 이미지를 이용한 배나무 화상병에 대한 최적 분광 밴드 선정)

  • Kang, Ye-Seong;Park, Jun-Woo;Jang, Si-Hyeong;Song, Hye-Young;Kang, Kyung-Suk;Ryu, Chan-Seok;Kim, Seong-Heon;Jun, Sae-Rom;Kang, Tae-Hwan;Kim, Gul-Hwan
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.23 no.1
    • /
    • pp.15-33
    • /
    • 2021
  • In this study, the possibility of discriminating Fire blight (FB) infection tested using the hyperspectral imagery. The reflectance of healthy and infected leaves and branches was acquired with 5 nm of full width at high maximum (FWHM) and then it was standardized to 10 nm, 25 nm, 50 nm, and 80 nm of FWHM. The standardized samples were divided into training and test sets at ratios of 7:3, 5:5 and 3:7 to find the optimal bands of FWHM by the decision tree analysis. Classification accuracy was evaluated using overall accuracy (OA) and kappa coefficient (KC). The hyperspectral reflectance of infected leaves and branches was significantly lower than those of healthy green, red-edge (RE) and near infrared (NIR) regions. The bands selected for the first node were generally 750 and 800 nm; these were used to identify the infection of leaves and branches, respectively. The accuracy of the classifier was higher in the 7:3 ratio. Four bands with 50 nm of FWHM (450, 650, 750, and 950 nm) might be reasonable because the difference in the recalculated accuracy between 8 bands with 10 nm of FWHM (440, 580, 640, 660, 680, 710, 730, and 740 nm) and 4 bands was only 1.8% for OA and 4.1% for KC, respectively. Finally, adding two bands (550 nm and 800 nm with 25 nm of FWHM) in four bands with 50 nm of FWHM have been proposed to improve the usability of multispectral image sensors with performing various roles in agriculture as well as detecting FB with other combinations of spectral bands.

Comparison of Models for Stock Price Prediction Based on Keyword Search Volume According to the Social Acceptance of Artificial Intelligence (인공지능의 사회적 수용도에 따른 키워드 검색량 기반 주가예측모형 비교연구)

  • Cho, Yujung;Sohn, Kwonsang;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.1
    • /
    • pp.103-128
    • /
    • 2021
  • Recently, investors' interest and the influence of stock-related information dissemination are being considered as significant factors that explain stock returns and volume. Besides, companies that develop, distribute, or utilize innovative new technologies such as artificial intelligence have a problem that it is difficult to accurately predict a company's future stock returns and volatility due to macro-environment and market uncertainty. Market uncertainty is recognized as an obstacle to the activation and spread of artificial intelligence technology, so research is needed to mitigate this. Hence, the purpose of this study is to propose a machine learning model that predicts the volatility of a company's stock price by using the internet search volume of artificial intelligence-related technology keywords as a measure of the interest of investors. To this end, for predicting the stock market, we using the VAR(Vector Auto Regression) and deep neural network LSTM (Long Short-Term Memory). And the stock price prediction performance using keyword search volume is compared according to the technology's social acceptance stage. In addition, we also conduct the analysis of sub-technology of artificial intelligence technology to examine the change in the search volume of detailed technology keywords according to the technology acceptance stage and the effect of interest in specific technology on the stock market forecast. To this end, in this study, the words artificial intelligence, deep learning, machine learning were selected as keywords. Next, we investigated how many keywords each week appeared in online documents for five years from January 1, 2015, to December 31, 2019. The stock price and transaction volume data of KOSDAQ listed companies were also collected and used for analysis. As a result, we found that the keyword search volume for artificial intelligence technology increased as the social acceptance of artificial intelligence technology increased. In particular, starting from AlphaGo Shock, the keyword search volume for artificial intelligence itself and detailed technologies such as machine learning and deep learning appeared to increase. Also, the keyword search volume for artificial intelligence technology increases as the social acceptance stage progresses. It showed high accuracy, and it was confirmed that the acceptance stages showing the best prediction performance were different for each keyword. As a result of stock price prediction based on keyword search volume for each social acceptance stage of artificial intelligence technologies classified in this study, the awareness stage's prediction accuracy was found to be the highest. The prediction accuracy was different according to the keywords used in the stock price prediction model for each social acceptance stage. Therefore, when constructing a stock price prediction model using technology keywords, it is necessary to consider social acceptance of the technology and sub-technology classification. The results of this study provide the following implications. First, to predict the return on investment for companies based on innovative technology, it is most important to capture the recognition stage in which public interest rapidly increases in social acceptance of the technology. Second, the change in keyword search volume and the accuracy of the prediction model varies according to the social acceptance of technology should be considered in developing a Decision Support System for investment such as the big data-based Robo-advisor recently introduced by the financial sector.

Identification of Bird Community Characteristics by Habitat Environment of Jeongmaek Using Self-organizing Map - Case Stuty Area Geumnamhonam and Honam, Hannamgeumbuk and Geumbuk, Naknam Jeongmaek, South Korea - (자기조직화지도를 활용한 정맥의 서식지 환경에 따른 조류 군집 특성 파악 - 금남호남 및 호남정맥, 한남금북 및 금북정맥, 낙남정맥을 대상으로 -)

  • Hwang, Jong-Kyeong;Kang, Te-han;Han, Seung-Woo;Cho, Hae-Jin;Nam, Hyung-Kyu;Kim, Su-Jin;Lee, Joon-Woo
    • Korean Journal of Environment and Ecology
    • /
    • v.35 no.4
    • /
    • pp.377-386
    • /
    • 2021
  • This study was conducted to provide basic data for habitat management and preservation of Jeongmaek. A total of 18 priority research areas were selected with consideration to terrain and habitat environment, and 54 fixed plots were selected for three types of habits: development, valley, and forest road and ridge. The survey was conducted in each season (May, August, and October), excluding the winter season, from 2016 to 2018. The distribution analysis of birds observed in each habitat type using a self-organizing map (SOM) classified them into a total of four groups (MRPP, A=0.12, and p <0.005). The comparative analysis of the number of species, the number of individuals, and the species diversity index for each SOM group showed that they were all the highest in group III (Kruskal-Wallis, the number species: x2 = 13.436, P <0.005; the number of individuals: x2 = 8.229, P <0.05; the species diversity index: x2 = 17.115, P <0.005). Moreover, the analysis by applying the land cover map to the random forest model to examine the index species of each group and identify the characteristics of the habitat environment showed a difference in the ratio of the habitat environment and the indicator species among the four groups. The index species analysis identified a total of 18 bird species as the indicator species in three groups except for group II. When applying the random forest model and indicator species analysis to the results of classification into four groups using the SOM, the composition of the indicator species by the group showed a correlation with the habitat characteristics of each group. Moreover, the distribution patterns and densities of observed species were clearly distinguished according to the dominant habitat for each group. The results of the analysis that applied the SOM, indicator species, and random forest model together can derive useful results for the characterization of bird habitats according to the habitat environment.

A Numerical Analysis to Estimate Disposal Spacing and Rock Mass Condition for High Efficiency Repository Based on Temperature Criteria of Bentonite Buffer (벤토나이트 완충재 설계 기준 온도에 따른 고효율 처분시스템 처분 간격 및 암반 조건 산정을 위한 수치해석적 연구)

  • Kim, Kwang-Il;Lee, Changsoo;Kim, Jin-Seop;Cho, Dongkeun
    • Tunnel and Underground Space
    • /
    • v.31 no.4
    • /
    • pp.289-308
    • /
    • 2021
  • This study conducts coupled thermo-hydro-mechanical numerical modeling to investigate the maximum temperature and conditions for securing mechanical stability of the high-level radioactive waste repository when temperature criteria of bentonite buffer are 100℃ and 125℃, respectively. In case of temperature criterion of buffer as 100℃, the maximum temperatures at the interface between canister and buffer are calculated to be 99.4℃ and 99.8℃, respectively for a case with disposal tunnel spacing of 40 m and deposition hole spacing of 5.5 m and for the other case with disposal tunnel spacing of 30 m and deposition hole spacing of 6.5 m. In case of temperature criterion of buffer as 125℃, spacings of disposal tunnel and deposition hole could be decreased to 30 m and 4.5 m, respectively, which reduces the disposal area up to 55% compared to the disposal area of KRS+. According to analysis of mechanical stability for various disposal spacings, RMR of rock mass for KRS+ should be larger than 72.4 which belongs to good rock in RMR classification to prevent failure of rock mass. As disposal spacing is decreased, required RMR of rock mass is increased. In order to prevent failure of rock mass for a case with disposal tunnel spacing of 30 m and deposition hole spacing of 4.5 m, RMR larger than 87.3 is needed. However, mechanical stability of the repository is secured for all cases with RMR over 75 considering the enhancement of rock strength due to confining stress induced by swelling of the bentonite buffer and backfill.

A Study on the Frame of Reference of the Korean Welfare State Model Focusing on Esping-Anderson's Wel fare State Regime (에스핑-앤더슨의 복지국가체제를 중심으로 한국형 복지국가의 준거 틀에 관한 연구)

  • Jung, Hyun-Kyung
    • Industry Promotion Research
    • /
    • v.7 no.2
    • /
    • pp.43-49
    • /
    • 2022
  • This study aims to study Esping-Anderson's theory of welfare state system, develop a model of welfare state suitable for Korea's situation, and apply it to reality. In this research method, basic research and analysis of ideology is used, focusing on Esping-Anderson's welfare state system theory, and applying it appropriately to the Korean situation. Studies on the model of the welfare state have been studied after the classification of complementary and institutional models asserted by Willensky and Lebo in 1965. In addition, Esping-Andersen asserts three things as a model of the welfare state according to ideology. First, the role of the market is central to the liberal welfare system that best fits the image of classical capitalism, and individualistic solidarity through the market. The role of the state or family, which can be a hindrance, is actually marginalized. In addition, in order to maximize individualistic solidarity through the market, de-commodification in the national domain tends to be minimized. Second, the conservative welfare system has a strong familistic element, so the source of social solidarity is the family, and the state plays a role of supporting and supplementing the characteristics of this family. In the conservative system, de-commodification appears to be high among household heads, or the welfare system takes on a corporatist and nationalistic form, it can be said that these characteristics are reflected. Third, in the social democratic welfare system, the source of social solidarity is the state. Therefore, the role of the state is large, the state has a high possibility of decommodification, and it has the characteristics of substitutes for the family and the market through universalist intervention. This study applies Esping-Anderson's three welfare state models to study a model suitable for the Korean situation. In conclusion, Esping-Anderson's three welfare state models can be classified into a market-oriented model based on a liberal welfare system, a status-oriented model based on a conservative corporatist welfare system, and a solidarity-oriented model based on a social-democratic welfare system, presented a compromise between liberalism and conservatism as a Korean model.

The prediction of the stock price movement after IPO using machine learning and text analysis based on TF-IDF (증권신고서의 TF-IDF 텍스트 분석과 기계학습을 이용한 공모주의 상장 이후 주가 등락 예측)

  • Yang, Suyeon;Lee, Chaerok;Won, Jonggwan;Hong, Taeho
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.2
    • /
    • pp.237-262
    • /
    • 2022
  • There has been a growing interest in IPOs (Initial Public Offerings) due to the profitable returns that IPO stocks can offer to investors. However, IPOs can be speculative investments that may involve substantial risk as well because shares tend to be volatile, and the supply of IPO shares is often highly limited. Therefore, it is crucially important that IPO investors are well informed of the issuing firms and the market before deciding whether to invest or not. Unlike institutional investors, individual investors are at a disadvantage since there are few opportunities for individuals to obtain information on the IPOs. In this regard, the purpose of this study is to provide individual investors with the information they may consider when making an IPO investment decision. This study presents a model that uses machine learning and text analysis to predict whether an IPO stock price would move up or down after the first 5 trading days. Our sample includes 691 Korean IPOs from June 2009 to December 2020. The input variables for the prediction are three tone variables created from IPO prospectuses and quantitative variables that are either firm-specific, issue-specific, or market-specific. The three prospectus tone variables indicate the percentage of positive, neutral, and negative sentences in a prospectus, respectively. We considered only the sentences in the Risk Factors section of a prospectus for the tone analysis in this study. All sentences were classified into 'positive', 'neutral', and 'negative' via text analysis using TF-IDF (Term Frequency - Inverse Document Frequency). Measuring the tone of each sentence was conducted by machine learning instead of a lexicon-based approach due to the lack of sentiment dictionaries suitable for Korean text analysis in the context of finance. For this reason, the training set was created by randomly selecting 10% of the sentences from each prospectus, and the sentence classification task on the training set was performed after reading each sentence in person. Then, based on the training set, a Support Vector Machine model was utilized to predict the tone of sentences in the test set. Finally, the machine learning model calculated the percentages of positive, neutral, and negative sentences in each prospectus. To predict the price movement of an IPO stock, four different machine learning techniques were applied: Logistic Regression, Random Forest, Support Vector Machine, and Artificial Neural Network. According to the results, models that use quantitative variables using technical analysis and prospectus tone variables together show higher accuracy than models that use only quantitative variables. More specifically, the prediction accuracy was improved by 1.45% points in the Random Forest model, 4.34% points in the Artificial Neural Network model, and 5.07% points in the Support Vector Machine model. After testing the performance of these machine learning techniques, the Artificial Neural Network model using both quantitative variables and prospectus tone variables was the model with the highest prediction accuracy rate, which was 61.59%. The results indicate that the tone of a prospectus is a significant factor in predicting the price movement of an IPO stock. In addition, the McNemar test was used to verify the statistically significant difference between the models. The model using only quantitative variables and the model using both the quantitative variables and the prospectus tone variables were compared, and it was confirmed that the predictive performance improved significantly at a 1% significance level.