• Title/Summary/Keyword: parameters (매개변수들)

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Model (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon; Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems, v.25 no.4, pp.141-154, 2019
  • Internet technology and social media are growing rapidly, and data mining technology has evolved to handle unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish low-quality from high-quality content using text data about products, and it has proliferated within text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning them to predefined categories such as positive and negative. It has been studied in various directions with respect to accuracy, from simple rule-based approaches to dictionary-based approaches using predefined labels. Sentiment analysis is one of the most active research topics in natural language processing and is widely studied in text mining. Because real online reviews are openly available to others, they are easy to collect, and they also affect business. In marketing, real-world information from customers is gathered from websites rather than surveys. Whether a website's posts are positive or negative is reflected in sales, so companies try to identify this information. However, many reviews on a website are not always good and are difficult to identify. Earlier studies in this research area used review data from the Amazon.com shopping mall, whereas recent studies use data on stock market trends, blogs, news articles, weather forecasts, IMDB, Facebook, and so on. However, a lack of accuracy is recognized because sentiment calculations change according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity of sentiment into positive and negative categories and to increase the prediction accuracy of polarity analysis using the pretrained IMDB review data set. 
First, as comparative models, the study adopts popular machine learning algorithms for text classification in sentiment analysis: NB (naive Bayes), SVM (support vector machines), XGBoost, RF (random forests), and Gradient Boost. Second, deep learning has been shown to extract complex, discriminative features from data. Representative algorithms are CNN (convolutional neural networks), RNN (recurrent neural networks), and LSTM (long short-term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but it does not consider the sequential attributes of the data. RNN handles ordered data well because it takes the temporal information of the data into account, but it suffers from the long-term dependency problem. LSTM is used to solve this problem. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to the classical machine learning algorithms, CNN, LSTM, and the integrated model were analyzed. Although the algorithms have many parameters, we examined the relationship between parameter values and precision to find the optimal combination. We also tried to understand how, and how well, the models work for sentiment analysis. This study proposes an integrated CNN and LSTM algorithm to extract the positive and negative features of text. The reasons for combining these two algorithms are as follows. CNN can automatically extract features for classification by applying convolution layers and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM's input, output, and forget gates can be opened and closed at the desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can compensate for the CNN's inability to capture long-term dependencies. 
Furthermore, when LSTM is applied to the CNN's pooling layer, the model has an end-to-end structure, so that spatial and temporal features can be learned simultaneously. The combined CNN-LSTM model achieved an accuracy of 90.33%. It is slower than CNN but faster than LSTM, and it was more accurate than the other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can compensate for the weaknesses of each model, and the end-to-end structure of LSTM has the advantage of improving learning layer by layer. For these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.
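
The data flow described in this abstract (convolution, pooling, then an LSTM with input, forget, and output gates) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the scalar "embeddings", the random weights, the kernel, the pool size, and every function name here are hypothetical, and the assumption that the LSTM consumes the pooled CNN features one step at a time is an interpretation of the abstract.

```python
import math
import random

def conv1d(seq, kernel):
    """Valid 1-D convolution (cross-correlation) over scalars, followed by ReLU."""
    k = len(kernel)
    return [max(0.0, sum(seq[i + j] * kernel[j] for j in range(k)))
            for i in range(len(seq) - k + 1)]

def max_pool(seq, size):
    """Non-overlapping max pooling with the given window size."""
    return [max(seq[i:i + size]) for i in range(0, len(seq) - size + 1, size)]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_step(x, h, c, w):
    """One scalar LSTM step; w maps a gate name to (w_x, w_h, bias)."""
    def pre(name):
        w_x, w_h, b = w[name]
        return w_x * x + w_h * h + b
    i = sigmoid(pre("input"))        # how much new information to write
    f = sigmoid(pre("forget"))       # how much of the old cell state to keep
    o = sigmoid(pre("output"))       # how much of the cell state to expose
    g = math.tanh(pre("candidate"))  # candidate value for the cell state
    c_new = f * c + i * g
    h_new = o * math.tanh(c_new)
    return h_new, c_new

def cnn_lstm_score(token_values, kernel, pool_size, w, w_out, b_out):
    """Convolve and pool the sequence, run the LSTM over the pooled features,
    and map the final hidden state to a score in (0, 1)."""
    features = max_pool(conv1d(token_values, kernel), pool_size)
    h, c = 0.0, 0.0
    for x in features:
        h, c = lstm_step(x, h, c, w)
    return sigmoid(w_out * h + b_out)

# Hypothetical untrained weights and inputs, for illustration only.
rng = random.Random(0)
gate_names = ("input", "forget", "output", "candidate")
weights = {name: (rng.uniform(-1, 1), rng.uniform(-1, 1), 0.0) for name in gate_names}
tokens = [rng.uniform(-1, 1) for _ in range(12)]
score = cnn_lstm_score(tokens, kernel=[0.5, -0.25, 0.75], pool_size=2,
                       w=weights, w_out=1.0, b_out=0.0)
print(f"positive-class score (untrained): {score:.3f}")
```

With trained weights, a score above 0.5 would be read as a positive review. A practical model would use vector embeddings and many filters, typically through a deep learning library rather than hand-written loops.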

Research on Perfusion CT in Rabbit Brain Tumor Model (토끼 뇌종양 모델에서의 관류 CT 영상에 관한 연구)

  • Ha, Bon-Chul; Kwak, Byung-Kook; Jung, Ji-Sung; Lim, Cheong-Hwan; Jung, Hong-Ryang
    • Journal of radiological science and technology, v.35 no.2, pp.165-172, 2012
  • We investigated the vascular characteristics of tumors and normal tissue using perfusion CT in a rabbit brain tumor model. VX2 carcinoma at a concentration of $1{\times}10^7$ cells/ml (0.1 ml) was implanted in the brains of nine New Zealand white rabbits (weight: 2.4-3.0 kg, mean: 2.6 kg). Perfusion CT was performed when the tumors had grown to 5 mm. The tumor volume and perfusion values were quantitatively analyzed using a commercial workstation (Advantage Windows workstation, AW, version 4.2, GE, USA). The mean volume of the implanted tumors was $316{\pm}181$ $mm^3$, and the largest and smallest tumor volumes were 497 $mm^3$ and 195 $mm^3$, respectively. All the implanted tumors were single-nodular, and no intracranial metastasis was observed. On perfusion CT, cerebral blood volume (CBV) was $74.40{\pm}9.63$, $16.08{\pm}0.64$, and $15.24{\pm}3.23$ ml/100g in the tumor core, ipsilateral normal brain, and contralateral normal brain, respectively ($p{\leqq}0.05$). For cerebral blood flow (CBF), there were significant differences between the tumor core and both normal brains ($p{\leqq}0.05$), but no significant difference between the ipsilateral and contralateral normal brains ($962.91{\pm}75.96$ vs. $357.82{\pm}12.82$ vs. $323.19{\pm}83.24$ ml/100g/min). For mean transit time (MTT), there were significant differences between the tumor core and both normal brains ($p{\leqq}0.05$), but no significant difference between the ipsilateral and contralateral normal brains ($4.37{\pm}0.19$ vs. $3.02{\pm}0.41$ vs. $2.86{\pm}0.22$ sec). For permeability surface (PS), there were significant differences among the tumor core, ipsilateral normal brain, and contralateral normal brain ($47.23{\pm}25.45$ vs. $14.54{\pm}1.60$ vs. $6.81{\pm}4.20$ ml/100g/min) ($p{\leqq}0.05$). For time to peak (TTP), there were no significant differences among the tumor core, ipsilateral normal brain, and contralateral normal brain. 
For the positive enhancement integral (PEI), there were significant differences among the tumor core, ipsilateral normal brain, and contralateral normal brain ($61.56{\pm}16.07$ vs. $12.58{\pm}2.61$ vs. $8.26{\pm}5.55$ ml/100g) ($p{\leqq}0.05$). For the maximum slope of increase (MSI), there were significant differences between the tumor core and both normal brains ($p{\leqq}0.05$), but no significant difference between the ipsilateral and contralateral normal brains ($13.18{\pm}2.81$ vs. $6.99{\pm}1.73$ vs. $6.41{\pm}1.39$ HU/sec). Additionally, for the maximum slope of decrease (MSD), there was a significant difference between the tumor core and the contralateral normal brain ($p{\leqq}0.05$), but no significant difference between the tumor core and the ipsilateral normal brain ($4.02{\pm}1.37$ vs. $4.66{\pm}0.83$ vs. $6.47{\pm}1.53$ HU/sec). In conclusion, the VX2 tumors were successfully implanted in the rabbit brain, and the stereotactic inoculation method produced single-nodular tumors without intracranial metastasis, which makes the model suitable for comparative studies between tumors and normal tissues. Therefore, perfusion CT would be a useful diagnostic tool capable of reflecting the vascularity of tumors.
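
As a rough plausibility check on the reported perfusion values, the central volume principle (MTT = CBV / CBF) can be applied to the group means. The abstract does not state this relation, so using it here is an assumption. The workstation may derive MTT differently, and a ratio of group means need not equal the mean of per-animal ratios, so only approximate agreement should be expected. The variable and function names below are hypothetical.

```python
# Group means reported in the abstract:
# CBV in ml/100g, CBF in ml/100g/min, MTT in seconds.
reported = {
    "tumor core": {"cbv": 74.40, "cbf": 962.91, "mtt": 4.37},
    "ipsilateral normal brain": {"cbv": 16.08, "cbf": 357.82, "mtt": 3.02},
    "contralateral normal brain": {"cbv": 15.24, "cbf": 323.19, "mtt": 2.86},
}

def mtt_seconds(cbv, cbf):
    """Central volume principle: MTT = CBV / CBF, converted from minutes to seconds."""
    return cbv / cbf * 60.0

# Estimate MTT from the reported CBV and CBF means (assumed relation, see above).
estimated = {region: mtt_seconds(v["cbv"], v["cbf"]) for region, v in reported.items()}

for region, v in reported.items():
    print(f"{region}: estimated {estimated[region]:.2f} s, reported {v['mtt']:.2f} s")
```

The estimates land within a few tenths of a second of the reported MTT values, and the tumor core has the longest transit time in both the estimated and the reported figures.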

A Study on the Effect of Booth Recommendation System on Exhibition Visitors Unplanned Visit Behavior (전시장 참관객의 계획되지 않은 방문행동에 있어서 부스추천시스템의 영향에 대한 연구)

  • Chung, Nam-Ho; Kim, Jae-Kyung
    • Journal of Intelligence and Information Systems, v.17 no.4, pp.175-191, 2011
  • With the MICE (Meeting, Incentive travel, Convention, Exhibition) industry coming into the spotlight, there has been growing interest in the domestic exhibition industry. Accordingly, in Korea, as in the United States and Europe, various studies of the industry are being conducted to enhance exhibition performance. Some studies focus particularly on analyzing the visiting patterns of exhibition visitors using intelligent information technology, considering how the effects of viewing an exhibition vary with the exhibition environment or technique, in order to understand visitors and, furthermore, to draw correlations between exhibiting businesses and improve exhibition performance. However, previous studies of booth recommendation systems only discussed recommendation accuracy from a system perspective, rather than determining how recommendations change visitors' behavior or perception. A booth recommendation system enables visitors to visit unplanned exhibition booths by recommending suitable ones based on information about their visits. Some visitors may be satisfied with their unplanned visits, while others may consider the recommendation process cumbersome or obstructive to their free observation. In the latter case, the exhibition is likely to produce worse results than when visitors are allowed to observe it freely. Thus, in order to apply a booth recommendation system to exhibition halls, the factors affecting the performance of the system should be examined broadly, and the effects of the system on visitors' unplanned visiting behavior should be carefully studied. As such, this study aims to determine the factors that affect the performance of a booth recommendation system by reviewing theories and literature, and to examine the effects of visitors' perceived performance of the system on their satisfaction with unplanned behavior and their intention to reuse the system. 
Toward this end, unplanned behavior theory was adopted as the theoretical framework. Unplanned behavior can be defined as "behavior that is done by consumers without any prearranged plan". Thus far, consumers' unplanned behavior has been studied in various fields. The field of marketing, in particular, has focused on unplanned purchasing among the various types of unplanned behavior, which has often been confused with impulsive purchasing. Nevertheless, the two are different: while impulsive purchasing means strong, continuous urges to purchase things, unplanned purchasing is behavior in which purchasing decisions are made inside a store, not before entering it. In other words, all impulsive purchases are unplanned, but not all unplanned purchases are impulsive. Then why do consumers engage in unplanned behavior? Scholars have offered many explanations, but there is a consensus that it is because consumers keep enough flexibility to change their plans midway instead of planning thoroughly. In other words, if unplanned behavior is costly, it will be difficult for consumers to change their prearranged plans. In the case of the exhibition hall examined in this study, visitors learn the programs of the hall and plan in advance which booths to visit. This is because, with limited time, it is practically impossible for visitors to visit all of the booths in an exhibition. Therefore, if the booth recommendation system proposed in this study recommends booths that visitors may like, they can change their plans and visit the recommended booths. Such visiting behavior can be regarded as similar to consumers' visits to a store or tourists' unplanned behavior at a tourist spot, and it can be understood in the same context as the recent increase in tourism consumers' unplanned behavior influenced by information devices. Thus, the following research model was established. 
This research model uses visitors' perceived performance of a booth recommendation system as the mediating variable, and the factors affecting that performance include trust in the system, exhibition visitors' knowledge levels, expected personalization of the system, and the system's threat to freedom. In addition, the causal relations between visitors' perceived performance of the system and their satisfaction with unplanned behavior and intention to reuse the system were determined. Trust in the booth recommendation system was modeled as a second-order factor comprising competence, benevolence, and integrity, while the other factors were first-order factors. In order to verify this model, a booth recommendation system was developed and tested at the 2011 DMC Culture Open, and 101 visitors were empirically studied and analyzed. The results are as follows. First, visitors' trust was the most important factor in the booth recommendation system, and the visitors who used the system perceived its performance as a success based on their trust. Second, visitors' knowledge levels also had significant effects on the performance of the system, which indicates that the performance of a recommendation system requires prior understanding. In other words, visitors with a better understanding of the exhibition hall recognized the usefulness of the booth recommendation system more readily. Third, expected personalization did not have significant effects, which differs from the results of previous studies. This is presumably because the booth recommendation system used in this study did not provide enough personalized services. Fourth, the recommendation information provided by the booth recommendation system was not considered to threaten or restrict one's freedom, which means it is valuable in terms of usefulness. 
Lastly, high performance of the booth recommendation system led to high levels of visitor satisfaction with unplanned behavior and intention to reuse the system. To sum up, in order to analyze the effects of a booth recommendation system on visitors' unplanned visits to booths, empirical data were examined based on unplanned behavior theory and, accordingly, useful suggestions were made for the establishment and design of future booth recommendation systems. In the future, further examination should be conducted with more elaborate survey questions and survey subjects.