• Title/Summary/Keyword: Social-Media

Search Result 3,072, Processing Time 0.031 seconds

Study of the Actual Condition and Satisfaction of Volunteer Activity in Australian Hospital (호주 일 지역의 병원 자원봉사활동 실태와 만족도)

  • Park, Geum-Ja;Choi, Hae-Young
    • Journal of Hospice and Palliative Care
    • /
    • v.9 no.1
    • /
    • pp.17-29
    • /
    • 2006
  • Purpose: This research aimed to investigate the actual condition and satisfaction of volunteer activity in Australian hospital. Methods: Data was collected by self reported questionnaire from 101 volunteers and analyzed by frequency and percentage, t-test, ANOVA and Sheffe and Pearson's correlation coefficients using SPSS 12.0. Results: 1. Years involved in volunteer work were $5{\sim}10$ years (32.7%), above 10 years (30.7%), $2{\sim}3$ years (11.9%) and $3{\sim}5$ years (10.9%). Types of volunteer work were physical care (32.7%), physical and emotional care (14.9%), and others (18.8%). Types of allocation of tasks were by volunteer coordination (55.7%), and by volunteer preference and consent between volunteer and coordinator (both respectively, 20.5%). Main reasons for volunteer work were to help sick people (61.4%) and to make good use of leisure time (22.8%). Routes to start volunteer work were from his (her) own inquiries (43.4%), from hearing from other volunteers (30.7%) and from mass media (13.1%). 80.2% of volunteers had received some kinds of training or preparation for volunteer work. Suitability of volunteer's skill and ability to voluntary work were 'very well' (74.0%) and 'mostly well' (18.0%). Reimbursements or benefits received for volunteer work were token or lunch or group outing (31.7%), and token and lunch or group outing (19.8%). Evaluation frequency for volunteer work was occasionally (372%), frequently (30.9%), always (17.0%) and never (14.9%). Relationship with volunteer work coordinator was very good (85.0%). The relationship with other volunteers was very good (81.2%). The relationship with hospital staffs was very good (69.7%) and mostly good (21.2%). Family and friend's support for volunteer work was very good (83.2%). 2 The mean score of satisfaction for the hospital volunteer activity was $3.09{\pm}0.49\;(range:\;1{\sim}4)$. The highest score domain was 'social contact', $3.48{\pm}0.61$, and the lowest was 'social exchange', $1.65{\pm}0.63$. An item of the highest score was 'I have an opportunity to help other people' ($3.83{\pm}0.40$), and the lowest score item was 'I will receive compensation for volunteer work I have done ($1.10{\pm}0.78$).' 3. The satisfaction from hospital volunteer activity was shown by significant difference according to sex (t=2.038, P=0.044), marital status (F=3.806, P=0.013), years involved in volunteer work (F=3.326), nam reason to do volunteer work (F=2.707, P=0.035), receive any training or preparation for volunteer work (t=-1.982, 0=0.050), frequency of evaluation for volunteer work (F=7.877, P=0.000), suitability of volunteer's skill and ability to voluntary work (t=2.712, P=0.049), relationship with volunteer work coordinators (F=-2.517, P=0.013), relation with hospital staffs (F=5.202, P=0.007), and support of their volunteer work by their family and friends (t=-3.394, P=0.001). Conclusion: The satisfaction of hospice volunteer activity was moderate. The satisfaction for hospice volunteer activity was shown by significant difference according to sex (t=2.038, P=0.044), marital status (F=3.806, P=0.013), years involved in volunteer work (F=3.326), main reason to do volunteer work (F=2.707, P=0.035), receive any training or preparation for volunteer work (t=-1.982, 0=0.050), frequency of evaluation for volunteer work (F=7.877, P=0.000), suitability of volunteer's skill and ability to voluntary work (t=2.712, P=0.049), relationship with volunteer work coordinator (F=-2.517, P=0.013), relation with hospital staffs (F=5.202, P=0.007), and family and friend's support for volunteer work (t=-3.394, P=0.001). Therefore, it is necessary to consider various factors to improve the satisfaction of voluntary work.

  • PDF

Clickstream Big Data Mining for Demographics based Digital Marketing (인구통계특성 기반 디지털 마케팅을 위한 클릭스트림 빅데이터 마이닝)

  • Park, Jiae;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.143-163
    • /
    • 2016
  • The demographics of Internet users are the most basic and important sources for target marketing or personalized advertisements on the digital marketing channels which include email, mobile, and social media. However, it gradually has become difficult to collect the demographics of Internet users because their activities are anonymous in many cases. Although the marketing department is able to get the demographics using online or offline surveys, these approaches are very expensive, long processes, and likely to include false statements. Clickstream data is the recording an Internet user leaves behind while visiting websites. As the user clicks anywhere in the webpage, the activity is logged in semi-structured website log files. Such data allows us to see what pages users visited, how long they stayed there, how often they visited, when they usually visited, which site they prefer, what keywords they used to find the site, whether they purchased any, and so forth. For such a reason, some researchers tried to guess the demographics of Internet users by using their clickstream data. They derived various independent variables likely to be correlated to the demographics. The variables include search keyword, frequency and intensity for time, day and month, variety of websites visited, text information for web pages visited, etc. The demographic attributes to predict are also diverse according to the paper, and cover gender, age, job, location, income, education, marital status, presence of children. A variety of data mining methods, such as LSA, SVM, decision tree, neural network, logistic regression, and k-nearest neighbors, were used for prediction model building. However, this research has not yet identified which data mining method is appropriate to predict each demographic variable. Moreover, it is required to review independent variables studied so far and combine them as needed, and evaluate them for building the best prediction model. The objective of this study is to choose clickstream attributes mostly likely to be correlated to the demographics from the results of previous research, and then to identify which data mining method is fitting to predict each demographic attribute. Among the demographic attributes, this paper focus on predicting gender, age, marital status, residence, and job. And from the results of previous research, 64 clickstream attributes are applied to predict the demographic attributes. The overall process of predictive model building is compose of 4 steps. In the first step, we create user profiles which include 64 clickstream attributes and 5 demographic attributes. The second step performs the dimension reduction of clickstream variables to solve the curse of dimensionality and overfitting problem. We utilize three approaches which are based on decision tree, PCA, and cluster analysis. We build alternative predictive models for each demographic variable in the third step. SVM, neural network, and logistic regression are used for modeling. The last step evaluates the alternative models in view of model accuracy and selects the best model. For the experiments, we used clickstream data which represents 5 demographics and 16,962,705 online activities for 5,000 Internet users. IBM SPSS Modeler 17.0 was used for our prediction process, and the 5-fold cross validation was conducted to enhance the reliability of our experiments. As the experimental results, we can verify that there are a specific data mining method well-suited for each demographic variable. For example, age prediction is best performed when using the decision tree based dimension reduction and neural network whereas the prediction of gender and marital status is the most accurate by applying SVM without dimension reduction. We conclude that the online behaviors of the Internet users, captured from the clickstream data analysis, could be well used to predict their demographics, thereby being utilized to the digital marketing.

Stock Price Prediction by Utilizing Category Neutral Terms: Text Mining Approach (카테고리 중립 단어 활용을 통한 주가 예측 방안: 텍스트 마이닝 활용)

  • Lee, Minsik;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.123-138
    • /
    • 2017
  • Since the stock market is driven by the expectation of traders, studies have been conducted to predict stock price movements through analysis of various sources of text data. In order to predict stock price movements, research has been conducted not only on the relationship between text data and fluctuations in stock prices, but also on the trading stocks based on news articles and social media responses. Studies that predict the movements of stock prices have also applied classification algorithms with constructing term-document matrix in the same way as other text mining approaches. Because the document contains a lot of words, it is better to select words that contribute more for building a term-document matrix. Based on the frequency of words, words that show too little frequency or importance are removed. It also selects words according to their contribution by measuring the degree to which a word contributes to correctly classifying a document. The basic idea of constructing a term-document matrix was to collect all the documents to be analyzed and to select and use the words that have an influence on the classification. In this study, we analyze the documents for each individual item and select the words that are irrelevant for all categories as neutral words. We extract the words around the selected neutral word and use it to generate the term-document matrix. The neutral word itself starts with the idea that the stock movement is less related to the existence of the neutral words, and that the surrounding words of the neutral word are more likely to affect the stock price movements. And apply it to the algorithm that classifies the stock price fluctuations with the generated term-document matrix. In this study, we firstly removed stop words and selected neutral words for each stock. And we used a method to exclude words that are included in news articles for other stocks among the selected words. Through the online news portal, we collected four months of news articles on the top 10 market cap stocks. We split the news articles into 3 month news data as training data and apply the remaining one month news articles to the model to predict the stock price movements of the next day. We used SVM, Boosting and Random Forest for building models and predicting the movements of stock prices. The stock market opened for four months (2016/02/01 ~ 2016/05/31) for a total of 80 days, using the initial 60 days as a training set and the remaining 20 days as a test set. The proposed word - based algorithm in this study showed better classification performance than the word selection method based on sparsity. This study predicted stock price volatility by collecting and analyzing news articles of the top 10 stocks in market cap. We used the term - document matrix based classification model to estimate the stock price fluctuations and compared the performance of the existing sparse - based word extraction method and the suggested method of removing words from the term - document matrix. The suggested method differs from the word extraction method in that it uses not only the news articles for the corresponding stock but also other news items to determine the words to extract. In other words, it removed not only the words that appeared in all the increase and decrease but also the words that appeared common in the news for other stocks. When the prediction accuracy was compared, the suggested method showed higher accuracy. The limitation of this study is that the stock price prediction was set up to classify the rise and fall, and the experiment was conducted only for the top ten stocks. The 10 stocks used in the experiment do not represent the entire stock market. In addition, it is difficult to show the investment performance because stock price fluctuation and profit rate may be different. Therefore, it is necessary to study the research using more stocks and the yield prediction through trading simulation.

A Study on the Smoking Status of the Korean Middle and High School Students (한국인(韓國人) 중고교생(中高校生)들의 흡연실태(吸煙實態)에 관(關)한 연구)

  • Park, Soon-Young
    • Journal of the Korean Society of School Health
    • /
    • v.7 no.1
    • /
    • pp.57-71
    • /
    • 1994
  • I investigated actual conditions of smoking of teenagers who were randomly chosen middle and high school students. 1. Juvenile smoking 1) Parents' opinions of juvenile smoking Most parents do not want their children to smoke after growth : 88.6% of fathers (middle school students: 88.9%, high school students: 88.4%) and 95.1% of mothers (middle school students: 93.4%, high school students :95.5%). 2) Teenagers' opinions of smoking after growth The rate of students who will smoke after growth is 10.8% (middle school students: 12.0%, high school students: 9.9%): students in agricultural areas show the higher rate than those in cities. 3) Parents' opinions of their children's smoking now 1.5% of fathers want their children to smoke now (middle school students: 1.3%, high school students: 1.6%) and 1.1% of mothers do (middle school students: 0.6%, high school students: 1.5%). This shows that most parents do not want their children to smoke now. 4) Students' opinions of their friends' smoking now Students who want their friends smoke now cover 7.8% (middle school students: 7.1%, high school students: 8.4%). This rates are higher than those of parents shown in (3). And more high school students and more girl students gave the positive reponse than middle school boy and girl students, respectively. 5) Students' views of smoking "Look like an adult" covers the rate of 4.0% (boy: 7.8%, girl:3.6%) 6.7% of middle school students have this view, while 3.7% of high school students have. 16.1% of students had an experience of smoking during the last one year (boy: 29.9%, girl: 8.6%): this shows that the rate of the boy students is more than 3 times greater than that of the girl students and high students who experienced smoking last year covers 20.2%, while middle school students shows 10.9%. 6) Actual conditions of students' smoking The present rate of students' smoking is 22.4% (boy:38.3%, girl:13.8%): the rate of boy students is greater than that of girl students. Students who smoke more than pack of cigarettes a day cover 8.2% (boy: 17.5%, girl: 3.2%): 5.2% of middle school students (boy:11.4%, girl: 2.1%) smoke more than one pack while 10.7% of high school students do (boy:21.5%, girl: 4.2%). This shows that the rate of boy students' smoking is greater than that of girl students' smoking. 7) The rate of smoking of students' parents 75.4% of fathers (city: 74.5%, agricultural area:75.9%) smoke: and more than a half (62.4%) smoke more than a pack cigarettes a day. On the other hand, the rate of smoking mothers is 5.2%(city: 4.3%, agricultural area: 7.3%): the rate is higher in agricultural areas. 8) Opinions of smoking population in the future 61.4% of students answered that smoking population will increase, while 27.0% have the opinion that smoking population will decrease. 2. Opinions of the effects of smoking on health 1) Have you heard that smokers are likely to suffer from tuberclosis? 78.3% of students said yes (boy: 80.8%, girl: 76.4%): it is shown that the rate of boys is greater than that of girls. 2) Have you heard that smokers are likely to get out of endurance? 76.6% of students (boy: 69.3%, girl: 49.7%) answered yes: it is shown that the rate of boys is greater than that of girls. 3) Have you heard that heart-beats get fast when one smokes? 32.5% of students (boy: 35.5%, girl: 30.9%) answered yes: 32.2% in cities(boy: 33.0%, girl: 31.8%) and 33.5% in agricultural areas(boy: 41.8%, girl: 28.8%): and 28.7% middle students and 35.5% of high school students answered yes. 4) Have you heard that smokers are likely to have heart-diseases? 35.1% of students (boy: 34.0%, girl: 34.1%) answered yes: 35.3% in cities (boy: 37.2%, girl: 34.2%) and 36.7% in agricultural areas (boy: 39.0%, girl: 33.9%): 34.8% of middle school students and 35.4% of high school students. 5) Have you heard that smokers are likely to have a lung cancer? 91.4% of students (boy: 93.2%, girl: 89.9%) answered yes: 90.35% in cities and 94.2% in agricultural areas. 6) Have you heard that the life of smokers gets shorter? 94.3% of students (boy:94.6%, girl: 92.2%) answered yes. 7) Have you heard that pregnant smokers will deliver a baby with low birth weight? 29.6% of students (boy: 29.8%, girl: 29.4%) answered yes: the rates of boys and girls almost the same. 8) Have you heard that one feels calm when one smokes? 80.1% of students (boy: 81.8%, girl: 79.2%) answered yes: boys and girls showed almost the same rate. 3. Preventive measures Smoking people continued to increase all over the world because smoking not only mitigated emotional uneasiness such as loneliness, nervousness and so on, but also could be very helpful from the social perspective. This was so because they did not consider harmful effects of smoking on health, and victims. However, because any -one can have physical disorders caused by smoking, people should always keep in mind the following preventive measures. 1) Doctors or teachers should set an example of giving up smoking. Informing patients or students of harmful effects of smoking to persuade their family and relatives not to smoke. 2) Through mass media like newspapers, periodicals or broadcasting, to make people know harmful effects of smoking and not smoke. 3) To prohibit selling teenagers cigarette by law. 4) To prohibit smoking in public places like work places, offices, lecture rooms, recreation rooms, buses, trains and so on. 5) To decrease the rate of life insurance for non-smokers as in foreign countries and to give a warming of the harmful effects on cigarette packets or ads.

  • PDF

A study on the gratification of the patient in the Dental Hospital (치과병원 내원환자의 만족도 조사분석)

  • Kim, Min-Young;Lee, Keun-Woo;Moon, Hong-Suk;Chung, Moon-Kyu
    • The Journal of Korean Academy of Prosthodontics
    • /
    • v.46 no.1
    • /
    • pp.65-82
    • /
    • 2008
  • Statement of problem : Today's market economy has been changed more and more to consumer concerned. It is owing to not only consumers ' rising standard of living and education, but also purchasers' easy accessibilities to products through various mass media. The consumer centered market system, where customer can choose items with diverse alternatives to satisfy their self esteem, is also applied to the field of medical business, and accelerated by an increasing income level of shoppers and introducing the whole nations' medical insurance system. Today, the medical industry has become competitive due to increasing number of medical institutions and medical personnel, and this offers wide choices to consumers in the medical market place. At this point of time, it is essential to survey on the primary factor of gratification for the patient in the Dental clinic, as well as on the problems and suggestions in medical service. Purpose : The analysis in this study shows essential factors and expected influential elements in satisfaction of the patient in the Dental Hopsital, and strategic suggestions for the provider of dental service, which can be of benefit to the prospective customer as well as can make improvement in the quality of dental treatment service. Material and method : This study had been researched by collecting and analyzing the organized questionnaires, which were filled in directly from 784 patients, who visit Dental Hospital, Yonsei University in Seoul, from January 23rd to April 15th. Result : It can be summarized like the followings. 1. The social and demographical peculiarities of respondents are as follows. Samples of gender and marital status are adequately extracted, but data on occupation and treatment are are under a bias toward students, undergraduates and graduate students, and orthodontics. 2. 74% of patients who answer the questionnaire were highly satisfied with the service of dental clinic in the section of overall satisfaction. 3. The survey result about specific service of dental treatment, within sections of independent variables, is like the followings; Patients are highly gratified with service system, kindness, explanation, explanation on expected waiting hours, reservation system, emergency measures, expert treatment, existence of knowledge of dentistry, size of hospital, disinfection, equipment and parking, but lowly satisfied with expense of treatment, preparatory hours for treatment, waiting hours, treatment hours and the period of subscription. 4. The correlation analysis showed that there is no significant linear relationship between the independent variables. 5. The probit regression analysis showed that 8 out of 34 independent variables explained the dependent variables at the level of 0.01. 6. It shows that 8 independent variables, which can affect customers 'satisfaction, are clearing up of inconvenience, service system, kindness, explanation, treatment hours per attendance, reservation system, existence of knowledge of dentistry, and contentment of equipment in the hospital. Conclusion : The consumer's satisfaction totally relies on subjective evaluations of customers. Providing appropriate service, which can meet the criteria for the customer who demands various wares, pursues luxury goods, and expects high quality of medical service, is essential to fulfill patients' satisfaction. Many medical institutions do their best to satisfy their customer, touch their consumer, and offer patience centered services, and it is also applied to the field of dentistry. Establishing brand new strategic managements and elevating the quality of dental service based on this survey are required to improve the satisfaction of patience in the Dental Hospital.

The Analysis of the Current Status of Medical Accidents and Disputes Researched in the Korean Web Sites (인터넷 사이트를 통해 살펴본 의료사고 및 의료분쟁의 현황에 관한 분석)

  • Cha, Yu-Rim;Kwon, Jeong-Seung;Choi, Jong-Hoon;Kim, Chong-Youl
    • Journal of Oral Medicine and Pain
    • /
    • v.31 no.4
    • /
    • pp.297-316
    • /
    • 2006
  • The increasing tendency of medical disputes is one of the remarkable social phenomena. Especially we must not overlook the phenomenon that production and circulation of information related to medical accidents is increasing rapidly through the internet. In this research, we evaluated the web sites which provide the information related to medical accidents using the keyword "medical accidents" in March 2006, and classified the 28 web sites according to the kinds of establishers. We also analyzed the contents of the sites, and checked and compared the current status of the web sites and problems that have to be improved. Finally, we suggested the possible solutions to prevent medical accidents. The detailed results were listed below. 1. Medical practitioners, general public, and lawyers were all familiar with and prefer the term "medical accidents" mainly. 2. In the number of sites searched by the keyword "medical accidents", lawyer had the most sites and medical practitioners had the least ones. 3. Many sites by general public and lawyers had their own medical record analysts but there was little professional analysts for dentistry. 4. General public were more interested in the prevention of medical accidents but the lawyers were more interested in the process after medical accidents. The sites by medical practitioners dealt with the least remedies of medical accidents, compared with other sites. 5. General public wanted the third party such as government intervention into the disputes including the medical dispute arbitration law or/and the establishment of independent medical dispute judgment institution. 6. In the comparison among the establishers of web sites, medical practitioners dealt with the least examples of medical accidents. 7. The suggestion of cases in counseling articles related to dental accidents were considered less importantly than the reality. 8. Whereas there were many articles about domestic cases related to the bloody dental treatment, in the open counseling articles the number of dental treatment regarding to non insurance treatment was large. 9. In comparing offered information of medical accidents based on the establishers, general public offered vocabularies, lawyers offered related laws and medical practitioners offered medical knowledge relatively. 10. They all cited the news pressed by the media to offer the current status of domestic medical accidents. Especially among the web sites by general public, NGOs provided the plentiful statistical data related to medical accidents. 11. The web sites that collect the medical accidents were only two. As a result of our research, we found out that, in the flood of information, medical disputes can be occurred by the wrong information from third party, and the medical practitioners have the most passive attitudes on the medical accidents. Thus, it is crucial to have the mutual interchange and exchange of information between lawyer, patients and medical practitioners, so that based on clear mutual comprehension we can solve the accidents and disputes more positively and actively.

A Study on the Function of Oral Medicine as the Secondary Clinic Based on Analysis on Admissive Channel and Case Features (내원경위 분석과 환자 특성 평가에 따른 2차 진료기관으로서 구강내과 역할에 대한 연구)

  • Lee, You-Mee;Lee, Jung-Hyun;Lim, Hyun-Dae
    • Journal of Oral Medicine and Pain
    • /
    • v.31 no.3
    • /
    • pp.199-210
    • /
    • 2006
  • The epidemiological researches on the inpatients hospitalized at the oral medicine ward have been continuously carried out since 1970, and most researches have been performed by centering around the oral medicine wards of college hospitals. Numerous specialists have been produced after the establishment of oral medicine, and they have been active in various fields. As dental clinics have gotten bigger, the function of oral medicine in the secondary clinics is being brought out. As admissive channel, case features, case composition and otherwise have not been researched for a long time, the related researches should be carried out from now on. Hereupon, this study was carried out by targeting the 100 inpatients hospitalized at the oral medicine ward of Sun Hospital located in Daejeon Korea, through questionnaire. As the result, the following results were derived. 1. The ages of the inpatients in Sun Hospital were $29.21{\pm}11.31$ on the average; 71 females' mean average was $29.63{\pm}11.29$ and 29 males' mean average was $28.17{\pm}11.48$. In regard of school career, the patients who finished high-school course or higher accounted for 78%; the patients' school career seemed to be relatively high. The patients who complained of temporomandibular pain accounted for the highest proportion with 65%. In motivation to visit this hospital, internet surfing was 11%, mass media was 10%, acquaintance's introduction was 38%. The patients, who were hospitalized at another hospital due to the same symptom, accounted for 56%. The dental clinics, which made the patients visit this hospital, accounted for 20%. The patients, who were previously aware that the present symptom should be treated by oral medicine, accounted for 38%. The patients, who were not aware of the fact in advance, were 62%. The respondents of 51% answered that they were aware of the fact one month or below before hospitalization. 2. The patients, who complained of craniocervical ache, accounted for 58%; the patients, whose ache aches affect dailylife, were 22%. Continuous ache was 14% and intermittent ache was 68%, and dull pain was 23%. 3. Life variations were compared with each other by using SRRS (Social Readjustment Rating Scale). In consequence, the variation within 3 years indicated a significant difference in the both groups but the variation within 6 months did not indicate any differences. 4. In regard of the questionnaire on the incidents happened for a week, the ache-group was compared with the group free from the ache. As the result, the number of strain arisen for a week, the decrease of favorite works and sudden fear indicated a significant difference. Pleasant feeling and the decrease of interests in looks did not indicate a significant difference, but came close to the significance. 5. In the questionnaire on impatience, the ache-group indicated higher value but there was not a significant difference. 6. In the questionnaire on the symptoms caused by stress, the two groups indicated significant differences in the item of 'the teethridge itches and feels a tooth rising' and 'the occiput or the nape is stiff.' In the item 'the inside of the cheek or the teethridge are widely peeled off, accompanied with ache and hemorrhage', 'the face has acne or pimple' and 'headache frequently attacks', a significant difference was not observed but the two groups came close to the significance.

Construction of Consumer Confidence index based on Sentiment analysis using News articles (뉴스기사를 이용한 소비자의 경기심리지수 생성)

  • Song, Minchae;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.3
    • /
    • pp.1-27
    • /
    • 2017
  • It is known that the economic sentiment index and macroeconomic indicators are closely related because economic agent's judgment and forecast of the business conditions affect economic fluctuations. For this reason, consumer sentiment or confidence provides steady fodder for business and is treated as an important piece of economic information. In Korea, private consumption accounts and consumer sentiment index highly relevant for both, which is a very important economic indicator for evaluating and forecasting the domestic economic situation. However, despite offering relevant insights into private consumption and GDP, the traditional approach to measuring the consumer confidence based on the survey has several limits. One possible weakness is that it takes considerable time to research, collect, and aggregate the data. If certain urgent issues arise, timely information will not be announced until the end of each month. In addition, the survey only contains information derived from questionnaire items, which means it can be difficult to catch up to the direct effects of newly arising issues. The survey also faces potential declines in response rates and erroneous responses. Therefore, it is necessary to find a way to complement it. For this purpose, we construct and assess an index designed to measure consumer economic sentiment index using sentiment analysis. Unlike the survey-based measures, our index relies on textual analysis to extract sentiment from economic and financial news articles. In particular, text data such as news articles and SNS are timely and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. There exist two main approaches to the automatic extraction of sentiment from a text, we apply the lexicon-based approach, using sentiment lexicon dictionaries of words annotated with the semantic orientations. In creating the sentiment lexicon dictionaries, we enter the semantic orientation of individual words manually, though we do not attempt a full linguistic analysis (one that involves analysis of word senses or argument structure); this is the limitation of our research and further work in that direction remains possible. In this study, we generate a time series index of economic sentiment in the news. The construction of the index consists of three broad steps: (1) Collecting a large corpus of economic news articles on the web, (2) Applying lexicon-based methods for sentiment analysis of each article to score the article in terms of sentiment orientation (positive, negative and neutral), and (3) Constructing an economic sentiment index of consumers by aggregating monthly time series for each sentiment word. In line with existing scholarly assessments of the relationship between the consumer confidence index and macroeconomic indicators, any new index should be assessed for its usefulness. We examine the new index's usefulness by comparing other economic indicators to the CSI. To check the usefulness of the newly index based on sentiment analysis, trend and cross - correlation analysis are carried out to analyze the relations and lagged structure. Finally, we analyze the forecasting power using the one step ahead of out of sample prediction. As a result, the news sentiment index correlates strongly with related contemporaneous key indicators in almost all experiments. We also find that news sentiment shocks predict future economic activity in most cases. In almost all experiments, the news sentiment index strongly correlates with related contemporaneous key indicators. Furthermore, in most cases, news sentiment shocks predict future economic activity; in head-to-head comparisons, the news sentiment measures outperform survey-based sentiment index as CSI. Policy makers want to understand consumer or public opinions about existing or proposed policies. Such opinions enable relevant government decision-makers to respond quickly to monitor various web media, SNS, or news articles. Textual data, such as news articles and social networks (Twitter, Facebook and blogs) are generated at high-speeds and cover a wide range of issues; because such sources can quickly capture the economic impact of specific economic issues, they have great potential as economic indicators. Although research using unstructured data in economic analysis is in its early stages, but the utilization of data is expected to greatly increase once its usefulness is confirmed.

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode (CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석)

  • Park, Ho-yeon;Kim, Kyoung-jae
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.4
    • /
    • pp.141-154
    • /
    • 2019
  • Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.

Spatial effect on the diffusion of discount stores (대형할인점 확산에 대한 공간적 영향)

  • Joo, Young-Jin;Kim, Mi-Ae
    • Journal of Distribution Research
    • /
    • v.15 no.4
    • /
    • pp.61-85
    • /
    • 2010
  • Introduction: Diffusion is process by which an innovation is communicated through certain channel overtime among the members of a social system(Rogers 1983). Bass(1969) suggested the Bass model describing diffusion process. The Bass model assumes potential adopters of innovation are influenced by mass-media and word-of-mouth from communication with previous adopters. Various expansions of the Bass model have been conducted. Some of them proposed a third factor affecting diffusion. Others proposed multinational diffusion model and it stressed interactive effect on diffusion among several countries. We add a spatial factor in the Bass model as a third communication factor. Because of situation where we can not control the interaction between markets, we need to consider that diffusion within certain market can be influenced by diffusion in contiguous market. The process that certain type of retail extends is a result that particular market can be described by the retail life cycle. Diffusion of retail has pattern following three phases of spatial diffusion: adoption of innovation happens in near the diffusion center first, spreads to the vicinity of the diffusing center and then adoption of innovation is completed in peripheral areas in saturation stage. So we expect spatial effect to be important to describe diffusion of domestic discount store. We define a spatial diffusion model using multinational diffusion model and apply it to the diffusion of discount store. Modeling: In this paper, we define a spatial diffusion model and apply it to the diffusion of discount store. To define a spatial diffusion model, we expand learning model(Kumar and Krishnan 2002) and separate diffusion process in diffusion center(market A) from diffusion process in the vicinity of the diffusing center(market B). The proposed spatial diffusion model is shown in equation (1a) and (1b). Equation (1a) is the diffusion process in diffusion center and equation (1b) is one in the vicinity of the diffusing center. $$\array{{S_{i,t}=(p_i+q_i{\frac{Y_{i,t-1}}{m_i}})(m_i-Y_{i,t-1})\;i{\in}\{1,{\cdots},I\}\;(1a)}\\{S_{j,t}=(p_j+q_j{\frac{Y_{j,t-1}}{m_i}}+{\sum\limits_{i=1}^I}{\gamma}_{ij}{\frac{Y_{i,t-1}}{m_i}})(m_j-Y_{j,t-1})\;i{\in}\{1,{\cdots},I\},\;j{\in}\{I+1,{\cdots},I+J\}\;(1b)}}$$ We rise two research questions. (1) The proposed spatial diffusion model is more effective than the Bass model to describe the diffusion of discount stores. (2) The more similar retail environment of diffusing center with that of the vicinity of the contiguous market is, the larger spatial effect of diffusing center on diffusion of the vicinity of the contiguous market is. To examine above two questions, we adopt the Bass model to estimate diffusion of discount store first. Next spatial diffusion model where spatial factor is added to the Bass model is used to estimate it. Finally by comparing Bass model with spatial diffusion model, we try to find out which model describes diffusion of discount store better. In addition, we investigate the relationship between similarity of retail environment(conceptual distance) and spatial factor impact with correlation analysis. Result and Implication: We suggest spatial diffusion model to describe diffusion of discount stores. To examine the proposed spatial diffusion model, 347 domestic discount stores are used and we divide nation into 5 districts, Seoul-Gyeongin(SG), Busan-Gyeongnam(BG), Daegu-Gyeongbuk(DG), Gwan- gju-Jeonla(GJ), Daejeon-Chungcheong(DC), and the result is shown

    . In a result of the Bass model(I), the estimates of innovation coefficient(p) and imitation coefficient(q) are 0.017 and 0.323 respectively. While the estimate of market potential is 384. A result of the Bass model(II) for each district shows the estimates of innovation coefficient(p) in SG is 0.019 and the lowest among 5 areas. This is because SG is the diffusion center. The estimates of imitation coefficient(q) in BG is 0.353 and the highest. The imitation coefficient in the vicinity of the diffusing center such as BG is higher than that in the diffusing center because much information flows through various paths more as diffusion is progressing. A result of the Bass model(II) shows the estimates of innovation coefficient(p) in SG is 0.019 and the lowest among 5 areas. This is because SG is the diffusion center. The estimates of imitation coefficient(q) in BG is 0.353 and the highest. The imitation coefficient in the vicinity of the diffusing center such as BG is higher than that in the diffusing center because much information flows through various paths more as diffusion is progressing. In a result of spatial diffusion model(IV), we can notice the changes between coefficients of the bass model and those of the spatial diffusion model. Except for GJ, the estimates of innovation and imitation coefficients in Model IV are lower than those in Model II. The changes of innovation and imitation coefficients are reflected to spatial coefficient(${\gamma}$). From spatial coefficient(${\gamma}$) we can infer that when the diffusion in the vicinity of the diffusing center occurs, the diffusion is influenced by one in the diffusing center. The difference between the Bass model(II) and the spatial diffusion model(IV) is statistically significant with the ${\chi}^2$-distributed likelihood ratio statistic is 16.598(p=0.0023). Which implies that the spatial diffusion model is more effective than the Bass model to describe diffusion of discount stores. So the research question (1) is supported. In addition, we found that there are statistically significant relationship between similarity of retail environment and spatial effect by using correlation analysis. So the research question (2) is also supported.

  • PDF