Analysis of Twitter for 2012 South Korea Presidential Election by Text Mining Techniques (텍스트 마이닝을 이용한 2012년 한국대선 관련 트위터 분석)
-
- Journal of Intelligence and Information Systems
- /
- v.19 no.3
- /
- pp.141-156
- /
- 2013
Social media is a representative form of the Web 2.0 that shapes the change of a user's information behavior by allowing users to produce their own contents without any expert skills. In particular, as a new communication medium, it has a profound impact on the social change by enabling users to communicate with the masses and acquaintances their opinions and thoughts. Social media data plays a significant role in an emerging Big Data arena. A variety of research areas such as social network analysis, opinion mining, and so on, therefore, have paid attention to discover meaningful information from vast amounts of data buried in social media. Social media has recently become main foci to the field of Information Retrieval and Text Mining because not only it produces massive unstructured textual data in real-time but also it serves as an influential channel for opinion leading. But most of the previous studies have adopted broad-brush and limited approaches. These approaches have made it difficult to find and analyze new information. To overcome these limitations, we developed a real-time Twitter trend mining system to capture the trend in real-time processing big stream datasets of Twitter. The system offers the functions of term co-occurrence retrieval, visualization of Twitter users by query, similarity calculation between two users, topic modeling to keep track of changes of topical trend, and mention-based user network analysis. In addition, we conducted a case study on the 2012 Korean presidential election. We collected 1,737,969 tweets which contain candidates' name and election on Twitter in Korea (http://www.twitter.com/) for one month in 2012 (October 1 to October 31). The case study shows that the system provides useful information and detects the trend of society effectively. The system also retrieves the list of terms co-occurred by given query terms. We compare the results of term co-occurrence retrieval by giving influential candidates' name, 'Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn' as query terms. General terms which are related to presidential election such as 'Presidential Election', 'Proclamation in Support', Public opinion poll' appear frequently. Also the results show specific terms that differentiate each candidate's feature such as 'Park Jung Hee' and 'Yuk Young Su' from the query 'Guen Hae Park', 'a single candidacy agreement' and 'Time of voting extension' from the query 'Jae In Moon' and 'a single candidacy agreement' and 'down contract' from the query 'Chul Su Ahn'. Our system not only extracts 10 topics along with related terms but also shows topics' dynamic changes over time by employing the multinomial Latent Dirichlet Allocation technique. Each topic can show one of two types of patterns-Rising tendency and Falling tendencydepending on the change of the probability distribution. To determine the relationship between topic trends in Twitter and social issues in the real world, we compare topic trends with related news articles. We are able to identify that Twitter can track the issue faster than the other media, newspapers. The user network in Twitter is different from those of other social media because of distinctive characteristics of making relationships in Twitter. Twitter users can make their relationships by exchanging mentions. We visualize and analyze mention based networks of 136,754 users. We put three candidates' name as query terms-Geun Hae Park', 'Jae In Moon', and 'Chul Su Ahn'. The results show that Twitter users mention all candidates' name regardless of their political tendencies. This case study discloses that Twitter could be an effective tool to detect and predict dynamic changes of social issues, and mention-based user networks could show different aspects of user behavior as a unique network that is uniquely found in Twitter.
People easily believe that news and stock index are closely related. They think that securing news before anyone else can help them forecast the stock prices and enjoy great profit, or perhaps capture the investment opportunity. However, it is no easy feat to determine to what extent the two are related, come up with the investment decision based on news, or find out such investment information is valid. If the significance of news and its impact on the stock market are analyzed, it will be possible to extract the information that can assist the investment decisions. The reality however is that the world is inundated with a massive wave of news in real time. And news is not patterned text. This study suggests the stock-index invest model based on "News Big Data" opinion mining that systematically collects, categorizes and analyzes the news and creates investment information. To verify the validity of the model, the relationship between the result of news opinion mining and stock-index was empirically analyzed by using statistics. Steps in the mining that converts news into information for investment decision making, are as follows. First, it is indexing information of news after getting a supply of news from news provider that collects news on real-time basis. Not only contents of news but also various information such as media, time, and news type and so on are collected and classified, and then are reworked as variable from which investment decision making can be inferred. Next step is to derive word that can judge polarity by separating text of news contents into morpheme, and to tag positive/negative polarity of each word by comparing this with sentimental dictionary. Third, positive/negative polarity of news is judged by using indexed classification information and scoring rule, and then final investment decision making information is derived according to daily scoring criteria. For this study, KOSPI index and its fluctuation range has been collected for 63 days that stock market was open during 3 months from July 2011 to September in Korea Exchange, and news data was collected by parsing 766 articles of economic news media M company on web page among article carried on stock information>news>main news of portal site Naver.com. In change of the price index of stocks during 3 months, it rose on 33 days and fell on 30 days, and news contents included 197 news articles before opening of stock market, 385 news articles during the session, 184 news articles after closing of market. Results of mining of collected news contents and of comparison with stock price showed that positive/negative opinion of news contents had significant relation with stock price, and change of the price index of stocks could be better explained in case of applying news opinion by deriving in positive/negative ratio instead of judging between simplified positive and negative opinion. And in order to check whether news had an effect on fluctuation of stock price, or at least went ahead of fluctuation of stock price, in the results that change of stock price was compared only with news happening before opening of stock market, it was verified to be statistically significant as well. In addition, because news contained various type and information such as social, economic, and overseas news, and corporate earnings, the present condition of type of industry, market outlook, the present condition of market and so on, it was expected that influence on stock market or significance of the relation would be different according to the type of news, and therefore each type of news was compared with fluctuation of stock price, and the results showed that market condition, outlook, and overseas news was the most useful to explain fluctuation of news. On the contrary, news about individual company was not statistically significant, but opinion mining value showed tendency opposite to stock price, and the reason can be thought to be the appearance of promotional and planned news for preventing stock price from falling. Finally, multiple regression analysis and logistic regression analysis was carried out in order to derive function of investment decision making on the basis of relation between positive/negative opinion of news and stock price, and the results showed that regression equation using variable of market conditions, outlook, and overseas news before opening of stock market was statistically significant, and classification accuracy of logistic regression accuracy results was shown to be 70.0% in rise of stock price, 78.8% in fall of stock price, and 74.6% on average. This study first analyzed relation between news and stock price through analyzing and quantifying sensitivity of atypical news contents by using opinion mining among big data analysis techniques, and furthermore, proposed and verified smart investment decision making model that could systematically carry out opinion mining and derive and support investment information. This shows that news can be used as variable to predict the price index of stocks for investment, and it is expected the model can be used as real investment support system if it is implemented as system and verified in the future.
This study aims at the psychological elucidation of some conscious aspects of the personality of Yi Sun-sin (1545-1598), the Korean national hero, and the unconscious teleologic meanings of his dreams mentioned in Nanjung Ilgi (Diary in War Time) from the viewpoint of analytical psychology of C.G. Jung. Yi Sun-sin was a man of discipline, incorporated with the spirit of Confucian filial piety, hyo (hsiao) and royalty, chung. He was a stern man but with a warm heart. In his diary, Yi Sun-sin poured forth his feelings of suffering, despair, and extreme solicitude caused by slanders of his political opponents, his grief for the loss of mother and son, and his worries about the fate of his country, which the Japanese invaders now plundered. The moon night offered him the opportunity to touch with his inner soul, by reciting poems, playing Korean string, 'Keomungo', and flute. Further, he widened his scope by asking for the answers from the 'Heaven' through divination and dream. Yi Sun-sin's attitude toward his mother who raised the future hero and maternal principles were considered in concern with the Jungian term 'mother complex'. Won Gyun, Yi Sun-sin's rival admiral, who persistently accused Yi Sun-sin of 'slanders,' certainly represents the unconscious shadow image of Yi Sun-sin. The reciprocal 'shadow' projection has intervened in the conflicting relationship between Yi and Won. In concern to the argument for the suicidal death of Yi Sun-sin, the author found no evidence supporting such an argument, No trace of latent suicidal wish was found in his dreams. For Yi Sun-sin, the determination of the life and death depends on Heaven. 32 dreams from the diary and 3 from other historical references were reviewed and analyzed in the Jungian way. Symbols of anima, Self, and individuation process were found. His dream repeatedly suggests that Yi Sun-sin is an extraordinary man chosen by the divine man (神人). In the dream, Yi Sun-sin was a disciple of the divine man receiving instructions on various strategies, and he alone could see the great thing or events. The dream of a beautiful blue and red dragon, whom he was friendly touching, indicates Yi Sun-sin's eligibility for the kingship. Yi Sun-sin seemingly did not aware of this message of the unconscious. Perhaps he sensed something special but did not identify with 'the disciple of gods' and 'royal dragon' in his dream. His modest attitude toward the dream has prevented him from falling into ego inflation. There were warning signals in two dreams that suggested disorders in the dreamer's instinctive feminine drive. Spirits of the dead father and brothers appear in the dream, giving advice or mourning for the death of Sun-sin's mother. Though Yi Sun-sin was a genuine Confucian gentleman, a dream revealed his unconscious drive to destroy the Confucian authoritative 'Persona' by trampling down the cylindrical traditional Korean hat. To the dreams of synchronicity phenomena Yi Sun-sin immediately solves the problem in concrete reality. He understood dreams as valuable messages from the superior entity, for example, the Confucian Heaven (天) or Heaven's Decree (天命). Furthermore, the 'Heaven' presumably arranged for him the way to the national hero and imposed necessary trials upon him. Both his persecutors and advocates of him guided him in the way of a hero. Yi Sun-sin followed his destiny and completed the living myth of the hero. His mother, King Seon-jo, and prime minister Liu Seong Yong, all have contributed to embodying the myth of the hero. Yi Sun-sin died and became god, the divine healer of the nation.
HARN, Chang Yawl : Studies on the dimorphism and Fertility of Persicaria japonica (MEISSNER) Gross et Nakai. Kor Jour. Bot. 3(I) 1-15 1960 Numerous investigations, since the works of DARWIN, have been made regarding the heterostylous plants by JOST (1907), CORRENS (1924), LAIBACK (1924), LEWIS (1943), and many others. Studies on the heterostylous Polygomum, however, were not reported except for the buckwhent, Fagopyrum esculentum, which was investigated by SCHOCH-BODMER (1930), EAST (1934), FROLOVA & Co-Workers (1946), MORRIS (1947, 1951) TATEBE (1949, 1951, 1953), present author (1957), and others. It is because no heterostylous species, besides buckwheat, have been known to exist in the Polygonum family. The author, during his studies on both heterostylism and fertility of Polygonaceae, has found that the species, persicaria japonica (Meissner) Gross et Nakai, is not diecious as has been known in taxonomy, but in reality beterostylous both morphologically and physiologically. It was found that this plant, regarded by taxonomist, as a male plant setting no seed, actually set seed (botanical fruit) when legitimate combination was made. Since his brief report on the dimorphic phenomens of this plant in 1956, the author's further research on the manner of fertilization has revealed that this species is a peculiar type whose dimorphism has undergone extreme specialization structurally and physiologically, the short-styled individual behaving in nature as a male plant and the long-styled individual, as female, whereas in controllled pollination the plant shows highly differentiated typical dimorphism. When compared with the other dimorphous species of this family, F. esculentum and P. sentiosa. it has been clarified that these three species differ in the degree of differentiation of their dimorphism morphologically and physiologically. That is, P. japonica has developed such a high specialization as to mislead the taxonomists, while P. senticosa shows almost no noticeable difference between long- and shortstyled individuals retaining most of the inherent physiological character cmmon to the genus except for the fact that it has two forms of flowers. F. esculentum appears to have taken the intermediate position in every respect. The result obtained in the present experiment are summarized as follows: 1) P. japonica has two kinds of individuals, one long style-short stamened; the other, short style-long stamened. The floral structure of this plants shows typical characteristics of dimorphic heterostylism. The differentiation between the two forms of flower has proceeded so highly both in primary and secondary difference of flower structure that this may be regarded as the most specialized form of dimorphism. 2) The differences of floral structure between the long and short styled individuals are remarkable compared with the other dimorphic species of the family. 3) The stamens of long styled plants show the sign of deteriolation whereas those of the short styled flower are well-developed. 4) When legitimate combinations are made, both L- and S-styled individuals are fertilized well and set seed (fruit), while in the illegitimate combination no fertilization and seed setting occur. Physiologically this species exhibits the typical behavior of dimorphic plants. 5) The self-fertile character, so common in other species of the other non-heterostyle Polygonum family, has disappeared completely. 6) Under natural conditions, no or few seed setting is observed in short styled individuals that behave as if they were male plants. 7) In hand pollination, the combination of both
This study is to investigate and analyze regional patterns of aging in Taejeon Metropolitan city-the overpopulated area of Choong-Cheong Province-by cohort analysis method. According to the population structure transition caused by rapid social and economic changes, Korea has made a rapid progress in population aging since 1970. This trend is so rapid that we should prepare for and cope with aging society. It is not only slow to cope with it in our society, but also there are few studies on population aging of the geographical field in Korea. The data of this study are the reports of Population and Housing Censuses in 1975 and 1985 and General Population and Housing Censuses with 10% sample survey in 1995 taken by National Statistical Office. The research method is to sample as the aging district the area with high aged population rate where the populations over 60 reside among total population during the years of 1975, 1985, 1995 and to sample the special districts of decreasing population where the population decreases very much and the special districts of increasing population in which the population increases greatly, presuming that the reason why aged population rate increases is that non-elderly population high in mobility moves out. It is then verified and ascertained whether it is true or not with cohort analysis method by age. Finally regional patterns in the city are found through the classification and modeling by type based on the aging district, the special districts of decreasing population, and the special districts of increasing population. The characteristics of the regional patterns show that there is social population transition and that non-elderly population moves out. The aging district with the high aged population rate is divided into high-level keeping-up type, relative falling type below the average of Taejeon city in aging progress, and relative rising type above the average of the city. This district can be found at both the central area of the city and the suburbs because Taejeon city has the characteristic of over-bounded city. But it cannot be found at the new built-up area with the in-migration of large population. The special districts of decreasing population where the population continues to decrease can be said to be the population doughnuts found at the CBD and its neighboring inner area. On the other hand, the special districts of increasing population where the population continues to increase are located at the new built-up area of the northern part in Taejeon city. The special districts of decreasing population are overlapping with the aging district and higher in aged population rate by the out-migration of non-elderly population. The special districts of increasing population are not overlapping with the aging district and lower in aged population rate by the in-migration of non-elderly population. To clarify the distribution map of the aging district, the special districts of decreasing and increasing population and the aging district are divided into four groups such as the special districts of decreasing population group-the same one as the aging district, the special districts of decreasing population group, the special districts of increasing population group, and the other district. With the cohort analysis method by age used to investigate the definite increase and decrease of aging population through population transition of each group, it is found that the progress of population aging is closely related to the social population fluctuation, especially that aged population rate is higher with the out-migration of non-elderly population. This is to explain each model of CBD, inner area, and the suburbs after modeling the aging district, the special districts of decreasing population, and the special districts of increasing population in Taejeon city. On the assumption that the city area is a concentric circle, it is possible to divide it into three areas such as CBD(A), the inner area(B), and the suburbs(C). The special districts of increasing and decreasing population in the city are divided into three districts-the special districts of decreasing population(a), the special districts of increasing population(b), and the others(c). The aging district of this city is divided into the aging district(
To survive in the global competitive environment, enterprise should be able to solve various problems and find the optimal solution effectively. The big-data is being perceived as a tool for solving enterprise problems effectively and improve competitiveness with its' various problem solving and advanced predictive capabilities. Due to its remarkable performance, the implementation of big data systems has been increased through many enterprises around the world. Currently the big-data is called the 'crude oil' of the 21st century and is expected to provide competitive superiority. The reason why the big data is in the limelight is because while the conventional IT technology has been falling behind much in its possibility level, the big data has gone beyond the technological possibility and has the advantage of being utilized to create new values such as business optimization and new business creation through analysis of big data. Since the big data has been introduced too hastily without considering the strategic value deduction and achievement obtained through the big data, however, there are difficulties in the strategic value deduction and data utilization that can be gained through big data. According to the survey result of 1,800 IT professionals from 18 countries world wide, the percentage of the corporation where the big data is being utilized well was only 28%, and many of them responded that they are having difficulties in strategic value deduction and operation through big data. The strategic value should be deducted and environment phases like corporate internal and external related regulations and systems should be considered in order to introduce big data, but these factors were not well being reflected. The cause of the failure turned out to be that the big data was introduced by way of the IT trend and surrounding environment, but it was introduced hastily in the situation where the introduction condition was not well arranged. The strategic value which can be obtained through big data should be clearly comprehended and systematic environment analysis is very important about applicability in order to introduce successful big data, but since the corporations are considering only partial achievements and technological phases that can be obtained through big data, the successful introduction is not being made. Previous study shows that most of big data researches are focused on big data concept, cases, and practical suggestions without empirical study. The purpose of this study is provide the theoretically and practically useful implementation framework and strategies of big data systems with conducting comprehensive literature review, finding influencing factors for successful big data systems implementation, and analysing empirical models. To do this, the elements which can affect the introduction intention of big data were deducted by reviewing the information system's successful factors, strategic value perception factors, considering factors for the information system introduction environment and big data related literature in order to comprehend the effect factors when the corporations introduce big data and structured questionnaire was developed. After that, the questionnaire and the statistical analysis were performed with the people in charge of the big data inside the corporations as objects. According to the statistical analysis, it was shown that the strategic value perception factor and the inside-industry environmental factors affected positively the introduction intention of big data. The theoretical, practical and political implications deducted from the study result is as follows. The frist theoretical implication is that this study has proposed theoretically effect factors which affect the introduction intention of big data by reviewing the strategic value perception and environmental factors and big data related precedent studies and proposed the variables and measurement items which were analyzed empirically and verified. This study has meaning in that it has measured the influence of each variable on the introduction intention by verifying the relationship between the independent variables and the dependent variables through structural equation model. Second, this study has defined the independent variable(strategic value perception, environment), dependent variable(introduction intention) and regulatory variable(type of business and corporate size) about big data introduction intention and has arranged theoretical base in studying big data related field empirically afterwards by developing measurement items which has obtained credibility and validity. Third, by verifying the strategic value perception factors and the significance about environmental factors proposed in the conventional precedent studies, this study will be able to give aid to the afterwards empirical study about effect factors on big data introduction. The operational implications are as follows. First, this study has arranged the empirical study base about big data field by investigating the cause and effect relationship about the influence of the strategic value perception factor and environmental factor on the introduction intention and proposing the measurement items which has obtained the justice, credibility and validity etc. Second, this study has proposed the study result that the strategic value perception factor affects positively the big data introduction intention and it has meaning in that the importance of the strategic value perception has been presented. Third, the study has proposed that the corporation which introduces big data should consider the big data introduction through precise analysis about industry's internal environment. Fourth, this study has proposed the point that the size and type of business of the corresponding corporation should be considered in introducing the big data by presenting the difference of the effect factors of big data introduction depending on the size and type of business of the corporation. The political implications are as follows. First, variety of utilization of big data is needed. The strategic value that big data has can be accessed in various ways in the product, service field, productivity field, decision making field etc and can be utilized in all the business fields based on that, but the parts that main domestic corporations are considering are limited to some parts of the products and service fields. Accordingly, in introducing big data, reviewing the phase about utilization in detail and design the big data system in a form which can maximize the utilization rate will be necessary. Second, the study is proposing the burden of the cost of the system introduction, difficulty in utilization in the system and lack of credibility in the supply corporations etc in the big data introduction phase by corporations. Since the world IT corporations are predominating the big data market, the big data introduction of domestic corporations can not but to be dependent on the foreign corporations. When considering that fact, that our country does not have global IT corporations even though it is world powerful IT country, the big data can be thought to be the chance to rear world level corporations. Accordingly, the government shall need to rear star corporations through active political support. Third, the corporations' internal and external professional manpower for the big data introduction and operation lacks. Big data is a system where how valuable data can be deducted utilizing data is more important than the system construction itself. For this, talent who are equipped with academic knowledge and experience in various fields like IT, statistics, strategy and management etc and manpower training should be implemented through systematic education for these talents. This study has arranged theoretical base for empirical studies about big data related fields by comprehending the main variables which affect the big data introduction intention and verifying them and is expected to be able to propose useful guidelines for the corporations and policy developers who are considering big data implementationby analyzing empirically that theoretical base.
It was planned and performed to study the possibility on the use of inexpensive and easily acquirable foliage powder, which processed by pulverizing after dried, instead of imported expensive wheat flour for the extending of plywood adhesives. Pine leaves of softwood trees, Poplar, Oak and Sycamore leaves of broad leaved species were selected and harvested to pulverize into the minute foliage powder. The harvested foliages from each selected species were pulverized into 40 mesh particles after dried at