• Title/Summary/Keyword: 텍스트 구성

Search Result 865, Processing Time 0.023 seconds

Semantic Visualization of Dynamic Topic Modeling (다이내믹 토픽 모델링의 의미적 시각화 방법론)

  • Yeon, Jinwook;Boo, Hyunkyung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.131-154
    • /
    • 2022
  • Recently, researches on unstructured data analysis have been actively conducted with the development of information and communication technology. In particular, topic modeling is a representative technique for discovering core topics from massive text data. In the early stages of topic modeling, most studies focused only on topic discovery. As the topic modeling field matured, studies on the change of the topic according to the change of time began to be carried out. Accordingly, interest in dynamic topic modeling that handle changes in keywords constituting the topic is also increasing. Dynamic topic modeling identifies major topics from the data of the initial period and manages the change and flow of topics in a way that utilizes topic information of the previous period to derive further topics in subsequent periods. However, it is very difficult to understand and interpret the results of dynamic topic modeling. The results of traditional dynamic topic modeling simply reveal changes in keywords and their rankings. However, this information is insufficient to represent how the meaning of the topic has changed. Therefore, in this study, we propose a method to visualize topics by period by reflecting the meaning of keywords in each topic. In addition, we propose a method that can intuitively interpret changes in topics and relationships between or among topics. The detailed method of visualizing topics by period is as follows. In the first step, dynamic topic modeling is implemented to derive the top keywords of each period and their weight from text data. In the second step, we derive vectors of top keywords of each topic from the pre-trained word embedding model. Then, we perform dimension reduction for the extracted vectors. Then, we formulate a semantic vector of each topic by calculating weight sum of keywords in each vector using topic weight of each keyword. In the third step, we visualize the semantic vector of each topic using matplotlib, and analyze the relationship between or among the topics based on the visualized result. The change of topic can be interpreted in the following manners. From the result of dynamic topic modeling, we identify rising top 5 keywords and descending top 5 keywords for each period to show the change of the topic. Existing many topic visualization studies usually visualize keywords of each topic, but our approach proposed in this study differs from previous studies in that it attempts to visualize each topic itself. To evaluate the practical applicability of the proposed methodology, we performed an experiment on 1,847 abstracts of artificial intelligence-related papers. The experiment was performed by dividing abstracts of artificial intelligence-related papers into three periods (2016-2017, 2018-2019, 2020-2021). We selected seven topics based on the consistency score, and utilized the pre-trained word embedding model of Word2vec trained with 'Wikipedia', an Internet encyclopedia. Based on the proposed methodology, we generated a semantic vector for each topic. Through this, by reflecting the meaning of keywords, we visualized and interpreted the themes by period. Through these experiments, we confirmed that the rising and descending of the topic weight of a keyword can be usefully used to interpret the semantic change of the corresponding topic and to grasp the relationship among topics. In this study, to overcome the limitations of dynamic topic modeling results, we used word embedding and dimension reduction techniques to visualize topics by era. The results of this study are meaningful in that they broadened the scope of topic understanding through the visualization of dynamic topic modeling results. In addition, the academic contribution can be acknowledged in that it laid the foundation for follow-up studies using various word embeddings and dimensionality reduction techniques to improve the performance of the proposed methodology.

Reader-Response Criticism about the Functional relation of Romance, Women and Patriarchy -Based on Janice A. Radway's Reading the Romance: Women, Patriarchy and Popular Literature (로맨스, 여성, 가부장제의 함수관계에 대한 독자반응비평 -제니스 A. 래드웨이의 『로맨스 읽기: 여성, 가부장제와 대중문학』을 중심으로)

  • Lee, Jung-Oak
    • Journal of Popular Narrative
    • /
    • v.25 no.3
    • /
    • pp.349-383
    • /
    • 2019
  • This paper examined the meaning and task of romance research with a focus on Reading the Romance(1984) by Janice A. Radway. This book, which analyzes romance texts by examining the situation and meaning of reading romance by women readers integrating between cultural studies and literary studies, is one of the most popular studies on the romance genre. Radway scrutinized the practical significance of reading romance in a community of women readers. Through a study involving questionnaires and in-depth interviews, she found that for women, romance reading is a 'compensatory fiction' that brings happiness and emotional redemption through a sense of liberation achieved by escaping from patriarchal daily life. The romance that women prefer is composed of 4 stages and 13 divisions: 'Encounter → Attest → Recovery → Happy End'. It also maintains a formula that begins with an immature female character's identity crisis and ends with a blissful union that recognizes the intrinsic value of the main character, who has turned into a man who is considerate of the women. Therefore, romance plays the role of pursuit of the 'female utopian fantasy' and at the same time a reconciliation of women to patriarchy. Feminist critics of the day criticized this argument. However, reading romance is a 'feminine reading', and romance is literature about the functional relationship between women's lives and patriarchy. Yet the interpretation could differ depending on the different viewpoints and definitions of the women's utopian fantasy. In recent years, the conditions of female reader's lives, awareness and imagination have been changing rapidly. As a result, the female utopian fantasy has also changed significantly. Nevertheless, women's lives in the real patriarchal system are still contradictory, and their adventurous imagination is spreading in alternative spaces such as the subculture. In this regard, the question is about the definition of romance and the meanings of romance research are still important task.

The Daily History and Self-consciousness of Jeonju Citizens: Two Examples of Reading Groups (전주 시민의 일상사와 자기의식 『혼불』과 공유지(Commons)의 사례)

  • Oh, Hangnyeong
    • The Korean Journal of Archival Studies
    • /
    • no.81
    • /
    • pp.5-44
    • /
    • 2024
  • This paper is an experience and observation report on the activities of Jeonju citizens, who are 'kleine leutes'. Text or Born-digital materials such as diaries, group chat rooms, memos, and interviews showing citizens' contemporary and daily history (Alltagsgeschichite) were used for this purpose. These civic groups are reading groups we can find easily and they also enjoy walking, hiking, and movies, and so to speak ordinary local people are their members. One team read Choi Myung-hee's "Honbul" for about a year and a half, while another team read several books under the theme of "commons," and enjoyed exploring, exhibiting, or watching movies together. The main text is composed of three parts. First, I looked at the methods and perspectives to examine the daily life of local people. To this end, the views of Detlev Peukert and Alf Lüdtke, who captured the prospects and the possibilities of theories of daily history, and James C. Scott, who provided insight into infra-politics, were reviewed. This work was to find the perspective and method of daily history research that could observe the activities of Jeonju citizens. Second, we looked at the experience of the "Honbool" meeting. The reading of "Honbool" which took place during the period of confrontation with Covid19 began in connection with its intense locality. As the criticism of "a great writer born in our local land" relieved the uncomfortable feelings, the members' critical mind was revealed after Volume3 of "Honbool" and stood out after Volume6. It seemed to show the characteristics of the self-consciousness (Eigensinn) of citizens who choose dynamics rather than being stuck to a specific form of empathy (Betroffenheit). I think it showed the difficulty and hope to face in the description and research of local history at the same time. Third, I observed citizens who gathered on the subject of public land. This meeting showed the actuality and accumulation process of the infra-political capabilities of citizens in Jeonju. Reading-commons did not suffer from 'heart trouble' as a local citizen compared to "Honbool". Rather, the difficulty of related books was an obstacle, and the difficulty was easily resolved. As the meeting progressed, awareness of the commons became more sophisticated and issues and discussions were independently shared with each other, and a wealth of hidden transcripts were accumulated through its practice and problem consciousness. It is difficult to think about modern daily life apart from the capitalist era. More fundamentally, it is here and now in everyday life that humans enjoy or suffer from. All history passes through my body here and now. This is the universality of daily history. It depends on the ability of citizens to create daily history to experience and at the same time maintain the distance of criticism.

Stock-Index Invest Model Using News Big Data Opinion Mining (뉴스와 주가 : 빅데이터 감성분석을 통한 지능형 투자의사결정모형)

  • Kim, Yoo-Sin;Kim, Nam-Gyu;Jeong, Seung-Ryul
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.143-156
    • /
    • 2012
  • People easily believe that news and stock index are closely related. They think that securing news before anyone else can help them forecast the stock prices and enjoy great profit, or perhaps capture the investment opportunity. However, it is no easy feat to determine to what extent the two are related, come up with the investment decision based on news, or find out such investment information is valid. If the significance of news and its impact on the stock market are analyzed, it will be possible to extract the information that can assist the investment decisions. The reality however is that the world is inundated with a massive wave of news in real time. And news is not patterned text. This study suggests the stock-index invest model based on "News Big Data" opinion mining that systematically collects, categorizes and analyzes the news and creates investment information. To verify the validity of the model, the relationship between the result of news opinion mining and stock-index was empirically analyzed by using statistics. Steps in the mining that converts news into information for investment decision making, are as follows. First, it is indexing information of news after getting a supply of news from news provider that collects news on real-time basis. Not only contents of news but also various information such as media, time, and news type and so on are collected and classified, and then are reworked as variable from which investment decision making can be inferred. Next step is to derive word that can judge polarity by separating text of news contents into morpheme, and to tag positive/negative polarity of each word by comparing this with sentimental dictionary. Third, positive/negative polarity of news is judged by using indexed classification information and scoring rule, and then final investment decision making information is derived according to daily scoring criteria. For this study, KOSPI index and its fluctuation range has been collected for 63 days that stock market was open during 3 months from July 2011 to September in Korea Exchange, and news data was collected by parsing 766 articles of economic news media M company on web page among article carried on stock information>news>main news of portal site Naver.com. In change of the price index of stocks during 3 months, it rose on 33 days and fell on 30 days, and news contents included 197 news articles before opening of stock market, 385 news articles during the session, 184 news articles after closing of market. Results of mining of collected news contents and of comparison with stock price showed that positive/negative opinion of news contents had significant relation with stock price, and change of the price index of stocks could be better explained in case of applying news opinion by deriving in positive/negative ratio instead of judging between simplified positive and negative opinion. And in order to check whether news had an effect on fluctuation of stock price, or at least went ahead of fluctuation of stock price, in the results that change of stock price was compared only with news happening before opening of stock market, it was verified to be statistically significant as well. In addition, because news contained various type and information such as social, economic, and overseas news, and corporate earnings, the present condition of type of industry, market outlook, the present condition of market and so on, it was expected that influence on stock market or significance of the relation would be different according to the type of news, and therefore each type of news was compared with fluctuation of stock price, and the results showed that market condition, outlook, and overseas news was the most useful to explain fluctuation of news. On the contrary, news about individual company was not statistically significant, but opinion mining value showed tendency opposite to stock price, and the reason can be thought to be the appearance of promotional and planned news for preventing stock price from falling. Finally, multiple regression analysis and logistic regression analysis was carried out in order to derive function of investment decision making on the basis of relation between positive/negative opinion of news and stock price, and the results showed that regression equation using variable of market conditions, outlook, and overseas news before opening of stock market was statistically significant, and classification accuracy of logistic regression accuracy results was shown to be 70.0% in rise of stock price, 78.8% in fall of stock price, and 74.6% on average. This study first analyzed relation between news and stock price through analyzing and quantifying sensitivity of atypical news contents by using opinion mining among big data analysis techniques, and furthermore, proposed and verified smart investment decision making model that could systematically carry out opinion mining and derive and support investment information. This shows that news can be used as variable to predict the price index of stocks for investment, and it is expected the model can be used as real investment support system if it is implemented as system and verified in the future.

A Mobile Landmarks Guide : Outdoor Augmented Reality based on LOD and Contextual Device (모바일 랜드마크 가이드 : LOD와 문맥적 장치 기반의 실외 증강현실)

  • Zhao, Bi-Cheng;Rosli, Ahmad Nurzid;Jang, Chol-Hee;Lee, Kee-Sung;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.1
    • /
    • pp.1-21
    • /
    • 2012
  • In recent years, mobile phone has experienced an extremely fast evolution. It is equipped with high-quality color displays, high resolution cameras, and real-time accelerated 3D graphics. In addition, some other features are includes GPS sensor and Digital Compass, etc. This evolution advent significantly helps the application developers to use the power of smart-phones, to create a rich environment that offers a wide range of services and exciting possibilities. To date mobile AR in outdoor research there are many popular location-based AR services, such Layar and Wikitude. These systems have big limitation the AR contents hardly overlaid on the real target. Another research is context-based AR services using image recognition and tracking. The AR contents are precisely overlaid on the real target. But the real-time performance is restricted by the retrieval time and hardly implement in large scale area. In our work, we exploit to combine advantages of location-based AR with context-based AR. The system can easily find out surrounding landmarks first and then do the recognition and tracking with them. The proposed system mainly consists of two major parts-landmark browsing module and annotation module. In landmark browsing module, user can view an augmented virtual information (information media), such as text, picture and video on their smart-phone viewfinder, when they pointing out their smart-phone to a certain building or landmark. For this, landmark recognition technique is applied in this work. SURF point-based features are used in the matching process due to their robustness. To ensure the image retrieval and matching processes is fast enough for real time tracking, we exploit the contextual device (GPS and digital compass) information. This is necessary to select the nearest and pointed orientation landmarks from the database. The queried image is only matched with this selected data. Therefore, the speed for matching will be significantly increased. Secondly is the annotation module. Instead of viewing only the augmented information media, user can create virtual annotation based on linked data. Having to know a full knowledge about the landmark, are not necessary required. They can simply look for the appropriate topic by searching it with a keyword in linked data. With this, it helps the system to find out target URI in order to generate correct AR contents. On the other hand, in order to recognize target landmarks, images of selected building or landmark are captured from different angle and distance. This procedure looks like a similar processing of building a connection between the real building and the virtual information existed in the Linked Open Data. In our experiments, search range in the database is reduced by clustering images into groups according to their coordinates. A Grid-base clustering method and user location information are used to restrict the retrieval range. Comparing the existed research using cluster and GPS information the retrieval time is around 70~80ms. Experiment results show our approach the retrieval time reduces to around 18~20ms in average. Therefore the totally processing time is reduced from 490~540ms to 438~480ms. The performance improvement will be more obvious when the database growing. It demonstrates the proposed system is efficient and robust in many cases.