
Efficient Topic Modeling by Mapping Global and Local Topics (전역 토픽의 지역 매핑을 통한 효율적 토픽 모델링 방안)

Choi, Hochang;Kim, Namgyu
- Journal of Intelligence and Information Systems, v.23 no.3, pp.69-94, 2017
Recently, growing demand for big data analysis has driven the vigorous development of related technologies and tools. In addition, advances in IT and the increasing penetration of smart devices are producing large amounts of data. As a result, data analysis technology is rapidly becoming popular, and attempts to gain insights through data analysis are continuously increasing. This means that big data analysis will become more important in various industries for the foreseeable future. Big data analysis has generally been performed by a small number of experts and delivered to those who request the analysis. However, growing interest in big data analysis has stimulated computer programming education and the development of many data analysis programs. Accordingly, the entry barriers to big data analysis are gradually lowering and data analysis technology is spreading. As a result, big data analysis is expected to be performed by the requesters themselves. Along with this, interest in various kinds of unstructured data is continually increasing, and much attention is focused on text data in particular. The emergence of new web-based platforms and techniques has brought about the mass production of text data and active attempts to analyze it. Furthermore, the results of text analysis have been utilized in various fields. Text mining is a concept that embraces various theories and techniques for text analysis. Among the many text mining techniques used for various research purposes, topic modeling is one of the most widely used and studied. Topic modeling is a technique that extracts the major issues from a large number of documents, identifies the documents that correspond to each issue, and provides the identified documents as a cluster. It is regarded as a very useful technique in that it reflects the semantic elements of the documents.
Traditional topic modeling is based on the distribution of key terms across the entire document set. Thus, the entire document set must be analyzed at once to identify the topic of each document. This requirement makes the analysis slow when topic modeling is applied to a large number of documents. In addition, it creates a scalability problem: processing time increases exponentially as the number of analysis objects grows. This problem is particularly noticeable when the documents are distributed across multiple systems or regions. To overcome these problems, a divide-and-conquer approach can be applied to topic modeling. This means dividing a large number of documents into sub-units and deriving topics by repeating topic modeling on each unit. This method allows topic modeling on a large number of documents with limited system resources and can improve the processing speed of topic modeling. It can also significantly reduce analysis time and cost, because documents can be analyzed in each location without first being combined. Despite these advantages, however, the method has two major problems. First, the relationship between the local topics derived from each unit and the global topics derived from the entire document set is unclear. That is, local topics can be identified for each document, but global topics cannot. Second, a method for measuring the accuracy of such a methodology needs to be established. That is, taking the global topics as the ideal answer, the deviation of a local topic from a global topic needs to be measured. Because of these difficulties, this approach has not been studied sufficiently compared with other work on topic modeling. In this paper, we propose a topic modeling approach that solves the above two problems.
First, we divide the entire document cluster (global set) into sub-clusters (local sets) and generate a reduced entire document cluster (RGS, reduced global set) that consists of representative documents extracted from each local set. We address the first problem by mapping RGS topics to local topics. We then verify the accuracy of the proposed methodology by checking whether documents are assigned to the same topic in the global and local results. Using 24,000 news articles, we conduct experiments to evaluate the practical applicability of the proposed methodology. Through an additional experiment, we confirm that the proposed methodology can provide results similar to those of topic modeling on the entire set. We also propose a reasonable method for comparing the results of both methods.
https://doi.org/10.13088/jiis.2017.23.3.069
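The mapping between local topics and RGS topics described in the abstract above could be sketched roughly as follows. This is only an illustration: the abstract does not say how the mapping is computed, so the use of cosine similarity over term-weight vectors, the function names, and the toy topics are all assumptions.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def map_local_to_rgs(local_topics, rgs_topics):
    """Assign each local topic to its most similar RGS topic.

    Assumption: similarity is measured by cosine over term weights;
    the paper's actual mapping criterion is not given in the abstract.
    """
    mapping = {}
    for name, vec in local_topics.items():
        mapping[name] = max(rgs_topics, key=lambda g: cosine(vec, rgs_topics[g]))
    return mapping

# Hypothetical toy topics (term -> weight), not data from the paper.
rgs_topics = {
    "G1": {"election": 0.6, "party": 0.4},
    "G2": {"stock": 0.7, "market": 0.3},
}
local_topics = {
    "set1_t1": {"party": 0.5, "vote": 0.3, "election": 0.2},
    "set2_t1": {"market": 0.6, "stock": 0.4},
}
mapping = map_local_to_rgs(local_topics, rgs_topics)
print(mapping)
```

With such a mapping in hand, each document's local topic can be translated into an RGS topic and compared against the topic it receives when the whole set is modeled at once.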

Suggestion of Urban Regeneration Type Recommendation System Based on Local Characteristics Using Text Mining (텍스트 마이닝을 활용한 지역 특성 기반 도시재생 유형 추천 시스템 제안)

Kim, Ikjun;Lee, Junho;Kim, Hyomin;Kang, Juyoung
- Journal of Intelligence and Information Systems, v.26 no.3, pp.149-169, 2020
"The Urban Regeneration New Deal project", one of the government's major national projects, aims to develop underdeveloped areas by investing 50 trillion won in 100 locations in the first year and 500 over the next four years. This project is drawing keen attention from the media and local governments. However, the project model fails to reflect the original characteristics of each area, because it divides project areas into only five categories: "Our Neighborhood Restoration, Housing Maintenance Support Type, General Neighborhood Type, Central Urban Type, and Economic Base Type." The keywords for successful urban regeneration in Korea, "resident participation," "regional specialization," "ministerial cooperation," and "public-private cooperation," suggest that when local governments propose urban regeneration projects to the government, it is most important to accurately understand the characteristics of the city and to push ahead with the projects in a way that suits those characteristics, with the help of local residents and private companies. In addition, considering the gentrification problem, which is one of the side effects of urban regeneration projects, it is important to select and implement urban regeneration types suitable for the characteristics of the area. To supplement the limitations of the 'Urban Regeneration New Deal Project' methodology, this study proposes a system that recommends suitable urban regeneration types for urban regeneration sites using various machine learning algorithms, referring to the urban regeneration types of the '2025 Seoul Metropolitan Government Urban Regeneration Strategy Plan', which is based on regional characteristics. There are four types of urban regeneration in Seoul: "Low-use Low-Level Development, Abandonment, Deteriorated Housing, and Specialization of Historical and Cultural Resources" (Shon and Park, 2017).
To identify regional characteristics, approximately 100,000 text data items were collected for 22 regions where projects of the four urban regeneration types had been carried out. Using the collected data, we extracted key keywords for each region by urban regeneration type and conducted topic modeling to explore whether the types differed. As a result, a number of topics related to real estate and the economy appeared in old residential areas, while in declining and underdeveloped areas, topics appeared that reflected the characteristics of areas where industrial activities had been active in the past. The historical and cultural resource areas contain traces of the past, so many keywords related to the government appeared, and political and cultural topics resulting from various events could be confirmed. Finally, in low-use and under-developed areas, many topics on real estate and accessibility emerged, indicating good accessibility; these areas mainly had the characteristics of regions where development is planned or likely. Furthermore, a model was implemented that proposes urban regeneration types tailored to regional characteristics for regions other than Seoul. Machine learning was used to implement the model, with training and test data randomly split at an 8:2 ratio. To compare performance across models, two kinds of input variables (Count Vector and TF-IDF Vector) were combined with five classifiers: SVM (Support Vector Machine), Decision Tree, Random Forest, Logistic Regression, and Gradient Boosting. Performance was thus compared for a total of 10 models. The best-performing model was Gradient Boosting with TF-IDF Vector input, with an accuracy of 97%.
Therefore, the recommendation system proposed in this study is expected to recommend urban regeneration types based on the regional characteristics of new project sites in the process of carrying out urban regeneration projects.
https://doi.org/10.13088/jiis.2020.26.3.169
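To make the two input representations and the 8:2 split mentioned in the abstract above more concrete, here is a minimal standard-library sketch. It is not the authors' pipeline: the smoothed IDF formula, the whitespace tokenizer, the function names, and the toy documents are assumptions chosen for illustration, and the classifiers themselves are omitted.

```python
import math
import random
from collections import Counter

def count_vectors(docs):
    """Count Vector: raw term frequencies per document."""
    return [Counter(doc.split()) for doc in docs]

def tfidf_vectors(docs):
    """TF-IDF Vector: term frequency weighted by a smoothed inverse document frequency.

    Assumption: idf = log((1 + n) / (1 + df)) + 1, one common smoothed variant.
    """
    counts = count_vectors(docs)
    n = len(docs)
    df = Counter(term for c in counts for term in c)  # document frequency per term
    idf = {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}
    return [{t: tf * idf[t] for t, tf in c.items()} for c in counts]

def split_train_test(items, seed=0):
    """Randomly split items into training and test sets at an 8:2 ratio."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 8 // 10
    return shuffled[:cut], shuffled[cut:]

# Hypothetical toy documents, not data from the study.
docs = ["housing price housing", "factory industry decline", "housing industry"]
counts = count_vectors(docs)
tfidf = tfidf_vectors(docs)
train, test = split_train_test(range(10))
print(counts[0], tfidf[0], len(train), len(test))
```

In the TF-IDF representation a term that occurs in fewer documents ("price") receives a larger per-occurrence weight than a more widespread one ("housing"), which is the main difference from the plain count representation.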

Studies on the Directivity of Gokjungkyeong(Kyung Overlapped with Gok) which was specified in Byeokgye-ri, Yangpyeong-gun and the Hwaseo Lee, Hang-ro's Management in Byeokwon Garden (양평 벽계리에 설정된 곡중경(曲中景)의 지향성과 화서(華西) 이항로(李恒老)의 벽원(蘗園) 경영)

Jung, Woo-Jin;Rho, Jae-Hyun
- Journal of the Korean Institute of Traditional Landscape Architecture, v.34 no.3, pp.78-97, 2016
The objectives of this study are to examine, through literature review and site investigations, the context in which Suhoe Gugok, Byeokgye Gugok Valley, and Nosan Palkyung were established in Seojong-myeon, Yangpyeong-gun, and to determine the sceneries of the Byeokgye scenic site as enjoyed and managed during the period of Hwaseo Lee, Hang-ro(華西李恒老). The results of the study are as follows. First, Byeokgye Gugok Valley(黃蘗九曲) and Nosan Palkyung(蘆山八景) were established after the period of Hwaseo and have been theorized to center on key scenic areas associated with Hwaseo's activities; the analysis showed that they were collections of sceneries of modern times. The extensive overlap between Byeokgye Gugok Valley and the concentrated scenic elements of Suhoe Gugok(水回九曲), and the artificial configuration from the end point of Suhoe Gugok to the beginning point of Nosan Palkyung, reveal a pattern of spatial conflict and hegemony between the Byeokgyes of Suip-ri and Nomun-ri. This is likely caused by the conflict between the historicity of the group that enjoyed Byeokgye prior to Hwaseo's period and the strong territoriality of the space filled with the image of Hwaseo. Second, Byeokgye Gugok Valley was a secondary spatial system created by selecting the most scenic sites in Suip-ri while expanding the area of Nosan Palkyung. After the establishment of Byeokgye Gugok Valley, the spatial identity of the entire Byeokgyecheon area was effectively established. This was a "Hwaseo-oriented" move that included the complete exclusion of scenic sites from the pre-Hwaseo period, such as Cheongseo Gujang and the Letters Carved on the Rock of Suhoe Gugok. Consequently, the entire Byeokgyecheon area was reorganized into a cultural scenic site under Hwaseo's influence.
Third, the creation of Gugok(九曲) to establish the lineage of the Hwaseo School from Juja(朱子) to Yulgok(栗谷) to Uam(尤庵) to Hwaseo is likely to have been the occasion and external motivation for the establishment of the new Gugok and Palkyung. In other words, Nosan Palkyung and Byeokgye Gugok Valley are likely to have been created as a reaction to the shift of the center of the Hwaseo School to Okgyedong, with a strategic orientation based on motivations and needs such as creating a connecting space between Mui Gugok, Gosan Gugok, and Okgye Gugok, and elevating Hwaseo's status. Fourth, from Hwaseo's Li-centric point of view, all the revered sites in Byeokwon(蘗園) that he managed existed as spatial creative works for experiencing the existence of "li" through the objects in the landscape and the boundary of the spirit of emptiness of the aesthetic self. This clearly shows how Byeokgye Gugok Valley and Nosan Palkyung must be defined, and furthermore appreciated and approached, before they are discussed as spaces associated with Hwaseo. Fifth, Nosan Palkyung was composed of cultural scenic landscapes of Gokjungkyeong(曲中景), with eight scenic sites where Hwaseo gave his teachings and spent time, in the Byeokgye of the Nomun-ri area of Byeokgye Gugok Valley. The sceneries were, however, collected by relying on Hwaseo's Letters Carved on the Rock and poetry. Consequently, the inner richness of Nosan Palkyung is satisfied alongside Byeokgye Gugok Valley, but its conceptual adequacy leaves room for questions.
https://doi.org/10.14700/KITLA.2016.34.3.078

Analysis of the Time-dependent Relation between TV Ratings and the Content of Microblogs (TV 시청률과 마이크로블로그 내용어와의 시간대별 관계 분석)

Choeh, Joon Yeon;Baek, Haedeuk;Choi, Jinho
- Journal of Intelligence and Information Systems, v.20 no.1, pp.163-176, 2014
Social media is becoming the platform where users communicate their activities, status, emotions, and experiences to other people. In recent years, microblogs such as Twitter have gained popularity because of their ease of use, speed, and reach. Compared to a conventional web blog, a microblog lowers users' effort and investment in content generation by encouraging shorter posts. There has been a lot of research into capturing social phenomena and analyzing the chatter of microblogs. However, measuring television ratings has received little attention so far. Currently, the most common method of measuring TV ratings uses an electronic metering device installed in a small number of sampled households. Microblogs allow users to post short messages, share daily updates, and conveniently keep in touch. In a similar way, microblog users interact with each other while watching television or movies, or visiting a new place. When measuring TV ratings, some features are significant during certain hours of the day or days of the week, whereas the same features are meaningless during other time periods. Thus, the importance of features can change during the day, and a model that captures this time-sensitive relevance is required to estimate TV ratings. Modeling the time-related characteristics of features is therefore key when measuring TV ratings through microblogs. We show that capturing the time dependency of features is vital for improving the accuracy of TV ratings measurement. To explore the relationship between the content of microblogs and TV ratings, we collected Twitter data using the Get Search component of the Twitter REST API from January 2013 to October 2013. There are about 300 thousand posts in our data set for the experiment. After excluding data such as advertising or promoted tweets, we selected 149 thousand tweets for analysis.
The number of tweets reaches its maximum level on the broadcasting day and increases rapidly around the broadcasting time. This result stems from the characteristics of the public channel, which broadcasts the program at a predetermined time. From our analysis, we find that count-based features such as the number of tweets or retweets have a low correlation with TV ratings. This result implies that a simple tweet rate does not reflect the satisfaction with or response to the TV programs. Content-based features extracted from the content of tweets have a relatively high correlation with TV ratings. Further, some emoticons or newly coined words that are not tagged in the morpheme extraction process have a strong relationship with TV ratings. We find that the correlation of features shows a time dependency between the periods before and after the broadcasting time. Since the TV program is broadcast regularly at a predetermined time, users post tweets expressing their expectation for the program or their disappointment over not being able to watch it. The features that are highly correlated before the broadcast differ from those that are highly correlated after it. This result shows that the relevance of words to TV programs can change according to the time of the tweets. Among the 336 words that fulfill the minimum requirements for candidate features, 145 words have their highest correlation before the broadcasting time, whereas 68 words reach their highest correlation after broadcasting. Interestingly, some words that express the impossibility of watching the program show a high relevance, despite carrying a negative meaning. Understanding the time dependency of features can help improve the accuracy of TV ratings measurement. This research provides a basis for estimating the response to or satisfaction with broadcast programs using the time dependency of words in Twitter chatter.
More research is needed to refine the methodology for predicting or measuring TV ratings.
https://doi.org/10.13088/jiis.2014.20.1.163
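The time-dependent correlation described in the abstract above can be illustrated with a small sketch that correlates a word's frequency with ratings separately for tweets posted before and after the broadcast. The abstract does not name the correlation measure, so the use of the Pearson coefficient is an assumption, and all numbers below are made up for illustration.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y) if sd_x and sd_y else 0.0

# Hypothetical per-episode values, not data from the study.
ratings = [10.0, 12.0, 15.0, 11.0, 14.0]   # TV rating of each episode
word_before = [3, 5, 9, 4, 8]              # word count in tweets before broadcast
word_after = [7, 6, 7, 6, 7]               # word count in tweets after broadcast

r_before = pearson(word_before, ratings)
r_after = pearson(word_after, ratings)
print(round(r_before, 3), round(r_after, 3))
```

In this toy data the same word tracks ratings closely before the broadcast but only weakly afterwards, which is the kind of time dependency the abstract reports for its candidate words.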

Solution Phase Photolyses of Substituted Diphenyl Ether Herbicides under Simulated Environmental Conditions (모조(模造) 환경조건하(環境條件下)에서의 치환(置換) Diphenyl Ether 제초제(除草劑)의 광분해(光分解)에 관(關)한 연구(硏究))

Lee, Jae-Koo
- Applied Biological Chemistry, v.17 no.3, pp.149-176, 1974
Eight substituted diphenyl ether herbicides and some of their photoproducts were studied in terms of solution phase photolysis under simulated environmental conditions using a Rayonet photochemical reactor. The test compounds absorbed sufficient light energy at a wavelength of 300 nm to undergo various photoreactions. All the photoproducts were confirmed by means of TLC, GLC, IR, MS, and/or NMR spectrometry. The results obtained are summarized as follows: Solution phase photolysis of C-6989: The exceedingly large amount of p-nitrophenol formed strongly indicates that cleavage of the ether linkage occurs readily as the main reaction of this compound in all solvents used. Photoreduction of nitro to amino group(s) and photooxidation of trifluoromethyl to carboxyl group were recognized as minor reactions. Aqueous photolysis of p-nitrophenol: Quinone (0.28%), hydroquinone (0.66%), and p-aminophenol (0.42%) were confirmed as photoproducts, in addition to a relatively small amount of an unknown compound. The mechanisms of formation of these products were proposed to be the nitro-nitrite rearrangement via $n \rightarrow \pi^*$ excitation and photoreduction through hydrogen abstractions by radicals, respectively. Solution phase photolysis of Nitrofen: Photochemical reduction leading to the p-amino derivative was the main reaction in n-hexane. In aqueous solution, the photoreduction of nitro to amino group and hydroxylation predominated over the ether linkage cleavage. Nucleophilic displacement of the nitro group by hydroxide ion and replacement of chlorine substituents by hydroxyl group or, to a lesser extent, hydrogen were also observed as minor reactions. Solution phase photolysis of MO-338: Photoreduction of the nitro to amino group was marked in the n-hexane solution photolysis.
In the aqueous solution, photoreduction of the nitro substituent and hydroxylation were the main reactions with replacement of chlorine substituents by the hydroxyl group and hydrogen, and cleavage of the ether linkage as minor reactions. Photolyses of MC-4379, MC-3761, MC-5127, MC-6063, and MC-7181 in n-hexane and cyclohexane: Photoreduction of the nitro group leading to the corresponding amino derivative and replacement of one of the halogen substituents by hydrogen from the solvent used were the key reactions in each compound. Aqueous photolysis of MC-4379: Cleavage of the ether linkage, replacement of the carboxymethyl by hydroxyl group, hydroxylation, and replacement of the nitro by hydroxy group were prominent with photoreduction and dechlorination as minor reactions. Aqueous photolysis of MC-3761: Cleavage of the ether linkage, replacement of the carboxymethyl by hydroxyl group, and photoreduction followed by hydroxylation were the main reactions. Aqueous photolysis of MC-5127: Replacement of carboxyethyl by hydrogen was predominant with ether linkage cleavage, photoreduction, and dechlorination as minor reactions. It was obvious that the decarboxyethylation proceeded more readily than decarboxymethylation occurring in the other compounds. Aqueous photolysis of MC-6063: Cleavage of the ether linkage and photodechlorination were the main reactions. Aqueous photolysis of MC-7181: Replacement of the carboxymethyl group by hydrogen and monodechlorination were the remarkable reactions. Cleavage of the ether linkage and hydroxylation were thought to be the minor reactions. Aqueous photolysis of 3-carboxymethyl-4-nitrophenol: The photo-induced Fries rearrangement common to aromatic esters did not appear to occur in the carboxymethyl group of this type of compound. Conversion of nitro to nitroso group was the main reaction.

NFC-based Smartwork Service Model Design (NFC 기반의 스마트워크 서비스 모델 설계)

Park, Arum;Kang, Min Su;Jun, Jungho;Lee, Kyoung Jun
- Journal of Intelligence and Information Systems, v.19 no.2, pp.157-175, 2013
Since the Korean government announced its 'Smartwork promotion strategy' in 2010, Korean firms and government organizations have started to adopt smartwork. However, smartwork has been implemented only in a few large enterprises and government organizations rather than in SMEs (small and medium enterprises). In the USA, both Yahoo! and Best Buy have stopped their flexible work programs because of reported low productivity and job loafing problems. In addition, from the literature on smartwork, we identify obstacles to smartwork adoption and categorize them into three types: institutional, organizational, and technological. The first category, institutional obstacles, includes the difficulty of defining smartwork performance evaluation metrics, the lack of readiness of organizational processes, the limited range of smartwork types and models, the lack of employee participation in the smartwork adoption procedure, the high cost of building a smartwork system, and insufficient government support. The second category, organizational obstacles, includes the limitations of the organizational hierarchy, misperceptions among employees and employers, difficulty in close collaboration, low productivity with remote coworkers, insufficient understanding of remote working, and lack of training about smartwork. The third category, technological obstacles, includes security concerns about mobile work, the lack of specialized solutions, and the lack of adoption and operation know-how. To overcome the current problems of smartwork in practice and the obstacles reported in the literature, we suggest a novel smartwork service model based on NFC (Near Field Communication). The proposed NFC-based Smartwork Service Model is composed of an NFC-based Smartworker networking service and an NFC-based Smartwork space management service. The NFC-based smartworker networking service comprises an NFC-based communication/SNS service and an NFC-based recruiting/job-seeking service.
The NFC-based communication/SNS service model addresses the key shortcomings of existing smartwork service models. By connecting to a company's existing legacy system through NFC tags and systems, it can overcome low productivity and the difficulty of collaboration and attendance management: managers can obtain employees' work processing, work time, and work space information, and employees can communicate with coworkers in real time and obtain coworkers' location information. In short, this service model features affordable system cost, provision of location-based information, and the possibility of knowledge accumulation. The NFC-based recruiting/job-seeking service provides new value by linking NFC tag services with sharing economy sites. This service model features easy service attachment and removal, efficient space-based work provision, easy search of location-based recruiting/job-seeking information, and system flexibility. It combines the advantages of sharing economy sites with the advantages of NFC. Through cooperation with sharing economy sites, the model can provide recruiters with human resources seeking not only long-term but also short-term work. Additionally, SMEs can easily find job seekers by attaching NFC tags to any space where qualified human resources may be located. In short, this service model supports efficient human resource distribution by providing the location of job hunters and job applicants. The NFC-based smartwork space management service can promote smartwork by linking NFC tags attached to the work space with the existing smartwork system. This service features low cost, provision of indoor and outdoor location information, and customized service. In particular, this model can help small companies adopt a smartwork system because it is lightweight and cost-effective compared to existing smartwork systems.
This paper presents the scenarios of the service models, the roles and incentives of the participants, and a comparative analysis. The superiority of the NFC-based smartwork service model is shown by comparing and analyzing the new service models against the existing service models. The service model can expand the range of enterprises and organizations that adopt smartwork and the range of employees who take advantage of smartwork.
https://doi.org/10.13088/jiis.2013.19.2.157

A Study on People Counting in Public Metro Service using Hybrid CNN-LSTM Algorithm (Hybrid CNN-LSTM 알고리즘을 활용한 도시철도 내 피플 카운팅 연구)

Choi, Ji-Hye;Kim, Min-Seung;Lee, Chan-Ho;Choi, Jung-Hwan;Lee, Jeong-Hee;Sung, Tae-Eung
- Journal of Intelligence and Information Systems, v.26 no.2, pp.131-145, 2020
In line with the trend of industrial innovation, IoT technology, used in a variety of fields, is emerging as a key element in the creation of new business models and the provision of user-friendly services through its combination with big data. Data accumulated from Internet-of-Things (IoT) devices is being used in many ways to build convenience-oriented smart systems, as it enables customized intelligent systems through analysis of user environments and patterns. Recently, it has been applied to innovation in the public domain, in smart cities and smart transportation, for example to address traffic and crime problems using CCTV. In particular, when planning underground services or establishing a passenger flow control information system to enhance the convenience of citizens and commuters in congested public transportation such as subways and urban railways, it is necessary to comprehensively consider both the ease of securing real-time service data and the stability of security. However, previous studies that use image data are limited by privacy issues and by degraded object detection performance under abnormal conditions. The IoT device-based sensor data used in this study is free from privacy issues because it does not require the identification of individuals, and it can be effectively utilized to build intelligent public services for many unspecified people. We use IoT-based infrared sensor devices for an intelligent pedestrian tracking system in the metro service, which many people use on a daily basis; the temperature data measured by the sensors are transmitted in real time.
The experimental environment for collecting real-time sensor data was set up at the equally spaced midpoints of a 4×4 grid on the upper part of the ceiling at subway entrances where actual passenger flow is high, and it measured the temperature change for objects entering and leaving the detection spots. The measured data were preprocessed by setting reference values for the 16 areas and calculating, per unit of time, the differences between the temperatures in the 16 areas and their reference values. This corresponds to a methodology that maximizes the movement captured within the detection area. In addition, the values were scaled by a factor of 10 in order to reflect differences in temperature by area more sensitively. For example, if the temperature collected from the sensor at a given time was 28.5℃, the analysis was conducted with the value changed to 285. The data collected from the sensors thus have the characteristics of both time series data and image data with 4×4 resolution. Reflecting these characteristics of the measured, preprocessed data, we propose a hybrid algorithm, CNN-LSTM (Convolutional Neural Network-Long Short Term Memory), that combines CNN, which performs well in image classification, with LSTM, which is especially suitable for analyzing time series data. In this study, the CNN-LSTM algorithm is used to predict the number of people passing through one of the 4×4 detection areas. We validated the proposed model by comparing its performance with other artificial intelligence algorithms: Multi-Layer Perceptron (MLP), Long Short Term Memory (LSTM), and RNN-LSTM (Recurrent Neural Network-Long Short Term Memory). In the experiment, the proposed CNN-LSTM hybrid model showed the best predictive performance compared to MLP, LSTM, and RNN-LSTM.
By utilizing the proposed devices and models, various metro services, such as real-time monitoring of public transport facilities and congestion-based emergency response services, are expected to be provided without legal issues concerning personal information. However, the data were collected from only one side of the entrances as the subject of analysis, and only data collected over a short period were used for prediction. A limitation therefore remains that applicability in other environments needs to be verified. In the future, the proposed model is expected to become more reliable if experimental data are sufficiently collected in various environments or if the training data are extended with measurements from other sensors.
https://doi.org/10.13088/jiis.2020.26.2.131
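The preprocessing described in the abstract above (a 4×4 temperature frame, per-area reference values, and scaling by 10 so that 28.5℃ becomes 285) might look roughly like the sketch below. The function name, the constant 25.0℃ reference, and the choice to scale before differencing are assumptions; the abstract does not specify the order of the two steps or how the reference values are chosen.

```python
def preprocess_frame(frame, reference, scale=10):
    """Scale a 4x4 temperature frame and subtract per-area reference values.

    Assumption: both the reading and the reference are scaled by `scale`
    (e.g. 28.5 -> 285) and rounded to integers before the difference is taken.
    """
    return [
        [round(t * scale) - round(r * scale) for t, r in zip(frame_row, ref_row)]
        for frame_row, ref_row in zip(frame, reference)
    ]

# Hypothetical values: a uniform 25.0 degree reference and one warm cell.
reference = [[25.0] * 4 for _ in range(4)]
frame = [row[:] for row in reference]
frame[1][2] = 28.5  # a passenger under sensor area (row 1, column 2)

diff = preprocess_frame(frame, reference)
for row in diff:
    print(row)
```

A sequence of such 4×4 difference frames over time is what gives the data both its image-like and its time-series character, which is the motivation the abstract gives for combining CNN and LSTM.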

Deep Learning-based Professional Image Interpretation Using Expertise Transplant (전문성 이식을 통한 딥러닝 기반 전문 이미지 해석 방법론)

Kim, Taejin;Kim, Namgyu
- Journal of Intelligence and Information Systems, v.26 no.2, pp.79-104, 2020
Recently, as deep learning has attracted attention, the use of deep learning is being considered as a method for solving problems in various fields. In particular, deep learning is known to have excellent performance when applied to applying unstructured data such as text, sound and images, and many studies have proven its effectiveness. Owing to the remarkable development of text and image deep learning technology, interests in image captioning technology and its application is rapidly increasing. Image captioning is a technique that automatically generates relevant captions for a given image by handling both image comprehension and text generation simultaneously. In spite of the high entry barrier of image captioning that analysts should be able to process both image and text data, image captioning has established itself as one of the key fields in the A.I. research owing to its various applicability. In addition, many researches have been conducted to improve the performance of image captioning in various aspects. Recent researches attempt to create advanced captions that can not only describe an image accurately, but also convey the information contained in the image more sophisticatedly. Despite many recent efforts to improve the performance of image captioning, it is difficult to find any researches to interpret images from the perspective of domain experts in each field not from the perspective of the general public. Even for the same image, the part of interests may differ according to the professional field of the person who has encountered the image. Moreover, the way of interpreting and expressing the image also differs according to the level of expertise. The public tends to recognize the image from a holistic and general perspective, that is, from the perspective of identifying the image's constituent objects and their relationships. 
In contrast, domain experts tend to recognize the image by focusing on the specific elements necessary to interpret it based on their expertise. This implies that the meaningful parts of an image differ depending on the viewer's perspective, even for the same image. Image captioning therefore needs to reflect this phenomenon. In this study, we propose a method to generate captions specialized for each domain by utilizing the expertise of experts in that domain. Specifically, after pre-training on a large amount of general data, the expertise of the field is transplanted through transfer learning with a small amount of expertise data. However, a naive application of transfer learning using expertise data may introduce another type of problem. Simultaneous learning with captions of various characteristics may cause a so-called 'inter-observation interference' problem, which makes it difficult to learn each characteristic point of view purely. When learning with a vast amount of data, most of this interference resolves itself and has little impact on the learning results. In contrast, in fine-tuning, where learning is performed on a small amount of data, the impact of such interference can be relatively large. To solve this problem, we propose a novel 'Character-Independent Transfer-learning' that performs transfer learning independently for each character. To confirm the feasibility of the proposed methodology, we performed experiments using the results of pre-training on the MSCOCO dataset, which comprises 120,000 images and about 600,000 general captions. Additionally, following the advice of an art therapist, about 300 pairs of 'image / expertise captions' were created, and these data were used for the expertise transplantation experiments. 
The experiments confirmed that the proposed methodology generates captions from the perspective of the implanted expertise, whereas captions generated through learning on general data contain much content irrelevant to expert interpretation. In this paper, we propose a novel approach to specialized image interpretation. To achieve this goal, we present a method that uses transfer learning to generate captions specialized for a specific domain. By applying the proposed methodology to expertise transplantation in various fields, we expect that much research will be actively conducted to address the lack of expertise data and to improve the performance of image captioning.
https://doi.org/10.13088/jiis.2020.26.2.079
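
The idea of performing transfer learning independently for each character, as described in the abstract above, can be illustrated with a minimal sketch. This is not the authors' implementation: a toy linear model stands in for the captioning network, and all names (`sgd_fit`, `character_independent_transfer`, `character_a`, `character_b`) and data values are hypothetical. It only shows the structure: pre-train once on general data, then fine-tune a separate copy on each character's small dataset so that the characters cannot interfere with one another.

```python
import copy

def sgd_fit(params, data, lr=0.3, epochs=500):
    """Fit y = w*x + b by gradient descent on (x, y) pairs.

    Works on a deep copy, so the caller's params are never modified.
    """
    p = copy.deepcopy(params)
    n = len(data)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in data:
            err = p["w"] * x + p["b"] - y
            grad_w += err * x
            grad_b += err
        p["w"] -= lr * grad_w / n
        p["b"] -= lr * grad_b / n
    return p

def character_independent_transfer(pretrained, expert_data_by_character):
    """Fine-tune one independent copy of the pretrained model per character."""
    return {
        name: sgd_fit(pretrained, data)
        for name, data in expert_data_by_character.items()
    }

# Assumption: "general" data plays the role of the large pre-training corpus.
general = [(x, 2.0 * x) for x in (0.0, 0.5, 1.0, 1.5, 2.0)]
pretrained = sgd_fit({"w": 0.0, "b": 0.0}, general)

# Assumption: each character has its own small expertise dataset.
expert = {
    "character_a": [(0.0, 1.0), (1.0, 3.0)],    # follows y = 2x + 1
    "character_b": [(0.0, -1.0), (1.0, 1.0)],   # follows y = 2x - 1
}
models = character_independent_transfer(pretrained, expert)
```

Training a single model on the union of `character_a` and `character_b` would pull the intercept toward a compromise between the two, which is a small-scale analogue of the interference the abstract describes; the per-character copies avoid this at the cost of storing one fine-tuned model per character.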

A Study on the Yousang-Dae Goksuro(Curve-Waterway) in Gangneung, Yungok-Myun, Yoodung Ri (강릉 연곡면 유등리 '유상대(流觴臺)' 곡수로(曲水路)의 조명(照明))

Rho, Jae-Hyun;Shin, Sang-Sup;Lee, Jung-Han;Huh, Jun;Park, Joo-Sung
- Journal of the Korean Institute of Traditional Landscape Architecture / v.30 no.1 / pp.14-21 / 2012
The objects of this study, the 'Yousang-Dae(流觴臺)' and Go board engravings on the flat rock in Baemgol, Yoodung-ri, Yungok-myun, Gangneung-si, reveal that the place was used for enjoying arts such as Yusang Goksu and Taoist hermits' games. Three detailed reconnaissance surveys yielded the following results. There is a text, Manwolsan(滿月山) Baegundongcheon(白雲洞天), engraved on the rock in Baegunsa(白雲寺), which was built by Doun in the first year of King Hungang (875) of United Shilla, fell into ruin in the middle of the Joseon period, and was rebuilt in 1954. The text is invaluable evidence that the tradition of Taoist hermit and Sunbee (classical scholar) culture took root in Baemgol Valley. The second volume of Donghoseungram(東湖勝覽), the chronicle of Gangneung published by Choi Baeksoon in 1934, records that 'Baegunsa in Namjeonhyeon is the classroom where famous teachers like Yulgok Lee Yi or Seongje Choi Ok were teaching', which verifies the historic character of the place. In addition, the management of Nujeong(樓亭) and Dongcheon can be traced through Baegunjeong(白雲亭), which according to Donghoseungram and the completed version of Jeungboyimyoungji(增補臨瀛誌) was constructed by Kim Yoonkyung(金潤卿) in the Muo year, the 9th year of Cheoljong (1858). Also, Baegunjeongdongcheon(白雲亭洞天), the text engraved on the standing stone across the stream from the Yousang-Dae stone, was created 3 years after the construction of Baegunjeong, in the 12th year of Cheoljong (1861), and serves as a symbolic sign closely related to Yousang-Dae. On this basis, a careful study of the remains of the 'Yousang-Dae' Goksuro revealed, in the mountain stream and rocks around Yousang-Dae, the Sebun-seok(細分石), which controlled the amount and speed of the flowing water, and the remains of the furrows of the Keumbae-seok(擒盃石) and Yubae-gong(留盃孔), which held the water stream carrying the cups. 
In addition, the names of 21 people engraved under the inscription 'Oh-Seong(午星)' were discovered on the bottom of the rock, which clearly confirms that the place was one of the main cultural footholds for enjoying arts in the style of Yu-Sang-Gok-Su-Yeon(流觴曲水宴) until the middle of the 20th century. It implies that the Sunbees' culture of enjoying the arts was handed down, centered on Yousang-Dae, in this particular place until the middle of the 20th century. The place deserves in-depth study because it is a historic and unique cultural site where 'Confucianism, Buddhism, and Zen' were combined. Based on the results of this study, the identities of the 23 people, as well as of the writer of the Yousang-Dae text, should be studied carefully in terms of the characteristics of the place by gathering data about the appreciation of arts such as Yusanggoksu. Likewise, efforts should be made to discover the chess board engraved on the rock that is described in the documents, and plans should be considered to restore the original shape of the place, for example by breaking up the cement pavement of the road, carrying out additional excavation, changing the existing route, and so forth.

Surgical Treatment for Isolated Aortic Endocarditis: a Comparison with Isolated Mitral Endocarditis (대동맥 판막만을 침범한 감염성 심내막염의 수술적 치료: 승모판막만을 침범한 경우와 비교 연구)

Hong, Seong-Beom;Park, Jeong-Min;Lee, Kyo-Seon;Ryu, Sang-Woo;Yun, Ju-Sik;CheKar, Jay-Key;Yun, Chi-Hyeong;Kim, Sang-Hyung;Ahn, Byoung-Hee
- Journal of Chest Surgery / v.40 no.9 / pp.600-606 / 2007
Background: Infective endocarditis shows high surgical mortality and morbidity rates, especially for aortic endocarditis. This study investigates the clinical characteristics and operative results of isolated aortic endocarditis. Material and Method: From July 1990 to May 2005, 25 patients with isolated aortic endocarditis (Group I, male:female = 18:7, mean age $43.2 \pm 18.6$ years) and 23 patients with isolated mitral endocarditis (Group II, male:female = 10:13, mean age $43.2 \pm 17.1$ years) underwent surgical treatment in our hospital. All the patients had native endocarditis and 7 patients showed a bicuspid aortic valve in Group I. Two patients had prosthetic valve endocarditis and one patient developed mitral endocarditis after a mitral valvuloplasty in Group II. Positive blood cultures were obtained from 11 (44.0%) patients in Group I and 10 (43.3%) patients in Group II. The pre-operative left ventricular ejection fraction for each group was $60.8 \pm 8.7\%$ and $62.1 \pm 8.1\%$ (p=0.945), respectively. There was moderate to severe aortic regurgitation in 18 patients and vegetations were detected in 17 patients in Group I. There was moderate to severe mitral regurgitation in 19 patients and vegetations were found in 18 patients in Group II. One patient had a ventricular septal defect and another patient underwent a Maze operation with microwaves due to atrial fibrillation. We performed echocardiography before discharge and each year during follow-up. The mean follow-up period was $37.2 \pm 23.5$ (range 9 to 123) months. Result: Postoperative complications included three cases of low cardiac output in Group I and one case each of re-surgery because of bleeding and low cardiac output in Group II. One patient died from an intra-cranial hemorrhage on the first day after surgery in Group I, but there were no early deaths in Group II. 
The 1-, 3-, and 5-year valve-related event-free rates were 92.0%, 88.0%, and 88.0% for Group I patients, and 91.3%, 76.0%, and 76.0% for Group II patients, respectively. The 1-, 3-, and 5-year survival rates were 96.0%, 96.0%, and 96.0% for Group I patients, and 100%, 84.9%, and 84.9% for Group II patients, respectively. Conclusion: Acceptable surgical results and mid-term clinical results for aortic endocarditis were seen.
