• Title/Summary/Keyword: Training Model

Search Result 4,400, Processing Time 0.034 seconds

A Study on Factors Influencing The State of Adaptation of The Hemiplegic Patients (편마비 환자의 퇴원후 적응상태와 관련요인에 대한 분석적 연구)

  • 서문자
    • Journal of Korean Academy of Nursing
    • /
    • v.20 no.1
    • /
    • pp.88-117
    • /
    • 1990
  • The purposes of this study are to delineate a profile of the state of a stroke patient's adaptation at 3 months after hospitalization and to explore the relationship between the level of adaptation and the variables which influence the adaptation of hemiplegic patients. To these ends, theoretical framework was derived basically from the stress adaptation model. The basic assumption underlying the level of adaptation is influenced by the presenting focal, contextual and residual stimuli. This group of stimuli is further operationalized and represented by a perception of stress. which is the perceived effect of the disability and by the mediating variables such as sociodemographic factors as an external conditioning variables and perceived social support and hardiness personality characteristics as an internal intervening variables. The dependent varibales in this study is the level of physical, psychological and social adaptation and is hypothesized to be a function of the interaction between 3 sets of variables namely, the perceived disability effect, external conditioning variables and internal intevening varibles. A total of fourty three subjects from 3 general hospitals in Seoul were observed and interviewed with the aid of 7 structured instruments. The data were collected twice on each subject : first at the pre-discharge period arid at 3 months post-discharge from hospital for the second time. The study was carried out for the period from February to August, 1988. The instruments used for the study include 4 existing scales and 3 scales developed by the researcher for this study. They are : 1) The ADL dependency scale and the scale of the clinical physical functions for the assessment of physical adaptation. 2) the SDS(self report of depression) to measure the level of psychological adaptation. 3) The scale for the amount of social activities for the measurement of the level of social adaptation. 4) The scale for the perceived effect of disability for the measurement of the focal stimuli. 5) The health related hardiness scale and the perceived interpersonal support self evaluation list(ISEL) for the measurement of the hardiness personality character and the perceived social support. The data obtained were analyzed using percentage, oneway ANOVA, Pearson coefficients correlation and stepwise multiple regression. The findings provide valuable information about the present level of physical adaptation at 3 months after discharge. The patient revealed a decreased ADL dependency and lowered limitation of physical function as compared with pre - discharge state. Psycholcgically, the average degree of depression at follow up was within normal range of depression. Socially, the amount of social activities was very low. The one way ANOVA and the correlational analysis revealed the relationship between the 3 sets of variables and the adaptation level as follows : 1) The perceived disability effect was related to the degree of the depression and the amount of social activities but was not related to the physical adaptation. 2) Among the sociodemographic variables, sex and education were related to the difference of ADL dependency and the change of physical function. These factors indicate that women more than men and educated more than the less educated were found more independent. The education was also related to the degree of depression suggesting that the higher the educational level, the more well adapted the patients were both physically and psychologically. Age, marital status and job state were not found to be related to the patient's adaptation level. 3) Among the internal intervening variables, the health related hardiness characteristic was related to the differences of ADL dependency, physical functions and the social activities, indicating that the higher the hardiness character the higher the level of physical and social adaptation. 4) The perceived social support, another internal intervening variable, was related to the degree of depression and the social activities. This data suggest that the higher the perception of social support, the better adapted the patients were psychogically and socially. In summarizing the results of the correlational analysis, the level of physical adaptation was influenced by sex, the years of education and the hardiness character. The level of psychological adaptation was influenced by the years of education, the perceived disability effect and the perceived social support. And the level of social adaptation was influenced by the perceived disability effect, the hardiness character and the perceived social support. The stepwise multiple regression analysis shows findings as follows : 1) The most important factor to explain the difference of ADL dependency was sex, indicating females were more independent than males. 2) The most important factor to explain the difference of physical function and the degree of depression was the patient's education level. 3) The strongest explaining factor for the amount of social activities was perceived self esteem(one of the subconcepts of perceived social support). Thus the most important factors influencing the level of adaptation were found to be sex, education, the hardiness character and self esteem. From the above findings, the significance of this study can be delineated as follows : 1) Corroboration of the assumed relationship between the various variables and the adaptation level as suggested in the conceptual model. 2) Support for the feasibility of the cognitive approach for nursing intervention such as hardness character training, counselling and teaching for self-care in the chronic patients.

  • PDF

Knowledge Extraction Methodology and Framework from Wikipedia Articles for Construction of Knowledge-Base (지식베이스 구축을 위한 한국어 위키피디아의 학습 기반 지식추출 방법론 및 플랫폼 연구)

  • Kim, JaeHun;Lee, Myungjin
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.43-61
    • /
    • 2019
  • Development of technologies in artificial intelligence has been rapidly increasing with the Fourth Industrial Revolution, and researches related to AI have been actively conducted in a variety of fields such as autonomous vehicles, natural language processing, and robotics. These researches have been focused on solving cognitive problems such as learning and problem solving related to human intelligence from the 1950s. The field of artificial intelligence has achieved more technological advance than ever, due to recent interest in technology and research on various algorithms. The knowledge-based system is a sub-domain of artificial intelligence, and it aims to enable artificial intelligence agents to make decisions by using machine-readable and processible knowledge constructed from complex and informal human knowledge and rules in various fields. A knowledge base is used to optimize information collection, organization, and retrieval, and recently it is used with statistical artificial intelligence such as machine learning. Recently, the purpose of the knowledge base is to express, publish, and share knowledge on the web by describing and connecting web resources such as pages and data. These knowledge bases are used for intelligent processing in various fields of artificial intelligence such as question answering system of the smart speaker. However, building a useful knowledge base is a time-consuming task and still requires a lot of effort of the experts. In recent years, many kinds of research and technologies of knowledge based artificial intelligence use DBpedia that is one of the biggest knowledge base aiming to extract structured content from the various information of Wikipedia. DBpedia contains various information extracted from Wikipedia such as a title, categories, and links, but the most useful knowledge is from infobox of Wikipedia that presents a summary of some unifying aspect created by users. These knowledge are created by the mapping rule between infobox structures and DBpedia ontology schema defined in DBpedia Extraction Framework. In this way, DBpedia can expect high reliability in terms of accuracy of knowledge by using the method of generating knowledge from semi-structured infobox data created by users. However, since only about 50% of all wiki pages contain infobox in Korean Wikipedia, DBpedia has limitations in term of knowledge scalability. This paper proposes a method to extract knowledge from text documents according to the ontology schema using machine learning. In order to demonstrate the appropriateness of this method, we explain a knowledge extraction model according to the DBpedia ontology schema by learning Wikipedia infoboxes. Our knowledge extraction model consists of three steps, document classification as ontology classes, proper sentence classification to extract triples, and value selection and transformation into RDF triple structure. The structure of Wikipedia infobox are defined as infobox templates that provide standardized information across related articles, and DBpedia ontology schema can be mapped these infobox templates. Based on these mapping relations, we classify the input document according to infobox categories which means ontology classes. After determining the classification of the input document, we classify the appropriate sentence according to attributes belonging to the classification. Finally, we extract knowledge from sentences that are classified as appropriate, and we convert knowledge into a form of triples. In order to train models, we generated training data set from Wikipedia dump using a method to add BIO tags to sentences, so we trained about 200 classes and about 2,500 relations for extracting knowledge. Furthermore, we evaluated comparative experiments of CRF and Bi-LSTM-CRF for the knowledge extraction process. Through this proposed process, it is possible to utilize structured knowledge by extracting knowledge according to the ontology schema from text documents. In addition, this methodology can significantly reduce the effort of the experts to construct instances according to the ontology schema.

Disaster Risk Assessment using QRE Assessment Tool in Disaster Cases in Seoul Metropolitan (서울시 재난 사례 QRE 평가도구를 활용한 재난 위험도 평가)

  • Kim, Yong Moon;Lee, Tae Shik
    • Journal of Korean Society of Disaster and Security
    • /
    • v.12 no.1
    • /
    • pp.11-21
    • /
    • 2019
  • This study assessed the risk of disaster by using QRE(Quick Risk Estimation - UNISDR Roll Model City of Basic Evaluation Tool) tools for three natural disasters and sixteen social disasters managed by the Seoul Metropolitan Government. The criteria for selecting 19 disaster types in Seoul are limited to disasters that occur frequently in the past and cause a lot of damage to people and property if they occur. We also considered disasters that are likely to occur in the future. According to the results of the QRE tools for disaster type in Seoul, the most dangerous type of disaster among the Seoul city disasters was "suicide accident" and "deterioration of air quality". Suicide risk is high and it is not easy to take measures against the economic and psychological problems of suicide. This corresponds to the Risk ratings(Likelihood ranking score & Severity rating) "M6". In contrast, disaster types with low risk during the disaster managed by the city of Seoul were analyzed as flooding, water leakage, and water pollution accidents. In the case of floods, there is a high likelihood of disaster such as localized heavy rains and typhoons. However, the city of Seoul has established a comprehensive plan to reduce floods and water every five years. This aspect is considered to be appropriate for disaster prevention preparedness and relatively low disaster risk was analyzed. This corresponds to the disaster Risk ratings(Likelihood ranking score & Severity rating) "VL1". Finally, the QRE tool provides the city's leaders and disaster managers with a quick reference to the risk of a disaster so that decisions can be made faster. In addition, the risk assessment using the QRE tool has helped many aspects such as systematic evaluation of resilience against the city's safety risks, basic data on future investment plans, and disaster response.

Topic Modeling Insomnia Social Media Corpus using BERTopic and Building Automatic Deep Learning Classification Model (BERTopic을 활용한 불면증 소셜 데이터 토픽 모델링 및 불면증 경향 문헌 딥러닝 자동분류 모델 구축)

  • Ko, Young Soo;Lee, Soobin;Cha, Minjung;Kim, Seongdeok;Lee, Juhee;Han, Ji Yeong;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.2
    • /
    • pp.111-129
    • /
    • 2022
  • Insomnia is a chronic disease in modern society, with the number of new patients increasing by more than 20% in the last 5 years. Insomnia is a serious disease that requires diagnosis and treatment because the individual and social problems that occur when there is a lack of sleep are serious and the triggers of insomnia are complex. This study collected 5,699 data from 'insomnia', a community on 'Reddit', a social media that freely expresses opinions. Based on the International Classification of Sleep Disorders ICSD-3 standard and the guidelines with the help of experts, the insomnia corpus was constructed by tagging them as insomnia tendency documents and non-insomnia tendency documents. Five deep learning language models (BERT, RoBERTa, ALBERT, ELECTRA, XLNet) were trained using the constructed insomnia corpus as training data. As a result of performance evaluation, RoBERTa showed the highest performance with an accuracy of 81.33%. In order to in-depth analysis of insomnia social data, topic modeling was performed using the newly emerged BERTopic method by supplementing the weaknesses of LDA, which is widely used in the past. As a result of the analysis, 8 subject groups ('Negative emotions', 'Advice and help and gratitude', 'Insomnia-related diseases', 'Sleeping pills', 'Exercise and eating habits', 'Physical characteristics', 'Activity characteristics', 'Environmental characteristics') could be confirmed. Users expressed negative emotions and sought help and advice from the Reddit insomnia community. In addition, they mentioned diseases related to insomnia, shared discourse on the use of sleeping pills, and expressed interest in exercise and eating habits. As insomnia-related characteristics, we found physical characteristics such as breathing, pregnancy, and heart, active characteristics such as zombies, hypnic jerk, and groggy, and environmental characteristics such as sunlight, blankets, temperature, and naps.

An Artificial Intelligence Approach to Waterbody Detection of the Agricultural Reservoirs in South Korea Using Sentinel-1 SAR Images (Sentinel-1 SAR 영상과 AI 기법을 이용한 국내 중소규모 농업저수지의 수표면적 산출)

  • Choi, Soyeon;Youn, Youjeong;Kang, Jonggu;Park, Ganghyun;Kim, Geunah;Lee, Seulchan;Choi, Minha;Jeong, Hagyu;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.5_3
    • /
    • pp.925-938
    • /
    • 2022
  • Agricultural reservoirs are an important water resource nationwide and vulnerable to abnormal climate effects such as drought caused by climate change. Therefore, it is required enhanced management for appropriate operation. Although water-level tracking is necessary through continuous monitoring, it is challenging to measure and observe on-site due to practical problems. This study presents an objective comparison between multiple AI models for water-body extraction using radar images that have the advantages of wide coverage, and frequent revisit time. The proposed methods in this study used Sentinel-1 Synthetic Aperture Radar (SAR) images, and unlike common methods of water extraction based on optical images, they are suitable for long-term monitoring because they are less affected by the weather conditions. We built four AI models such as Support Vector Machine (SVM), Random Forest (RF), Artificial Neural Network (ANN), and Automated Machine Learning (AutoML) using drone images, sentinel-1 SAR and DSM data. There are total of 22 reservoirs of less than 1 million tons for the study, including small and medium-sized reservoirs with an effective storage capacity of less than 300,000 tons. 45 images from 22 reservoirs were used for model training and verification, and the results show that the AutoML model was 0.01 to 0.03 better in the water Intersection over Union (IoU) than the other three models, with Accuracy=0.92 and mIoU=0.81 in a test. As the result, AutoML performed as well as the classical machine learning methods and it is expected that the applicability of the water-body extraction technique by AutoML to monitor reservoirs automatically.

A Convergent and Combined Activation Plan for Exercise Rehabilitation in the Era of the Fourth Industrial Revolution (4차 산업혁명시대에 운동재활분야의 융·복합적 활성화 방안)

  • Cho, Kyoung-Hwan
    • Journal of Korea Entertainment Industry Association
    • /
    • v.14 no.8
    • /
    • pp.407-426
    • /
    • 2020
  • The purpose of this study was to make convergent and combined analysis of the sport industry and exercise rehabilitation in the era of New Normal based on the Fourth Industrial Revolution and devise a comprehensive plan for future activation. For this purpose, literature review was performed mainly by analyzing the environment of the sport industry in the New Normal era based on the Fourth Industrial Revolution and by carrying out convergent and combined analysis of the sport industry to present a convergent and combined activation plan for exercise rehabilitation comprehensively as follows: First, it is necessary to make a strategy of promoting exercise rehabilitation in convergent and combined ways at the sport industry level. This means development of a convergent and combined exercise rehabilitation-tourism-ICT model as well as a convergent and combined exercise rehabilitation-ICT model through collaboration among ministries, including those of health and sports. Second, it is necessary to convert into a convergent and combined way of thinking and extend and reinforce educational competitiveness in the area of exercise rehabilitation. That is, it is necessary to refine the education and training systems for reinforcing personal ICT competence of exercise rehabilitation majors and relevant ones and provide convergent and combined business commencement education. Third, it is necessary to make different types of research and development by applying practical, convergent and combined skills based on the industrial field to exercise rehabilitation and relevant areas. Efforts should be made to overcome any risk in the era of New Normal and support business commencement with convergent and combined skills for exercise rehabilitation. Fourth, it is necessary to make mid- and long-term clusters where exercise rehabilitation and relevant businesses can be accumulated. This means building an industrial hub and complex for exercise rehabilitation and requires making an R&D-based cluster with industrial-academic-governmental collaboration, maximizing the synergy effects with local infrastructures, and fulfilling the function of realizing a spontaneous profit-generating structure.

Awareness of Pre-Service Elementary Teachers' on Science Teaching-Learning Lesson Plan (초등예비교사의 과학과 교수·학습 과정안 작성에 대한 인식)

  • Yong-Seob, Lee;Sun-Sik, Kim
    • Journal of the Korean Society of Earth Science Education
    • /
    • v.15 no.3
    • /
    • pp.335-344
    • /
    • 2022
  • This study was conducted for 4 weeks on the preparation of the science teaching/learning course plan for 109 students in 4 classes of the 2nd year intensive course at B University of Education. Pre-service elementary teachers attended a two-week field training practice after listening to a lecture on how to write a science teaching and learning course plan. Pre-service elementary teachers tried to find out about the selection of materials and the degree of connection between the course plan and the class to prepare the science teaching/learning course plan. The researcher completed the questionnaire by reviewing and deliberation on the questionnaire questions together with 4 pre-service elementary teachers. The questionnaire related to the writing of the science teaching and learning course plan consists of 8 questions. Preferred reference materials when writing the course plan, the level of interest in learning, the success or failure of the science course plan and class, the science preferred model, the evaluation method in unit time, and the science teaching and learning One's own efforts to write the course plan, the contents of this course are the science faculty. It is composed of the preparation of the learning process plan and how helpful it is to the class. The results of this study are as follows. First, it was found that elementary school pre-service elementary teachers preferred teacher guidance the most when drafting science teaching and learning curriculum plans. Second, it is recognized that the development stage is very important in the teaching and learning stage of the science department. Third, Pre-service elementary teachers believe that the science and teaching and learning process plan has a high correlation with the success of the class. Fourth, it was said that the student's level, the teacher's ability, and the appropriate lesson plan had the most influence on the class. Fifth, it was found that pre-service elementary teachers prefer the inquiry learning class model. Sixth, it was found that reports and activity papers were preferred for evaluation in 40-minute classes. Seventh, it was stated that the teaching and learning process plan is highly related to the class, so it will be studied and studied diligently. Eighth, the method of writing a science teaching and learning course plan based on the instructional design principle is interpreted as very beneficial.

Study on the Short-Term Hemodynamic Effects of Experimental Cardiomyoplasty in Heart Failure Model (심부전 모델에서 실험적 심근성형술의 단기 혈역학적 효과에 관한 연구)

  • Jeong, Yoon-Seop;Youm, Wook;Lee, Chang-Ha;Kim, Wook-Seong;Lee, Young-Tak;Kim, Won-Gon
    • Journal of Chest Surgery
    • /
    • v.32 no.3
    • /
    • pp.224-236
    • /
    • 1999
  • Background: To evaluate the short-term effect of dynamic cardiomyoplasty on circulatory function and detect the related factors that can affect it, experimental cardiomyoplasties were performed under the state of normal cardiac function and heart failure. Material and Method: A total of 10 mongrel dogs weighing 20 to 30kg were divided arbitrarily into two groups. Five dogs of group A underwent cardiomyoplasty with latissimus dorsi(LD) muscle mobilization followed by a 2-week vascular delay and 6-week muscle training. Then, hemodynamic studies were conducted. In group B, doxorubicin was given to 5 dogs in an IV dose of 1 mg/kg once a week for 8 weeks to induce chronic heart failure, and simultaneous muscle training was given for preconditioning during this period. Then, cardiomyoplasties were performed and hemodynamic studies were conducted immediately after these cardiomyoplasties in group B. Result: In group A, under the state of normal cardiac function, only mean right atrial pressure significantly increased with the pacer-on(p<0.05) and the left ventricular hemodynamic parameters did not change significantly. However, with pacer-on in group B, cardiac output(CO), rate of left ventricular pressure development(dp/dt), stroke volume(SV), and left ventricular stroke work(SW) increased by 16.7${\pm}$7.2%, 9.3${\pm}$3.2%, 16.8${\pm}$8.6%, and 23.1${\pm}$9.7%, respectively, whereas left ventricular end-diastole pressure(LVEDP) and mean pulmonary capillary wedge pressure(mPCWP) decreased by 32.1${\pm}$4.6% and 17.7${\pm}$9.1%, respectively(p<0.05). In group A, imipramine was infused at the rate of 7.5mg/kg/hour for 34${\pm}$2.6 minutes to induce acute heart failure, which resulted in the reduction of cardiac output by 17.5${\pm}$2.7%, systolic left ventricular pressure by 15.8${\pm}$2.5% and the elevation of left ventricular end-diastole pressure by 54.3${\pm}$15.2%(p<0.05). With pacer-on under this state of acute heart failu e, CO, dp/dt, SV, and SW increased by 4.5${\pm}$1.8% and 3.1${\pm}$1.1%, 5.7${\pm}$3.6%, and 6.9${\pm}$4.4%, respectively, whereas LVEDP decreased by 11.7${\pm}$4.7%(p<0.05). Comparing CO, dp/dt, SV, SW and LVEDP that changed significantly with pacer-on, both under the state of acute and chronic heart failure, augmentation widths of these left ventricular hemodynamic parameters were significantly larger under the state of chronic heart failure(group B) than acute heart failure(group A)(p<0.05). On gross inspection, variable degrees of adhesion and inflammation were present in all 5 dogs of group A, including 2 dogs that showed no muscle contraction. No adhesion and inflammation were, however, present in all 5 dogs of group B, which showed vivid muscle contractions. Considering these differences in gross findings along with the following premise that the acute heart failure state was not statistically different from the chronic one in terms of left ventricular parameters(p>0.05), the larger augmentation effect seen in group B is presumed to be mainly attributed to the viability and contractility of the LD muscle. Conclusion: These results indicate that the positive circulatory augmentation effect of cardiomyoplasty is apparent only under the state of heart failure and the preservation of muscle contractility is important to maximize this effect.

  • PDF

Analysis of Success Cases of InsurTech and Digital Insurance Platform Based on Artificial Intelligence Technologies: Focused on Ping An Insurance Group Ltd. in China (인공지능 기술 기반 인슈어테크와 디지털보험플랫폼 성공사례 분석: 중국 평안보험그룹을 중심으로)

  • Lee, JaeWon;Oh, SangJin
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.3
    • /
    • pp.71-90
    • /
    • 2020
  • Recently, the global insurance industry is rapidly developing digital transformation through the use of artificial intelligence technologies such as machine learning, natural language processing, and deep learning. As a result, more and more foreign insurers have achieved the success of artificial intelligence technology-based InsurTech and platform business, and Ping An Insurance Group Ltd., China's largest private company, is leading China's global fourth industrial revolution with remarkable achievements in InsurTech and Digital Platform as a result of its constant innovation, using 'finance and technology' and 'finance and ecosystem' as keywords for companies. In response, this study analyzed the InsurTech and platform business activities of Ping An Insurance Group Ltd. through the ser-M analysis model to provide strategic implications for revitalizing AI technology-based businesses of domestic insurers. The ser-M analysis model has been studied so that the vision and leadership of the CEO, the historical environment of the enterprise, the utilization of various resources, and the unique mechanism relationships can be interpreted in an integrated manner as a frame that can be interpreted in terms of the subject, environment, resource and mechanism. As a result of the case analysis, Ping An Insurance Group Ltd. has achieved cost reduction and customer service development by digitally innovating its entire business area such as sales, underwriting, claims, and loan service by utilizing core artificial intelligence technologies such as facial, voice, and facial expression recognition. In addition, "online data in China" and "the vast offline data and insights accumulated by the company" were combined with new technologies such as artificial intelligence and big data analysis to build a digital platform that integrates financial services and digital service businesses. Ping An Insurance Group Ltd. challenged constant innovation, and as of 2019, sales reached $155 billion, ranking seventh among all companies in the Global 2000 rankings selected by Forbes Magazine. Analyzing the background of the success of Ping An Insurance Group Ltd. from the perspective of ser-M, founder Mammingz quickly captured the development of digital technology, market competition and changes in population structure in the era of the fourth industrial revolution, and established a new vision and displayed an agile leadership of digital technology-focused. Based on the strong leadership led by the founder in response to environmental changes, the company has successfully led InsurTech and Platform Business through innovation of internal resources such as investment in artificial intelligence technology, securing excellent professionals, and strengthening big data capabilities, combining external absorption capabilities, and strategic alliances among various industries. Through this success story analysis of Ping An Insurance Group Ltd., the following implications can be given to domestic insurance companies that are preparing for digital transformation. First, CEOs of domestic companies also need to recognize the paradigm shift in industry due to the change in digital technology and quickly arm themselves with digital technology-oriented leadership to spearhead the digital transformation of enterprises. Second, the Korean government should urgently overhaul related laws and systems to further promote the use of data between different industries and provide drastic support such as deregulation, tax benefits and platform provision to help the domestic insurance industry secure global competitiveness. Third, Korean companies also need to make bolder investments in the development of artificial intelligence technology so that systematic securing of internal and external data, training of technical personnel, and patent applications can be expanded, and digital platforms should be quickly established so that diverse customer experiences can be integrated through learned artificial intelligence technology. Finally, since there may be limitations to generalization through a single case of an overseas insurance company, I hope that in the future, more extensive research will be conducted on various management strategies related to artificial intelligence technology by analyzing cases of multiple industries or multiple companies or conducting empirical research.

KNU Korean Sentiment Lexicon: Bi-LSTM-based Method for Building a Korean Sentiment Lexicon (Bi-LSTM 기반의 한국어 감성사전 구축 방안)

  • Park, Sang-Min;Na, Chul-Won;Choi, Min-Seong;Lee, Da-Hee;On, Byung-Won
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.4
    • /
    • pp.219-240
    • /
    • 2018
  • Sentiment analysis, which is one of the text mining techniques, is a method for extracting subjective content embedded in text documents. Recently, the sentiment analysis methods have been widely used in many fields. As good examples, data-driven surveys are based on analyzing the subjectivity of text data posted by users and market researches are conducted by analyzing users' review posts to quantify users' reputation on a target product. The basic method of sentiment analysis is to use sentiment dictionary (or lexicon), a list of sentiment vocabularies with positive, neutral, or negative semantics. In general, the meaning of many sentiment words is likely to be different across domains. For example, a sentiment word, 'sad' indicates negative meaning in many fields but a movie. In order to perform accurate sentiment analysis, we need to build the sentiment dictionary for a given domain. However, such a method of building the sentiment lexicon is time-consuming and various sentiment vocabularies are not included without the use of general-purpose sentiment lexicon. In order to address this problem, several studies have been carried out to construct the sentiment lexicon suitable for a specific domain based on 'OPEN HANGUL' and 'SentiWordNet', which are general-purpose sentiment lexicons. However, OPEN HANGUL is no longer being serviced and SentiWordNet does not work well because of language difference in the process of converting Korean word into English word. There are restrictions on the use of such general-purpose sentiment lexicons as seed data for building the sentiment lexicon for a specific domain. In this article, we construct 'KNU Korean Sentiment Lexicon (KNU-KSL)', a new general-purpose Korean sentiment dictionary that is more advanced than existing general-purpose lexicons. The proposed dictionary, which is a list of domain-independent sentiment words such as 'thank you', 'worthy', and 'impressed', is built to quickly construct the sentiment dictionary for a target domain. Especially, it constructs sentiment vocabularies by analyzing the glosses contained in Standard Korean Language Dictionary (SKLD) by the following procedures: First, we propose a sentiment classification model based on Bidirectional Long Short-Term Memory (Bi-LSTM). Second, the proposed deep learning model automatically classifies each of glosses to either positive or negative meaning. Third, positive words and phrases are extracted from the glosses classified as positive meaning, while negative words and phrases are extracted from the glosses classified as negative meaning. Our experimental results show that the average accuracy of the proposed sentiment classification model is up to 89.45%. In addition, the sentiment dictionary is more extended using various external sources including SentiWordNet, SenticNet, Emotional Verbs, and Sentiment Lexicon 0603. Furthermore, we add sentiment information about frequently used coined words and emoticons that are used mainly on the Web. The KNU-KSL contains a total of 14,843 sentiment vocabularies, each of which is one of 1-grams, 2-grams, phrases, and sentence patterns. Unlike existing sentiment dictionaries, it is composed of words that are not affected by particular domains. The recent trend on sentiment analysis is to use deep learning technique without sentiment dictionaries. The importance of developing sentiment dictionaries is declined gradually. However, one of recent studies shows that the words in the sentiment dictionary can be used as features of deep learning models, resulting in the sentiment analysis performed with higher accuracy (Teng, Z., 2016). This result indicates that the sentiment dictionary is used not only for sentiment analysis but also as features of deep learning models for improving accuracy. The proposed dictionary can be used as a basic data for constructing the sentiment lexicon of a particular domain and as features of deep learning models. It is also useful to automatically and quickly build large training sets for deep learning models.