A Study of 'Emotion Trigger' by Text Mining Techniques (텍스트 마이닝을 이용한 감정 유발 요인 'Emotion Trigger'에 관한 연구)
-
- Journal of Intelligence and Information Systems
- /
- v.21 no.2
- /
- pp.69-92
- /
- 2015
The explosion of social media data has led to apply text-mining techniques to analyze big social media data in a more rigorous manner. Even if social media text analysis algorithms were improved, previous approaches to social media text analysis have some limitations. In the field of sentiment analysis of social media written in Korean, there are two typical approaches. One is the linguistic approach using machine learning, which is the most common approach. Some studies have been conducted by adding grammatical factors to feature sets for training classification model. The other approach adopts the semantic analysis method to sentiment analysis, but this approach is mainly applied to English texts. To overcome these limitations, this study applies the Word2Vec algorithm which is an extension of the neural network algorithms to deal with more extensive semantic features that were underestimated in existing sentiment analysis. The result from adopting the Word2Vec algorithm is compared to the result from co-occurrence analysis to identify the difference between two approaches. The results show that the distribution related word extracted by Word2Vec algorithm in that the words represent some emotion about the keyword used are three times more than extracted by co-occurrence analysis. The reason of the difference between two results comes from Word2Vec's semantic features vectorization. Therefore, it is possible to say that Word2Vec algorithm is able to catch the hidden related words which have not been found in traditional analysis. In addition, Part Of Speech (POS) tagging for Korean is used to detect adjective as "emotional word" in Korean. In addition, the emotion words extracted from the text are converted into word vector by the Word2Vec algorithm to find related words. Among these related words, noun words are selected because each word of them would have causal relationship with "emotional word" in the sentence. The process of extracting these trigger factor of emotional word is named "Emotion Trigger" in this study. As a case study, the datasets used in the study are collected by searching using three keywords: professor, prosecutor, and doctor in that these keywords contain rich public emotion and opinion. Advanced data collecting was conducted to select secondary keywords for data gathering. The secondary keywords for each keyword used to gather the data to be used in actual analysis are followed: Professor (sexual assault, misappropriation of research money, recruitment irregularities, polifessor), Doctor (Shin hae-chul sky hospital, drinking and plastic surgery, rebate) Prosecutor (lewd behavior, sponsor). The size of the text data is about to 100,000(Professor: 25720, Doctor: 35110, Prosecutor: 43225) and the data are gathered from news, blog, and twitter to reflect various level of public emotion into text data analysis. As a visualization method, Gephi (http://gephi.github.io) was used and every program used in text processing and analysis are java coding. The contributions of this study are as follows: First, different approaches for sentiment analysis are integrated to overcome the limitations of existing approaches. Secondly, finding Emotion Trigger can detect the hidden connections to public emotion which existing method cannot detect. Finally, the approach used in this study could be generalized regardless of types of text data. The limitation of this study is that it is hard to say the word extracted by Emotion Trigger processing has significantly causal relationship with emotional word in a sentence. The future study will be conducted to clarify the causal relationship between emotional words and the words extracted by Emotion Trigger by comparing with the relationships manually tagged. Furthermore, the text data used in Emotion Trigger are twitter, so the data have a number of distinct features which we did not deal with in this study. These features will be considered in further study.
Purpose : At this time, the sentinel lymph node mapping using radioisotope and blue dye is preceded for breast cancer patient's sentinel lymph node biopsy. But all patients were applied the same protocol without consideration of physical specific character like the breast sizes and body mass indexes. The purpose of this study is search the optimized scan time in breast sentinel lymphangiography by observing how much the body mass index and breast size influence speed of lymphatic flow. Materials and Methods : The Object of this study was 100 breast cancer patients(Female, 100 persons, average age
Automated analysis software was developed to measure the magnitude of the intrafractional and interfractional errors during breast radiation treatments. Error analysis results are important for determining suitable planning target volumes (PTV) prior to Implementing breast-conserving 3-D conformal radiation treatment (CRT). The electrical portal imaging device (EPID) used for this study was a Portal Vision LC250 liquid-filled ionization detector (fast frame-averaging mode, 1.4 frames per second, 256X256 pixels). Twelve patients were imaged for a minimum of 7 treatment days. During each treatment day, an average of 8 to 9 images per field were acquired (dose rate of 400 MU/minute). We developed automated image analysis software to quantitatively analyze 2,931 images (encompassing 720 measurements). Standard deviations (
Purpose: The purpose of this study was to establish accreditation systems of reliable educational materials for nutrition and dietary life which could be used in schools, workplace, and health promotion. Methods: The study was conducted from April 2011 to October 2011. Literature reviews, institutional visits, and telephone interviews were conducted. Expert meetings and advisory councils were held in order to receive feedback on development of the accreditation systems. A survey was conducted for the accreditation procedures on 143 professionals, including professors, researchers, health and medical experts, teachers, nutrition teachers, dietitians, and clinical nutritionists. Results: The final procedure of the developed accreditation system was finalized as follows: 1) receiving application twice per year 2) complete desk review (written evaluation) by three reviewers within two months, 3) board review (all board members) and decision, and 4) notification of results. The accreditation system is set for printed materials, web-site, and materials for activities. The certificate and accreditation mark is issued to the final certified educational materials. Expiration date is established only for the web-site form. The accreditation length lasts for two years, and can be extended by renewal application. Conclusion: The dietary and nutrition related materials, which are certificated by this accreditation system, could impart reliable information and knowledge to both learners and educators, and help them in effective selection of educational materials. Therefore, this accreditation system might be expected to increase satisfaction for teaching and learning about nutrition and healthy dietary life.
Every company wants to know customer's requirement and makes an effort to meet them. Cause that, communication between customer and company became core competition of business and that important is increasing continuously. There are several strategies to find customer's needs, but VOC (Voice of customer) is one of most powerful communication tools and VOC gathering by several channels as telephone, post, e-mail, website and so on is so meaningful. So, almost company is gathering VOC and operating VOC system. VOC is important not only to business organization but also public organization such as government, education institute, and medical center that should drive up public service quality and customer satisfaction. Accordingly, they make a VOC gathering and analyzing System and then use for making a new product and service, and upgrade. In recent years, innovations in internet and ICT have made diverse channels such as SNS, mobile, website and call-center to collect VOC data. Although a lot of VOC data is collected through diverse channel, the proper utilization is still difficult. It is because the VOC data is made of very emotional contents by voice or text of informal style and the volume of the VOC data are so big. These unstructured big data make a difficult to store and analyze for use by human. So that, the organization need to automatic collecting, storing, classifying and analyzing system for unstructured big VOC data. This study propose an intelligent VOC analyzing system based on opinion mining to classify the unstructured VOC data automatically and determine the polarity as well as the type of VOC. And then, the basis of the VOC opinion analyzing system, called domain-oriented sentiment dictionary is created and corresponding stages are presented in detail. The experiment is conducted with 4,300 VOC data collected from a medical website to measure the effectiveness of the proposed system and utilized them to develop the sensitive data dictionary by determining the special sentiment vocabulary and their polarity value in a medical domain. Through the experiment, it comes out that positive terms such as "칭찬, 친절함, 감사, 무사히, 잘해, 감동, 미소" have high positive opinion value, and negative terms such as "퉁명, 뭡니까, 말하더군요, 무시하는" have strong negative opinion. These terms are in general use and the experiment result seems to be a high probability of opinion polarity. Furthermore, the accuracy of proposed VOC classification model has been compared and the highest classification accuracy of 77.8% is conformed at threshold with -0.50 of opinion classification of VOC. Through the proposed intelligent VOC analyzing system, the real time opinion classification and response priority of VOC can be predicted. Ultimately the positive effectiveness is expected to catch the customer complains at early stage and deal with it quickly with the lower number of staff to operate the VOC system. It can be made available human resource and time of customer service part. Above all, this study is new try to automatic analyzing the unstructured VOC data using opinion mining, and shows that the system could be used as variable to classify the positive or negative polarity of VOC opinion. It is expected to suggest practical framework of the VOC analysis to diverse use and the model can be used as real VOC analyzing system if it is implemented as system. Despite experiment results and expectation, this study has several limits. First of all, the sample data is only collected from a hospital web-site. It means that the sentimental dictionary made by sample data can be lean too much towards on that hospital and web-site. Therefore, next research has to take several channels such as call-center and SNS, and other domain like government, financial company, and education institute.
A retrospective study of 94 hypercalcemic dogs was performed to find out most common causes that lead to hypercalcemia through investigating dogs referred to the Veterinary Teaching Hospital of Konkuk University from 2002 to 2004. During the study period, hypercalcemia was found in 94 dogs of 19 breeds, and they were evaluated as case group. Control group was made up of 94 dogs of 18 breeds without hypercalcemia admitted for the same study period. For general signalments, there were no significant differences between case and control group with the exception of age distribution. Shih-tzu(17.02%) and Yorkshire terrier(26.60%) was the most common breed in case and control group, respectively. The most common diseases associated with hypercalcemia were chronic renal failure (18.09%), acute renal failure(14.89%), and renal calculi(6.38%). Malignant neoplasia(lymphoma, hemangiosarcoma, chronic lymphocytic leukemia, mammary gland tumor, and multiple myeloma) and endocrinopathies(hyperadrenocorticism, hyperthyroidism, hypoadrenocorticism, and hypothyroidism) occupied 8.5% and 6.4%, respectively. This report is a first retrospective study of hypercalcemic dogs in South Korea.
Translationally controlled tumor protein (TCTP) is one of the most abundant proteins in various eukaryotic organisms. TCTPs play important roles in cell physiological processes in cancer, cell proliferation, gene regulation, and heat shock response. TCTP is also considered an important factor in the resistance to oxidative stress induced by dithiothreitol or hydrogen peroxide (H2O2). Arctic calanoid copepods have a variety of antioxidant defense systems to regulate the levels of potentially harmful reactive oxygen species generated by ultraviolet radiation in the Arctic marine ecosystem. However, information on the antioxidant activity of TCTP in the Arctic Calanus glacialis is still scarce. To understand the putative antioxidant function of the Arctic copepod C. glacialis TCTP (Cg-TCTP), its gene was cloned and sequenced. The Cg-TCTP comprised 522 bp and encoded a 174-amino acid putative protein with a calculated molecular weight of ~23 kDa. The recombinant Cg-TCTP (Cg-r TCTP) gene was overexpressed in Escherichia coli (BL21), and Cg-rTCTP-transformed cells were grown in the presence or absence of H2O2. Cg-rTCTP-transformed E. coli showed increased tolerance to high H2O2 concentrations. Therefore, TCTP may be an important antioxidant protein related to tolerance of the Arctic copepod C. glacialis to oxidative stress in the harsh environment of the Arctic Ocean.
The wall shear stress in the vicinity of end-to end anastomoses under steady flow conditions was measured using a flush-mounted hot-film anemometer(FMHFA) probe. The experimental measurements were in good agreement with numerical results except in flow with low Reynolds numbers. The wall shear stress increased proximal to the anastomosis in flow from the Penrose tubing (simulating an artery) to the PTFE: graft. In flow from the PTFE graft to the Penrose tubing, low wall shear stress was observed distal to the anastomosis. Abnormal distributions of wall shear stress in the vicinity of the anastomosis, resulting from the compliance mismatch between the graft and the host artery, might be an important factor of ANFH formation and the graft failure. The present study suggests a correlation between regions of the low wall shear stress and the development of anastomotic neointimal fibrous hyperplasia(ANPH) in end-to-end anastomoses. 30523 T00401030523 ^x Air pressure decay(APD) rate and ultrafiltration rate(UFR) tests were performed on new and saline rinsed dialyzers as well as those roused in patients several times. C-DAK 4000 (Cordis Dow) and CF IS-11 (Baxter Travenol) reused dialyzers obtained from the dialysis clinic were used in the present study. The new dialyzers exhibited a relatively flat APD, whereas saline rinsed and reused dialyzers showed considerable amount of decay. C-DAH dialyzers had a larger APD(11.70
The wall shear stress in the vicinity of end-to end anastomoses under steady flow conditions was measured using a flush-mounted hot-film anemometer(FMHFA) probe. The experimental measurements were in good agreement with numerical results except in flow with low Reynolds numbers. The wall shear stress increased proximal to the anastomosis in flow from the Penrose tubing (simulating an artery) to the PTFE: graft. In flow from the PTFE graft to the Penrose tubing, low wall shear stress was observed distal to the anastomosis. Abnormal distributions of wall shear stress in the vicinity of the anastomosis, resulting from the compliance mismatch between the graft and the host artery, might be an important factor of ANFH formation and the graft failure. The present study suggests a correlation between regions of the low wall shear stress and the development of anastomotic neointimal fibrous hyperplasia(ANPH) in end-to-end anastomoses. 30523 T00401030523 ^x Air pressure decay(APD) rate and ultrafiltration rate(UFR) tests were performed on new and saline rinsed dialyzers as well as those roused in patients several times. C-DAK 4000 (Cordis Dow) and CF IS-11 (Baxter Travenol) reused dialyzers obtained from the dialysis clinic were used in the present study. The new dialyzers exhibited a relatively flat APD, whereas saline rinsed and reused dialyzers showed considerable amount of decay. C-DAH dialyzers had a larger APD(11.70
Today most developed countries provide modern medical care for most of the population. The rural area is the more neglected area in the medical and health field. In public health, the philosophy is that medical care for in maintenance of health is a basic right of man; it should not be discriminated against racial, environmental or financial situations. The deficiency of the medical care system, cultural bias, economic development, and ignorance of the residents about health care brought about the shortage of medical personnel and facilities on the rural areas. Moreover, medical students and physicians have been taught less about rural health care than about urban health care. Medical care, therefore, is insufficient in terms of health care personnel/and facilities in rural areas. Under such a situation, there is growing concern about the health problems among the rural population. The findings presented in this report are useful measures of the major health problems and even more important, as a guide to planning for improved medical care systems. It is hoped that findings from this study will be useful to those responsible for improving the delivery of health service for the rural population. Objectives: -to determine the health status of the residents in the rural areas. -to assess the rural population's needs in terms of health and medical care. -to make recommendations concerning improvement in the delivery of health and medical care for the rural population. Procedures: For the sampling design, the ideal would be to sample according to the proportion of the composition age-groups. As the health problems would be different by group, the sample was divided into 10 different age-groups. If the sample were allocated by proportion of composition of each age group, some age groups would be too small to estimate the health problem. The sample size of each age-group population was 100 people/age-groups. Personal interviews were conducted by specially trained medical students. The interviews dealt at length with current health status, medical care problems, utilization of medical services, medical cost paid for medical care and attitudes toward health. In addition, more information was gained from the public health field, including environmental sanitation, maternal and child health, family planning, tuberculosis control, and dental health. The sample Sample size was one fourth of total population: 1,438 The aged 10-14 years showed the largest number of 254 and the aged under one year was the smallest number of 81. Participation in examination Examination sessions usually were held in the morning every Tuesday, Wenesday, and Thursday for 3 hours at each session at the Namchun Health station. In general, the rate of participation in medical examination was low especially in ages between 10-19 years old. The highest rate of participation among are groups was the under one year age-group by 100 percent. The lowest use rate as low as 3% of those in the age-groups 10-19 years who are attending junior and senior high school in Taegu city so the time was not convenient for them to recieve examinations. Among the over 20 years old group, the rate of participation of female was higher than that of males. The results are as follows: A. Publie health problems Population: The number of pre-school age group who required child health was 724, among them infants numbered 96. Number of eligible women aged 15-44 years was 1,279, and women with husband who need maternal health numbered 700. The age-group of 65 years or older was 201 needed more health care and 65 of them had disabilities. (Table 2). Environmental sanitation: Seventy-nine percent of the residents relied upon well water as a primary source of dringking water. Ninety-three percent of the drinking water supply was rated as unfited quality for drinking. More than 90% of latrines were unhygienic, in structure design and sanitation (Table 15). Maternal and child health: Maternal health Average number of pregnancies of eligible women was 4 times. There was almost no pre- and post-natal care. Pregnancy wastage Still births was 33 per 1,000 live births. Spontaneous abortion was 156 per 1,000 live births. Induced abortion was 137 per 1,000 live births. Delivery condition More than 90 percent of deliveries were conducted at home. Attendants at last delivery were laymen by 76% and delivery without attendants was 14%. The rate of non-sterilized scissors as an instrument used to cut the umbilical cord was as high as 54% and of sickles was 14%. The rate of difficult delivery counted for 3%. Maternal death rate estimates about 35 per 10,000 live births. Child health Consultation rate for child health was almost non existant. In general, vaccination rate of children was low; vaccination rates for children aged 0-5 years with BCG and small pox were 34 and 28 percent respectively. The rate of vaccination with DPT and Polio were 23 and 25% respectively but the rate of the complete three injections were as low as 5 and 3% respectively. The number of dead children was 280 per 1,000 living children. Infants death rate was 45 per 1,000 live births (Table 16), Family planning: Approval rate of married women for family planning was as high as 86%. The rate of experiences of contraception in the past was 51%. The current rate of contraception was 37%. Willingness to use contraception in the future was as high as 86% (Table 17). Tuberculosis control: Number of registration patients at the health center currently was 25. The number indicates one eighth of estimate number of tuberculosis in the area. Number of discharged cases in the past accounted for 79 which showed 50% of active cases when discharged time. Rate of complete treatment among reasons of discharge in the past as low as 28%. There needs to be a follow up observation of the discharged cases (Table 18). Dental problems: More than 50% of the total population have at least one or more dental problems. (Table 19) B. Medical care problems Incidence rate: 1. In one month Incidence rate of medical care problems during one month was 19.6 percent. Among these health problems which required rest at home were 11.8 percent. The estimated number of patients in the total population is 1,206. The health problems reported most frequently in interviews during one month are: GI trouble, respiratory disease, neuralgia, skin disease, and communicable disease-in that order, The rate of health problems by age groups was highest in the 1-4 age group and in the 60 years or over age group, the lowest rate was the 10-14 year age group. In general, 0-29 year age group except the 1-4 year age group was low incidence rate. After 30 years old the rate of health problems increases gradually with aging. Eighty-three percent of health problems that occured during one month were solved by primary medical care procedures. Seventeen percent of health problems needed secondary care. Days rested at home because of illness during one month were 0.7 days per interviewee and 8days per patient and it accounts for 2,161 days for the total productive population in the area. (Table 20) 2. In a year The incidence rate of medical care problems during a year was 74.8%, among them health problems which required rest at home was 37 percent. Estimated number of patients in the total population during a year was 4,600. The health problems that occured most frequently among the interviewees during a year were: Cold (30%), GI trouble (18), respiratory disease (11), anemia (10), diarrhea (10), neuralgia (10), parasite disease (9), ENT (7), skin (7), headache (7), trauma (4), communicable disease (3), and circulatory disease (3) -in that order. The rate of health problems by age groups was highest in the infants group, thereafter the rate decreased gradually until the age 15-19 year age group which showed the lowest, and then the rate increased gradually with aging. Eighty-seven percent of health problems during a year were solved by primary medical care. Thirteen percent of them needed secondary medical care procedures. Days rested at home because of illness during a year were 16 days per interviewee and 44 days per patient and it accounted for 57,335 days lost among productive age group in the area (Table 21). Among those given medical examination, the conditions observed most frequently were respiratory disease, GI trouble, parasite disease, neuralgia, skin disease, trauma, tuberculosis, anemia, chronic obstructive lung disease, eye disorders-in that order (Table 22). The main health problems required secondary medical care are as fellows: (previous page). Utilization of medical care (treatment) The rate of treatment by various medical facilities for all health problems during one month was 73 percent. The rate of receiving of medical care of those who have health problems which required rest at home was 52% while the rate of those who have health problems which did not required rest was 61 percent (Table 23). The rate of receiving of medical care for all health problems during a year was 67 percent. The rate of receiving of medical care of those who have health problems which required rest at home was 82 percent while the rate of those who have health problems which did not required rest was as low as 53 percent (Table 24). Types of medical facilitied used were as follows: Hospital and clinics: 32-35% Herb clinics: 9-10% Drugstore: 53-58% Hospitalization Rate of hospitalization was 1.7% and the estimate number of hospitalizations among the total population during a year will be 107 persons (Table 25). Medical cost: Average medical cost per person during one month and a year were 171 and 2,800 won respectively. Average medical cost per patient during one month and a year were 1,109 and 3,740 won respectively. Average cost per household during a year was 15,800 won (Table 26, 27). Solution measures for health and medical care problems in rural area: A. Health problems which could be solved by paramedical workers such as nurses, midwives and aid nurses etc. are as follows: 1. Improvement of environmental sanitation 2. MCH except medical care problems 3. Family planning except surgical intervention 4. Tuberculosis control except diagnosis and prescription 5. Dental care except operational intervention 6. Health education for residents for improvement of utilization of medical facilities and early diagnosis etc. B. Medical care problems 1. Eighty-five percent of health problems could be solved by primary care procedures by general practitioners. 2. Fifteen percent of health problems need secondary medical procedures by a specialist. C. Medical cost Concidering the economic situation in rural area the amount of 2,062 won per residents during a year will be burdensome, so financial assistance is needed gorvernment to solve health and medical care problems for rural people.