• Title/Summary/Keyword: Pre-Classification

Using the METHONTOLOGY Approach to a Graduation Screen Ontology Development: An Experiential Investigation of the METHONTOLOGY Framework

  • Park, Jin-Soo;Sung, Ki-Moon;Moon, Se-Won
    • Asia pacific journal of information systems
    • /
    • v.20 no.2
    • /
    • pp.125-155
    • /
    • 2010
  • Ontologies have been adopted in various business and scientific communities as a key component of the Semantic Web. Despite the increasing importance of ontologies, ontology developers still perceive construction tasks as a challenge. A clearly defined and well-structured methodology can reduce the time required to develop an ontology and increase the probability of success of a project. However, no reliable knowledge-engineering methodology for ontology development currently exists; every methodology has been tailored toward the development of a particular ontology. In this study, we developed a Graduation Screen Ontology (GSO). The graduation screen domain was chosen for several reasons. First, the graduation screen process is a complicated task requiring a complex reasoning process. Second, GSO may be reused for other universities because the graduation screen process is similar for most universities. Finally, GSO can be built within a given period because the size of the selected domain is reasonable. No standard ontology development methodology exists; thus, one of the existing ontology development methodologies had to be chosen. The most important considerations for selecting the ontology development methodology of GSO included whether it can be applied to a new domain; whether it covers a broader set of development tasks; and whether it gives sufficient explanation of each development task. We evaluated various ontology development methodologies based on the evaluation framework proposed by Gómez-Pérez et al. We concluded that METHONTOLOGY was the most applicable to the building of GSO for this study. METHONTOLOGY was derived from the experience of developing Chemical Ontology at the Polytechnic University of Madrid by Fernández-López et al. and is regarded as the most mature ontology development methodology.
METHONTOLOGY describes a very detailed approach for building an ontology under a centralized development environment at the conceptual level. This methodology consists of three broad processes, with each process containing specific sub-processes: management (scheduling, control, and quality assurance); development (specification, conceptualization, formalization, implementation, and maintenance); and support process (knowledge acquisition, evaluation, documentation, configuration management, and integration). An ontology development language and ontology development tool for GSO construction also had to be selected. We adopted OWL-DL as the ontology development language. OWL was selected because of its computational support for consistency checking and classification, which is crucial in developing coherent and useful ontological models for very complex domains. In addition, Protégé-OWL was chosen as the ontology development tool because it is supported by METHONTOLOGY and is widely used because of its platform-independent characteristics. Based on the GSO development experience of the researchers, some issues relating to METHONTOLOGY, OWL-DL, and Protégé-OWL were identified. We focused on presenting drawbacks of METHONTOLOGY and discussing how each weakness could be addressed. First, METHONTOLOGY insists that domain experts who do not have ontology construction experience can easily build ontologies. However, it is still difficult for these domain experts to develop a sophisticated ontology, especially if they have insufficient background knowledge related to the ontology. Second, METHONTOLOGY does not include a development stage called the "feasibility study." This pre-development stage helps developers not only ensure that a planned ontology is necessary and sufficiently valuable to begin an ontology building project, but also determine whether the project will be successful.
Third, METHONTOLOGY excludes an explanation on the use and integration of existing ontologies. If an additional stage for considering reuse is introduced, developers might share benefits of reuse. Fourth, METHONTOLOGY fails to address the importance of collaboration. This methodology needs to explain the allocation of specific tasks to different developer groups, and how to combine these tasks once specific given jobs are completed. Fifth, METHONTOLOGY fails to suggest the methods and techniques applied in the conceptualization stage sufficiently. Introducing methods of concept extraction from multiple informal sources or methods of identifying relations may enhance the quality of ontologies. Sixth, METHONTOLOGY does not provide an evaluation process to confirm whether WebODE perfectly transforms a conceptual ontology into a formal ontology. It also does not guarantee whether the outcomes of the conceptualization stage are completely reflected in the implementation stage. Seventh, METHONTOLOGY needs to add criteria for user evaluation of the actual use of the constructed ontology under user environments. Eighth, although METHONTOLOGY allows continual knowledge acquisition while working on the ontology development process, consistent updates can be difficult for developers. Ninth, METHONTOLOGY demands that developers complete various documents during the conceptualization stage; thus, it can be considered a heavy methodology. Adopting an agile methodology will result in reinforcing active communication among developers and reducing the burden of documentation completion. Finally, this study concludes with contributions and practical implications. No previous research has addressed issues related to METHONTOLOGY from empirical experiences; this study is an initial attempt. In addition, several lessons learned from the development experience are discussed. 
This study also affords some insights for ontology methodology researchers who want to design a more advanced ontology development methodology.

Mitral Valve Reconstruction in Mitral Insufficiency : Intermediate-Term Results (승모판 폐쇄부전증에서 승모판 재건술의 중기평가)

  • 김석기;김경화;김공수;조중구;신동근
    • Journal of Chest Surgery
    • /
    • v.35 no.10
    • /
    • pp.705-711
    • /
    • 2002
  • The advantages of mitral valve reconstruction are well established, and it is now considered the procedure of choice for correcting mitral valve disease. This report presents the intermediate-term results of 38 patients who underwent mitral valve reconstruction for valve insufficiency (reconstruction was attempted in 49 cases, but 11 cases that were converted to mitral valve replacement because of incomplete reconstruction were excluded). Material and Method: From March 1991 to March 2001, 38 patients underwent mitral valve repair for mitral valve regurgitation with or without stenosis. Mean age was 47.6 ± 14.7 years (range 15 to 70 years); 11 were men and 27 were women. The causes of mitral valve regurgitation were degenerative in 14, rheumatic in 21, infective in 2, and congenital in 1. Result: According to Carpentier's pathologic classification of mitral valve regurgitation, 3 were type I, 16 were type II, and 19 were type III. Surgical procedures were annuloplasty in 15, commissurotomy in 19, leaflet resection and annular plication in 9, chordae shortening in 11, chordae transfer in 5, new chordae formation in 2, papillary muscle splitting in 2, and vegetectomy in 2. These procedures were combined in most patients. There were 2 early deaths, and the causes of death were respiratory failure, renal failure, and sepsis. There was no late death. Valve replacement was performed in 6 patients after repair because of valve insufficiency or stenosis, at 3 weeks and 1, 3, 51, 69, and 84 months after repair, respectively. The patients have been followed up for 1 to 116 months (mean 43.0 months). The mean functional class (NYHA) was 2.36 preoperatively and improved to 1.70. Conclusion: In most cases of mitral valve regurgitation, mitral valve reconstruction, when technically feasible, is an effective operation that can achieve stable functional results with low surgical and late mortality.

An Analysis of the Locational Selection Factors of the Small- and Medium-sized Hospitals Using the AHP : Centered on the Spine and Joint Hospitals (AHP를 이용한 중·소 병원 입지선택요인 분석 : 척추·관절 병원중심으로)

  • Kim, Duck Ki;Shim, Gyo-Eon
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.5
    • /
    • pp.191-214
    • /
    • 2018
  • This research empirically analyzed the selection factors and locational selection factors of medical service facilities, which have gradually grown in importance for the establishment of small- and medium-sized hospitals amid rapid socio-economic change. The study ranks the evaluation factors by importance and, by classifying them into real estate locational factors and non-locational factors, identifies which factors most influence the competitiveness of existing small- and medium-sized hospitals. Its purpose is to provide basic data for the opening strategies of medical suppliers preparing to open new small- and medium-sized hospitals, taking into account the particular locational characteristics implied by the important selection factors. Based on the results of preceding research and case studies, 28 evaluation factors were derived in terms of the level of medical treatment, the medical services, the accessibility of the hospital, the convenience of the hospital, and the physical environment. Then, through interviews with related experts, 5 factors (the medical level, the medical service, the expertise of the hospital, the convenience of the hospital, and the physical environment) were selected as the upper-level evaluation factors, and the 28 detailed factors were assigned under them as lower-level evaluation factors.
Using the selected evaluation factors, the optimal locational factors were determined through an AHP questionnaire survey of 200 medical experts. In the AHP analysis results, consistent with the precedent research, importance ranked in the order of the medical level, the medical services, the accessibility of the hospital, the physical environment, and the convenience. The factors related to hospital facilities ranked low. The results of this research can provide a basis for decision-making on the location of small- and medium-sized hospitals in the future.
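
The abstract does not say how the AHP weights were computed, so the following is only a minimal sketch of one common approach: the row geometric-mean approximation of the priority vector, followed by Saaty's consistency ratio. The three factor names are taken from the abstract, but the pairwise judgments in `example_matrix` are hypothetical values chosen for illustration, not survey data from the study.

```python
import math

# Saaty's random consistency index for matrices of size 3 to 5.
RANDOM_INDEX = {3: 0.58, 4: 0.90, 5: 1.12}


def ahp_weights(matrix):
    """Approximate AHP priority weights with the row geometric-mean method."""
    n = len(matrix)
    geo_means = [math.prod(row) ** (1.0 / n) for row in matrix]
    total = sum(geo_means)
    return [g / total for g in geo_means]


def consistency_ratio(matrix, weights):
    """Estimate lambda_max from A*w, then return CR = CI / RI."""
    n = len(matrix)
    aw = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
    lambda_max = sum(aw[i] / weights[i] for i in range(n)) / n
    consistency_index = (lambda_max - n) / (n - 1)
    return consistency_index / RANDOM_INDEX[n]


# Hypothetical pairwise judgments (NOT from the paper):
# rows and columns are medical level, medical services, accessibility.
factors = ["medical level", "medical services", "accessibility"]
example_matrix = [
    [1.0, 3.0, 5.0],
    [1.0 / 3.0, 1.0, 3.0],
    [1.0 / 5.0, 1.0 / 3.0, 1.0],
]

weights = ahp_weights(example_matrix)
cr = consistency_ratio(example_matrix, weights)

for name, w in zip(factors, weights):
    print(f"{name}: {w:.3f}")
print(f"consistency ratio: {cr:.3f}")
```

A consistency ratio below 0.1 is the usual rule of thumb for accepting a respondent's judgments; in a survey like the one described, a check of this kind would typically be applied to each questionnaire before aggregating.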

A basic research for evaluation of a Home Care Nursing Delivery System (가정간호 서비스 질 평가를 위한 도구개발연구)

  • Kim, Mo-Im;Cho, Won-Jung;Kim, Eui-Sook;Kim, Sung-Kyu;Chang, Soon-Bok;Ryu, Ho-Sihn
    • Journal of Korean Academic Society of Home Health Care Nursing
    • /
    • v.6
    • /
    • pp.33-45
    • /
    • 1999
  • The purpose of this study was to develop a basic framework and criteria for evaluation of quality care provided to patients with the attributes of disease in the home care nursing field, and to provide measurement tools for home health care in the future. The study design was a developmental study for evaluation of hospital-based HCN (home care nursing) in Korea. The study process was as follows: a home care nursing study team of the College of Nursing, Yonsei University reviewed the nursing records of 47 patients who were enrolled at Yonsei University Medical Center Home Care Center in March 1995. Twenty-five patients who were insured at that time were selected from the 47 patients receiving home care service for study feasibility, in six disease groups: Caesarean section (C/S), simple nephrectomy, liver cirrhosis (LC), chronic obstructive pulmonary disease (COPD), lung cancer, or cerebrovascular accident (CVA). In this study, the following items were selected: First step: Preliminary study 1. Criteria and items were selected on the basis of related literature on each disease area. 2. Items were identified by home care nurses. 3. A physician in charge reviewed the criteria and content of selected items. 4. Items were revised through a preliminary study offered to both HCN patients and patients discharged from the home care center. Second step: Pretest 1. To verify the content of the items, a pretest was conducted with 18 patients, three in each of the six selected disease groups. Third step: Test of reliability and validity of tools 1. Using the collected data from 25 patients with either C/S, simple nephrectomy, LC, COPD, lung cancer, or CVA, the final items were revised through a panel discussion among experts in medical care who were researchers, doctors, or nurses. 2. Reliability and validity of the completed tool were verified with both inpatients and HCN patients in each research field. The study results are as follows: 1.
Standard for discharge with HCN referral: The referral standard for home care, which included criteria for discharge with HCN referral and criteria for leaving the hospital, was established. These were developed through content analysis of the results of an open-ended questionnaire to related doctors concerning the characteristics for discharge with HCN referral for each of the disease groups. The final criteria were decided by discussion among the researchers. 2. Instrument for measurement of health status: Patient health status was measured before and after home care by direct observation and interview with an open-ended questionnaire which consisted of 61 items based on Gordon's nursing diagnosis classification. These included seven items on health knowledge and health management, eight items on nutrition and metabolism, three items on elimination, five items on activity and exercise, seven items on perception and cognition, three items on sleep and rest, three items on self-perception, three items on role and interpersonal relations, five items on sexuality and reproduction, five items on coping and stress, four items on value and religion, three items on family, and three items on facilities and environment. 3. Instrument for measurement of self-care: The instrument for self-care measurement was classified with scales according to the attributes of the disease. Each scale measured understanding level and practice level by a Yes or No scale. Understanding level was measured by interview, but practice level was measured by both observation and interview. Items for self-care measurement included 14 for patients with a CVA, five for women who had a C/S, ten for patients with lung cancer, 12 for patients with COPD, five for patients with a simple nephrectomy, and 11 for patients with LC. 4. Record for follow-up management: This included (1) OPD visit sheet, (2) ER visit form, (3) complications problem form, (4) readmission sheet,
and (5) visit note for other medical centers, which included visit date, reason for visit, patient name, caregivers, sex, age, time and cost required for visit, and traffic expenses; that is, there were open-ended items that investigated OPD visits, emergency room visits, the problem and solution of complications, readmissions, and visits to other medical institutions to measure health problems and expenditures during the follow-up period. 5. Instrument to measure patient satisfaction: The satisfaction measurement instrument by Reisseer (1975) was referred to for the development of a tool to measure patient home care satisfaction. The instrument was an open-ended questionnaire which consisted of 11 domains: treatment, nursing care, information, time consumption, accessibility, rapidity, treatment skill, service relevance, attitude, satisfaction factors, dissatisfaction factors, overall satisfaction about nursing care, and others. In conclusion, five evaluation instruments were developed for home care nursing. These were (1) standard for discharge with HCN referral, (2) instrument for measurement of health status, (3) instrument for measurement of self-care, (4) record for follow-up management, and (5) instrument to measure patient satisfaction. The five instruments can be used to evaluate the effectiveness of the service to assure quality. Further research is needed to increase the reliability and validity of the instruments through a community-based HCN evaluation.

Treatment and Survival Rate of Malignant Peripheral Nerve Sheath Tumors (악성 말초신경막 종양의 치료와 생존율)

  • Lee, Jong-Seok;Jeon, Dae-Geun;Cho, Wan-Hyung;Lee, Soo-Yong;Oh, Jung-Moon;Kim, Jin-Wook
    • The Journal of the Korean bone and joint tumor society
    • /
    • v.9 no.2
    • /
    • pp.131-138
    • /
    • 2003
  • Purpose: We analyzed our malignant peripheral nerve sheath tumor (MPNST) cases to determine the oncologic results of each treatment modality. Materials and Methods: Thirty-four patients with MPNST were registered at Korea Cancer Center Hospital from Feb. 1986 to Nov. 1996. Seventeen were male and 17 female. Average age was 41 years (range 18 to 74). Tumor locations were as follows: 17 in the lower extremity, 11 in the upper extremity, 4 in the trunk, and 2 in the retroperitoneum. According to the AJC classification, 2 cases were stage IA, 2 stage IIA, 6 stage IIB, 16 stage III, and 8 stage IV. Twenty-six patients underwent surgery with adjuvant chemotherapy and/or radiation therapy, 3 surgery only, and 3 adjuvant chemotherapy or radiation therapy. Average follow-up period was 33.5 months (5.6 to 146.1). The Kaplan-Meier method was used for survival curves and the log-rank test for comparisons. Results: At final follow-up, 14 cases were continuously disease free, 2 had no evidence of disease, 2 were alive with disease, and 14 were dead of disease. Actual 5-year and 10-year survival rates were 53.5% and 35.7%, respectively. Local recurrence rate after operation was 24.1%. The 5-year survival rates of stages I/II/III were 100/85.7/55.9%, and the 2-year survival rate of stage IV was 14.3% (p=0.04). Among 21 operated cases with stage II-III disease, those with a wide margin (15 cases) had a 76.0% 5-year survival rate, and those with a marginal or intralesional margin (6 cases) had 40.0%. The actual 5-year survival rate of the group that received 4 or more cycles of chemotherapy (8 cases) was 71.4%, and the actual 3-year survival rate of the group that received fewer than 4 cycles (6 cases) was 83.3% (p=0.96). Among 19 operated cases with stage II-III disease that had no radiotherapy, marginal or intralesional margin (5 cases) had 3 local recurrences (60.0%), whereas wide margin (14 cases) had 4 recurrences (28.6%). There was no local recurrence in the 8 cases that had pre- or postoperative radiotherapy.
Conclusions: Surgical margin is an important factor in local recurrence. Resection margin tends to influence survival, although the difference did not reach statistical significance. Conventional chemotherapy showed no definite statistically significant effect on local control or survival. Preoperative and postoperative radiotherapy has some positive effect on local control.
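
For readers unfamiliar with the Kaplan-Meier method named in the abstract, the product-limit estimate can be sketched in a few lines. This is an illustrative sketch only: the follow-up times and event flags in `example_times` and `example_events` are hypothetical and are not the study's patient data, and the function name `kaplan_meier` is introduced here for the example.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times  : follow-up time for each patient (e.g. months)
    events : 1 if death was observed at that time, 0 if censored
    Returns a list of (time, survival probability) at each event time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all patients sharing the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        # Both deaths and censored patients leave the risk set.
        at_risk -= removed
    return curve


# Hypothetical follow-up data (NOT from the paper).
example_times = [6, 12, 12, 20, 30, 45]
example_events = [1, 1, 0, 1, 0, 0]

km_curve = kaplan_meier(example_times, example_events)
for t, s in km_curve:
    print(f"month {t}: S(t) = {s:.3f}")
```

Censored patients reduce the number at risk without lowering the curve, which is why the estimate can differ from a simple proportion of survivors at a fixed time point.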

Trend and Further Research of Rice Quality Evaluation (쌀의 품질평가 현황과 금후 연구방향)

  • Son, Jong-Rok;Kim, Jae-Hyun;Lee, Jung-Il;Youn, Young-Hwan;Kim, Jae-Kyu;Hwang, Hung-Goo;Moon, Hun-Pal
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.47
    • /
    • pp.33-54
    • /
    • 2002
  • Rice quality depends heavily on pre- and post-harvest management. Many parameters influence the quality of rice or cooked rice, such as cultivar, climate, soil, harvest time, drying, milling, storage, safety, nutritive value, taste, marketing, eating, cooking conditions, and each nation's food culture. Thus, rice evaluation cannot be carried out using only a few parameters. Physicochemical evaluation of rice deals with amylose content, gelatinizing property, and their relation with taste. The amylose content of good rice in Korea is defined as 17 to 20%. Other parameters considered are as follows: the ratio of protein body-1 to total protein amount in relation to taste, and the oleic/linoleic acid ratio in relation to storage safety. Rice with a higher Mg/K ratio is considered high quality. The optimum value is over 1.5 to 1.6. It was reported that the contents of oligosaccharides, glutamic acid or its derivatives, and their proportions are highly correlated with the taste of rice. The major aromatic compounds in rice are known to be hexanal, acetone, pentanal, butanal, octanal, and heptanal. Recently, it was found that muco-polysaccharides are solubilized during cooking. The cooked rice surface is coated by the muco-polysaccharides. The muco-polysaccharides contribute to the consistency and collect free amino acids and vitamins. Thus, these parameters might be regarded as important items for quality and taste evaluation of rice. Ingredients of rice related to taste are not confined to the total rice grain. In the internal kernel, starch is the main component, but nitrogen and mineral compounds are localized in the external kernel. The ingredients related to taste are contained in 91 to 86% part of the outside kernel. For safety, which is considered an important evaluation item of rice quality, residual tolerance limits for each agricultural chemical must be adopted in our country.
During drying, rice quality can decline because of high drying temperature, overdrying, and rapid drying. These result in cracked grain or discolored kernels. Intrinsic enzymes react partially during rice storage. Because of these enzymes, starch, lipid, or protein can be slowly degraded, resulting in a decline of appearance quality, the occurrence of aging aroma, and increased hardness of cooked rice. Milling conditions related to quality are paddy quality, milling method, and milling machines. To produce high quality rice, head rice must contain over three fourths of the normal rice kernels, and broken, damaged, colored, and immature kernels must be eliminated. In addition to milling equipment, a color sorter and a length grader must be installed for the production of such rice. Head rice was examined using 45 rice brands sold in Korea, Japan, America, Australia, and China. It was found that the head rice rate of brand rice in our country was approximately 57.4%, compared with 80-86% in foreign countries. In order to develop a rice quality evaluation system, evaluation techniques must be further developed: more detailed measures of quality, a search for taste-related components, creation and grade classification of quality evaluation factors at each stage of post-harvest management, evaluation of rice as a food material as well as for rice cooking, and development of simple evaluation methods and an equation for palatability. On policy concerns, the following must be conducted: development of price differentiation according to rice cultivar and grade on the basis of the quality evaluation method, establishment of head rice branding, and introduction of low temperature circulation.

Digital Hologram Compression Technique By Hybrid Video Coding (하이브리드 비디오 코팅에 의한 디지털 홀로그램 압축기술)

  • Seo, Young-Ho;Choi, Hyun-Jun;Kang, Hoon-Jong;Lee, Seung-Hyun;Kim, Dong-Wook
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.5 s.305
    • /
    • pp.29-40
    • /
    • 2005
  • As the base of digital holography has expanded, discussion of compression technology is expected, since an international standard defining compression techniques for 3D image and video has been progressing in the form of 3DAV, which is a part of MPEG. As we can see in the case of 3DAV, the coding technique is highly likely to take a hybrid form that merges, refines, or mixes various previous techniques. Therefore, we wish to present the relationship between various image/video coding techniques and digital holograms. In this paper, we propose an efficient coding method for digital holograms using standard compression tools for video and images. First, we convert fringe patterns into video data using a principle of CGH (Computer Generated Hologram), and then encode it. In this research, we propose a compression algorithm made up of various methods such as pre-processing for transform, local segmentation with global information of the object image, frequency transform for coding, scanning to make the fringe into a video stream, classification of coefficients, and hybrid video coding. The proposed hybrid compression algorithm combines all of these methods. The tool for still image coding is JPEG2000, and the tools for video coding include various international compression algorithms such as MPEG-2, MPEG-4, and H.264, as well as various lossless compression algorithms. The proposed algorithm showed better reconstruction properties than previous research at far greater compression rates, from four to eight times as high. Therefore, we expect the proposed technique for digital hologram coding to be a good preceding study.

High School Students' Perception on Psychological Learning Environment Generated by Science Teachers and Their Attitude Change Related to Science (과학교사에 의해 조성되는 심리적 학습 환경에 대한 고등학생들의 인식과 과학과 관련된 태도 변화)

  • Park, Ki-Sung;Kim, Dong-Jin;Park, So-Young;Park, Kwang-Seo;Jeong, Yeon-Mi;Lim, Kyoung-Ok;Park, Kuk-Tae
    • Journal of the Korean Chemical Society
    • /
    • v.53 no.5
    • /
    • pp.570-584
    • /
    • 2009
  • The purpose of this study was to investigate high school students' perception of the psychological learning environment generated by science teachers and their attitude change related to science. The subjects consisted of 539 freshmen in a boys' high school pre-applied of common school group in S city. This study was conducted with a survey of students' perception and a classification of teachers' features according to it. A survey of science-related attitude was also conducted early in the 1st semester and in the 2nd semester, and the students showing great attitude change related to science were interviewed. The results of this study revealed that, statistically, students had a more positive perception of female teachers than of male ones and that there were clear differences, depending on the teacher, in the psychological learning environment perceived by students. As for the relation between teachers' features and students' attitude change, a negative effect appeared only when the teacher was in charge of only one class; in most cases, there was no meaningful correlation. The semi-structured interviews with students showing great attitude change related to science indicated that the main cause of the change was the achievement they made in class. The interviews showed that the change related to science happened under the indirect rather than the direct influence of teachers. Furthermore, students wanted science teachers to conduct science classes with various instruction behaviors and support behaviors. Therefore, science teachers, who play an important role in students' choice of career, should make efforts to realize the learner-centered curriculum and change students' science-related attitude in a positive direction.

A Clinical Analysis of Femur Neck Fracture in Elderly Patients (노년층에서 대퇴경부 골절의 치료)

  • Ihin, Joo-Choul;Ahn, Myun-Whan;Seo, Jae-Sung
    • Journal of Yeungnam Medical Science
    • /
    • v.2 no.1
    • /
    • pp.11-22
    • /
    • 1985
  • Femur neck fracture is well known as one of the major causes of death after trauma in elderly patients, and as an unsolved fracture because of its frequent association with complications such as avascular necrosis and nonunion. Through meticulous evaluation of the patient and hip, together with the surgeon's experience, reduction of mortality and morbidity as well as rapid recovery of the patient to the preinjury social and ambulatory status without local complications and revision after treatment is urgently needed. Many factors concerning this fracture itself have been noted, but we have preliminarily analyzed 18 femur neck fractures in patients older than 50 years according to age, fracture pattern, osteoporosis, etiology, and method of treatment and its delay, in association with major complications, especially avascular necrosis and nonunion. The results are as follows: 1. Of these 18 fractures, 11 were in females, 8 were caused by minor trauma such as a slip-down accident, and 4 were associated with definite osteoporosis according to Singh's classification. 2. The fracture patterns of these 18 were undisplaced in 4, displaced subcapital in 11, and displaced transcervical in 3. The 11 fractures in patients older than 60 years comprised 3 undisplaced or impacted fractures and 8 displaced subcapital fractures. 3. These 18 fractures were treated by closed reduction and internal fixation with multiple pins in 13 and hemiarthroplasty in 4; one patient was not treated and died after discharge from the hospital. 4. The 4 undisplaced or impacted fractures and 3 displaced transcervical fractures were not associated with any complications such as avascular necrosis or nonunion. But 4 of 6 displaced subcapital fractures were complicated by avascular necrosis, 3 of which were reduced in the varus position within 1 week, and the other was reduced in good position 1 week after trauma. There was no complication in 2 displaced subcapital fractures reduced in valgus position within 3 days after trauma.
According to the above results, the prognosis of femur neck fracture depends on the fracture pattern and the delay in its treatment. It is therefore essential to reduce the fracture in an anatomical or valgus position as early as possible. However, arthroplasty may be needed in displaced subcapital fractures delayed for several days, when reduction is in extreme varus position or impossible, and when there is pre-existing disease in the same hip joint (total hip replacement).

Korean Word Sense Disambiguation using Dictionary and Corpus (사전과 말뭉치를 이용한 한국어 단어 중의성 해소)

  • Jeong, Hanjo;Park, Byeonghwa
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.1-13
    • /
    • 2015
  • As opinion mining in big data applications has been highlighted, a lot of research on unstructured data has been conducted. Social media on the Internet generate unstructured or semi-structured data every second, often written in the natural languages we use in daily life. Many words in human languages have multiple meanings or senses. As a result, it is very difficult for computers to extract useful information from these datasets. Traditional web search engines are usually based on keyword search, which yields incorrect results that are far from users' intentions. Even though a lot of progress has been made over recent years in enhancing the performance of search engines in order to provide users with appropriate results, there is still much room for improvement. Word sense disambiguation can play a very important role in natural language processing and is considered one of the most difficult problems in this area. Major approaches to word sense disambiguation can be classified as knowledge-based, supervised corpus-based, and unsupervised corpus-based approaches. This paper presents a method that automatically generates a corpus for word sense disambiguation by taking advantage of examples in existing dictionaries, avoiding expensive sense tagging processes. It tests the effectiveness of the method using the Naïve Bayes model, a supervised learning algorithm, with the Korean standard unabridged dictionary and the Sejong Corpus. The Korean standard unabridged dictionary has approximately 57,000 sentences. The Sejong Corpus has about 790,000 sentences tagged with both part-of-speech and senses. For the experiment of this study, the Korean standard unabridged dictionary and the Sejong Corpus were tested both in combination and as separate entities using cross validation. Only nouns, the target subjects in word sense disambiguation, were selected.
93,522 word senses among 265,655 nouns and 56,914 sentences from related proverbs and examples were additionally combined in the corpus. The Sejong Corpus was easily merged with the Korean standard unabridged dictionary because the Sejong Corpus was tagged based on sense indices defined by the Korean standard unabridged dictionary. Sense vectors were formed after the merged corpus was created. Terms used in creating sense vectors were added to the named entity dictionary of a Korean morphological analyzer. By using the extended named entity dictionary, term vectors were extracted from the input sentences and then term vectors for the sentences were created. Given the extracted term vector and the sense vector model made during the pre-processing stage, the sense-tagged terms were determined by vector space model based word sense disambiguation. In addition, this study shows the effectiveness of the merged corpus built from examples in the Korean standard unabridged dictionary and the Sejong Corpus. The experiment shows that better results in precision and recall are obtained with the merged corpus. This study suggests that the method can practically enhance the performance of internet search engines and help us understand the meaning of a sentence more accurately in natural language processing pertinent to search engines, opinion mining, and text mining. The Naïve Bayes classifier used in this study is a supervised learning algorithm based on Bayes' theorem. The Naïve Bayes classifier assumes that all senses are independent. Even though this assumption is not realistic and ignores the correlation between attributes, the Naïve Bayes classifier is widely used because of its simplicity, and in practice it is known to be very effective in many applications such as text classification and medical diagnosis. However, further research needs to be carried out to consider all possible combinations and/or partial combinations of all senses in a sentence.
Also, the effectiveness of word sense disambiguation may be improved if rhetorical structures or morphological dependencies between words are analyzed through syntactic analysis.
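
To make the Naïve Bayes step concrete, here is a minimal sketch of a sense classifier trained on sense-tagged example sentences, in the spirit of building a corpus from dictionary examples. It is not the authors' implementation: the class name `NaiveBayesWSD`, the add-one smoothing, and the tiny English training set for the ambiguous word "bank" are all assumptions made for illustration, whereas the study itself used Korean dictionary examples and the Sejong Corpus.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesWSD:
    """Toy Naive Bayes sense classifier with add-one (Laplace) smoothing."""

    def __init__(self):
        self.sense_counts = Counter()            # number of examples per sense
        self.word_counts = defaultdict(Counter)  # context word counts per sense
        self.vocab = set()

    def train(self, examples):
        """examples: iterable of (context_words, sense) pairs."""
        for context_words, sense in examples:
            self.sense_counts[sense] += 1
            for word in context_words:
                self.word_counts[sense][word] += 1
                self.vocab.add(word)

    def predict(self, context_words):
        """Return the sense with the highest log posterior score."""
        total = sum(self.sense_counts.values())
        vocab_size = len(self.vocab)
        best_sense, best_score = None, float("-inf")
        for sense, count in self.sense_counts.items():
            score = math.log(count / total)  # log prior
            denom = sum(self.word_counts[sense].values()) + vocab_size
            for word in context_words:
                # Smoothed log likelihood of each context word.
                score += math.log((self.word_counts[sense][word] + 1) / denom)
            if score > best_score:
                best_sense, best_score = sense, score
        return best_sense


# Hypothetical sense-tagged examples for the word "bank" (NOT from the paper).
training_examples = [
    (["money", "deposit", "loan"], "bank/finance"),
    (["account", "money", "interest"], "bank/finance"),
    (["river", "water", "shore"], "bank/river"),
    (["river", "fishing", "mud"], "bank/river"),
]

model = NaiveBayesWSD()
model.train(training_examples)

finance_guess = model.predict(["deposit", "money"])
river_guess = model.predict(["river", "water"])
print(finance_guess, river_guess)
```

Summing log probabilities rather than multiplying raw probabilities avoids numerical underflow on long sentences, and the smoothing keeps a context word unseen for one sense from zeroing out that sense entirely.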