• Title/Summary/Keyword: traditional system

Search Result 6,553, Processing Time 0.037 seconds

Research Trend and Futuristic Guideline of Platform-Based Business in Korea (플랫폼 기반 비즈니스에 대한 국내 연구동향 및 미래를 위한 가이드라인)

  • Namn, Su Hyeon
    • Management & Information Systems Review
    • /
    • v.39 no.1
    • /
    • pp.93-114
    • /
    • 2020
  • Platform is considered as an alternative strategy to the traditional linear pipeline based business. Moreover, in the 4th industrial revolution period, efficiency driven pipeline business model needs to be changed to platform business. We have such success stories about platform as Apple, Google, Amazon, Uber, and so on. However, for those smaller corporations, it is not easy to find out the transformation strategy. The essence of platform business is to leverage network effect in management. Thus platform based management can be rephrased as network management across the business functions. Research on platform business is popular and related to diverse facets. But few scholars cover what the research trend of the domain is. The main purpose of this paper is to identify the research trend on platform business in Korea. To do that we first propose the analytical model for platform architecture whose components are consumers, suppliers, artifacts, and IT platform system. We conjecture that mapping of the research work on platform to the components of the model will make us understand the hidden domain of platform research. We propose three hypotheses regarding the characteristics of research and one proposition for the transitional path from pipeline to platform business model. The mapping is based on the research articles filtered from the Korea Citation Index, using keyword search. Research papers are searched through the keywords provided by authors using the word of "platform". The filtered articles are summarized in terms of the attributes such as major component of platform considered, platform type, main purpose of the research, and research method. Using the filtered data, we test the hypotheses in exploratory ways. The contribution of our research is as follows: First, based on the findings, scholars can find the areas of research on the domain: areas where research has been matured and territory where future research is actively sought. Second, the proposition provided can give business practitioners the guideline for changing their strategy from pipeline to platform oriented. This research needs to be considered as exploratory not inferential since subjective judgments are involved in data collection, classification, and interpretation of research articles.

Hepatotoxicity Reducing Effect of Ethanol Extracts from Fermented Youngia denticulata Houtt Kitamura in Ethanol-treated Rats (발효 이고들빼기(Youngia denticulata Houtt Kitamura) 에탄올 추출물이 알코올 투여로 손상시킨 흰쥐의 간독성 해소 효과)

  • Kim, Mee-Jung;Lee, Hyun-Koo
    • Journal of the East Asian Society of Dietary Life
    • /
    • v.26 no.5
    • /
    • pp.389-399
    • /
    • 2016
  • This study was performed to investigate the hepatotoxicity reducing effect of ethanol extracts from Youngia denticulata Houtt Kitamura (YD) fermented by Aspergillus oryzae, which is well known as a Korean traditional meju microbe. Experimental rats were divided into 4 diets groups (six rats/group) (NC: normal control group, EC: ethanol supplemented control group, ENY: ethanol extracts 0.5% supplemented diets of the non-fermented YD group, EFY: ethanol extracts 0.5% supplemented diets of the fermented YD group). After feeding them for four weeks, the activities GOT and GPT in serum, contents of neutral lipid, total cholesterol, LPO and GSH, and activities of XO (O type) enzyme, SOD and GST in liver tissue were observed. Compared to the NC group, the GOT and GPT activities in the EC group were markedly increased by 87.91% and 57.02%, and the contents of neutral lipid, total cholesterol and LPO were increased by 77.51%, 18.56% and 193.86%, respectively. On the other hand, the GSH content was decreased by 29.52%. The activities of SOD and GST which are anti-oxidation-related enzymes, were decreased by 42.78% and 0.94%, respectively, compared to the NC group, but the XO (O type) enzyme activity was significantly increased by 70.83%. These results confirmed that the liver tissue and anti-oxidation system of the EC group had been damaged. Compared to the EC group, the GOT and GPT activities of the ENY and EFY groups were significantly decreased by 24.73~41.99% and 22.91~30.11%, respectively. The contents of neutral lipid, total cholesterol and LPO were decreased by 16.65~33.33%, 22.44~23.11% and 63.34~66.13%, respectively, but the GSH content was significantly increased by 24.61~35.60%. The GST activity was not significantly affected, but the SOD activity was significantly increased by 214.94~261.19%. The XO (O type) enzyme activity was significantly decreased by 40.34~51.20%. Compared to the ENY group, the GOT and GPT activities of the EFY group were significantly decreased by 22.93% and 9.34%, respectively. The plasma contents of neutral lipid, total cholesterol and LPO were decreased by 20.01%, 0.86% and 7.59%, respectively. The SOD activity was significantly increased by 21.52%, and the XO (O type) enzyme activity was decreased by 18.19%. Given decreasde TG, total cholesterol, LPO contents of liver tissue, GOT, GPT and XO activity, and increased SOD activity, these study results that ethanol extracts from fermented Youngia denticulata Houtt Kitamura may be effective in reducing hepatotoxicity.

The Characteristics and Significance of 'Nim' Texts in the Late Chason Period: Focused on Saseol-sijo and Chap-ga (조선후기 '님' 담론의 특성과 그 의미 : 사설시조와 잡가를 중심으로)

  • Shin Eun-Kyung
    • Sijohaknonchong
    • /
    • v.20
    • /
    • pp.113-139
    • /
    • 2004
  • This article intends to illuminate how the men. leading agents in Saseol-sijo - musical performers. writers of lyrics, patrons. composers. compilers of Sijo anthologies, audience. etc. - In the Late Choson period, viewed or recognized women and how their understanding of women was reflected in the texts. Working with texts with the theme of 'Love,' this article starts with categorizing two types of love: the first type, 'lovelorn heart' focusing on unilateral pining for a single lover who is absent now and the second type. 'physical love' concentrating on bilateral sexual intercourse. In addition to the types of love, the gender of poetic speakers, distinct from real poets is vital to characterize the discourse of love. According to these two factors. texts in question fall into four groups: texts that a female speaker displays her lovelorn heart('Type 1'), those where she speaks about her sexual experiences('Type 2'), those where a male speaker sings his lovelorn heart('Type 3'), and those where he describes his sexual experiences('Type 4'). Of these. 'Type 2' and 'Type 3' are key to understanding of the men's view of women. With respect to the configuration of the theme of 'Love,' it should be noted that in Korean literary history, the nim or a 'sweetheart' had signified the totality of value or a perfect entity which makes one's life meaningful and that 'Type 1,' the pattern that a female subject expresses her love toward male min, had constituted a traditional way to convey the theme of 'Love.' In terms of this connotation of min. a remarkable increase of 'Type 3' implying the increase of male speakers, reveals the extent to which women, the male speakers' min, accomplished their entry into a 'sacred area' -the position of mm-in which only men had occupied; females are focused and centralized. This article considers this phenomenon as an exhibition of the upgrade of women's significance and weight in the Late Choson society and as an index of 'modernity.' Meanwhile, given that most of the Saseol-sijo poets are men, the emergence of the 'Type 2' texts in which male poets have female speakers disclose their sexual experiences, demonstrates a representative example that women are degraded to be a means of men's pleasure; for this situation gives men more pleasure than when male speakers reveal their sexual experiences. Not only 'Type 2,' but texts group which basically belongs to 'Type I' and conveys the theme of 'Loyalty' through the female voice by substituting rulers-subjects relation for men-women relation, also falls under the same case. For men employ female voice as a poetic device in order to stress the theme of 'Loyalty' This article regards this phenomenon as an index of 'pre-modernity,' in the sense that in a pre-modem society, specifically in Early Choson, male-oriented value system dominates, thereby alienating women. As it is well known, the Late Choson is marked by a transitional period from a pre-modem society to a modem society. Therefore the ambivalence of the premodern and the modem can be found mixed in every segment of the society. The dual aspects of the masculine view of women in Saseol-sijo constitutes one example. The significance of the Saseol-sijo in Korean literary history can be found in this phenomenon.

  • PDF

A Need Assessment on Establishment of Oriental Health Promotion Center (한방건강증진센터 설립에 대한 인식 및 요구조사)

  • Lee, Hyang-Yeon;Kim, Kwuy-Bun;Cho, Kyoul-Ja;Shin, Hye-Sook;Kim, Kwang-Joo;Moon, Heui-Ja;Park, Shin-Ae;Kim, Yoon-Hee;Kang, Hyun-Sook
    • Journal of East-West Nursing Research
    • /
    • v.5 no.1
    • /
    • pp.90-101
    • /
    • 2000
  • The study attempts to examine the degree of cognition and demand on health promotion center of oriental nursing. It puts unique nursing intervention using traditional health promotion connected with oriental medicine to practical use for residents' health promotion and prevention of disease. With the study design of cross-sectional descriptive study, 516 residents who live in 26 Dongs, Dongdaemoon-gu were selected. The tool of study consists of 30 questions which the study team made for the degree of cognition and demand on health promotion center of oriental nursing. Cronbach's $\alpha$ in the degree of cognition was .8028. Collection of research data had been done from October 1 to October 30, 1999 with help of Dongdaemoon-gu office after pretest from 20 residents. Collected data were analyzed into the number and percentage in the characteristic of a subject and connected with demand on the establishment of center, the mean and the standard deviation in the degree of cognition and F-tests in the difference of the degree of cognition by characteristics. The results were as follow; 1) The characteristic of subject of this study was male 50.6%, and average age was 38.5 years old and 30-39 years old occupied the highest percentage with 31.6%. The married were 71.8%, over high school graduates was 85.6%, monthly income from 500 thousand won to 2 million won was 86.1%. 50.8% was the type living with parents, children and sibling. 2) When they were sick, the institution which residents used at first was a pharmacy(69.2%) and hospital(27.5%), but oriental medicine hospital was just 1.4%. As for subjective health condition, 82.5% answered over average, and 28.7% answered that they had chronic illness such as arthritis, chronic digestion problem, hypertension and so on. As for information collection on health, mass communication(34.9%) and medically concerned people(28.1%) occupied relatively high rate. Free health diagnosis system(36.8%) and establishment of health promotion center(31.5%) among welfare programs that residents want to enjoy were high ranked. The rate using a special institution for health was 17.8%, and among these institutions, the rate using aerobic exercises, health center(7.0%) and steamed room(5.4%) was high. Besides, other institutions such as breathing at the abdomen, finger-pressure therapy, meridian massage, foot massage, and so on were being used. 3) As the average of the degree of cognition on health promotion center of oriental medicine was 2.92, the degree of cognition was medium. The description, "health promotion center of oriental medicine is necessary for health keeping of healthy people, including people who have a problem in health" showed the highest degree of cognition(3.04, ${\pm}0.64$). 4) As for the intention on using health promotion center of oriental nursing, 61.4% said "yes", "no" was just 1.4%. The services that people relatively high wanted to be served from the center were measures reducing stress(68.0%) (relaxation therapy, meditation, breathing at the abdomen and so on), acupuncture(66.5%), finger-pressure(61.6%), moxibustion(57.6%), meridian massage(44.2%), postpartum care(40.3%) and so on. 5) As for the degree of cognition on the establishment of health promotion center of oriental nursing by characteristics of subject, there was significant difference(F=4.03, p=.046) between male(3.01) and female(2.91). But there was no significant difference by age, marital status, level of educational achievement and monthly income. As the above result, cognition on the establishment of health promotion center of oriental nursing was relatively low because people were not familiar with about the health promotion center of oriental nursing yet. However once the center will be established, the degree of demand on the center will be relatively high. So positive advertisement will be necessary, and the management of useful programs will be also required in order to make people recognize the advantage when they actually will use the center. On the other hand, as the subject of the study consists of many young people of below 30, the health problem came to be low. And in the case of sampling, the study using random sampling that can represent population will be required.

  • PDF

A Respiration Rate Measurement of Fresh Fruits and Vegetables with a Corrected Pressure Variation Method (수정된 압력변위법을 이용한 과채류 호흡속도 측정)

  • Lee, Hyun-Dong;Chung, Hun-Sik;Kang, Jun-Soo;Chung, Shin-Kyo;Choi, Jong-Uck
    • Korean Journal of Food Science and Technology
    • /
    • v.29 no.6
    • /
    • pp.1119-1124
    • /
    • 1997
  • This study was carried out for improvement and correction of the traditional pressure variation method (PVM) in the respiration rate measurements of fresh fruits and vegetables using a microcomputer system and a differential pressure sensor. Water vapor pressure in the container was calculated by equations for psychrometric calculations. At the beginning of experimental period water vapor pressure in the container was increased and maintained constantly in the most experimental period, but was decreased dramatically after $CO_2$ scrubbing. The percentages of water vapor pressure on total differential pressure were $33{\sim}46%$ at $1^{\circ}C$, $23{\sim}45%$ at $11^{circ}C$ and $35{\sim}53%$ at $21^{\circ}C$. The differences between the respiration rates determined by gas chromatography and corrected pressure variation method (CPVM) were $0.2{\sim}0.3\;mgCO_2kg^{-1}h^{-1}$ at $1^{\circ}C$, $0.2{\sim}2.9\;mgCO_2kg^{-1}h^{-1}$ at $11^{\circ}C$ and 1.0{\sim}9.0\;mgCO_2kg^{-1}h^{-1}$ at $21^{circ}C$, while those between gas chromatography and normal pressure variation method (PVM) were $0.8{\sim}1.2\;mgCO_2kg^{-1}h^{-1}$ at $1^{\circ}C$, $3.9{\sim}11.0\;mgCO_2kg^{-1}h^{-1}$ at $11^{\circ}C$ and $8.0{\sim}32.0\;mgCO_2kg^{-1}h^{-1}$ at $21^{circ}C$, respectively. The differences of the respiration rates with CPVM were smaller than those with PVM. CPVM, therefore, were more exact and convenient method than PVM in the measurement of respiration rate of fresh produce.

  • PDF

The Effect of the Context Awareness Value on the Smartphone Adopter' Advertising Attitude (스마트폰광고 이용자의 광고태도에 영향을 미치는 상황인지가치에 관한 연구)

  • Yang, Chang-Gyu;Lee, Eui-Bang;Huang, Yunchu
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.73-91
    • /
    • 2013
  • Advertising market has been facing new challenges due to dramatic change in advertising channels and the advent of innovative media such as mobile devices. Recent research related to mobile devices is mainly focused on the fact that mobile devices could identify users'physical location in real-time, and this sheds light on how location-based technology is utilized to achieve competitive advantage in advertising market. With the introduction of smartphone, the functionality of smartphone has become much more diverse and context awareness is one of the areas that require further study. This work analyses the influence of context awareness value resulted from the transformation of advertising channel in mobile communication market, and our research result reflects recent trend in advertising market environment which is not considered in previous studies. Many constructs has intensively been studied in the context of advertising channel in traditional marketing environment, and entertainment, irritation and information are considered to be the most widely accepted variables that has positive relationship with advertising value. Also, in smartphone advertisement, four main dimensions of context awareness value are recognized: identification, activity, timing and location. In this study, we assume that these four constructs has positive relationship with context awareness value. Finally, we propose that advertising value and context awareness value positively influence smartphone advertising attitude. Partial Least Squares (PLS) structural model is used in our theoretical research model to test proposed hypotheses. A well designed survey is conducted for college students in Korea, and reliability, convergent validity and discriminant validity of constructs and measurement indicators are carefully evaluated and the results show that reliability and validity are confirmed according to predefined statistical criteria. Goodness-of-fit of our research model is also supported. In summary, the results collectively suggest good measurement properties for the proposed research model. The research outcomes are as follows. First, information has positive impact on advertising value while entertainment and irritation have no significant impact. Information, entertainment and irritation together account for 38.8% of advertising value. Second, along with the change in advertising market due to the advent of smartphone, activity, timing and location have positive impact on context awareness value while identification has no significant impact. In addition, identification, activity, location and time together account for 46.3% of context awareness value. Third, advertising value and context awareness value both positively influence smartphone advertising attitude, and these two constructs explain 31.7% of the variability of smartphone advertising attitude. The theoretical implication of our research is as follows. First, the influence of entertainment and irritation is reduced which are known to be crucial factors according to previous studies related to advertising value, while the influence of information is increased. It indicates that smartphone users are not likely interested in entertaining effect of smartphone advertisement, and are insensitive to the inconvenience due to smartphone advertisement. Second, in today' ubiquitous computing environment, it is effective to provide differentiated advertising service by utilizing smartphone users'context awareness values such as identification, activity, timing and location in order to achieve competitive business advantage in advertising market. For practical implications, enterprises should provide valuable and useful information that might attract smartphone users by adopting differentiation strategy as smartphone users are sensitive to the information provided via smartphone. Also enterprises not only provide useful information but also recognize and utilize smarphone users' unique characteristics and behaviors by increasing context awareness values. In summary, our result implies that smartphone advertisement should be optimized by considering the needed information of smartphone users in order to maximize advertisement effect.

Hightechnology industrial development and formation of new industrial district : Theory and empirical cases (첨단산업발전과 신산업지구 형성 : 이론과 사례)

  • ;Park, Sam Ock
    • Journal of the Korean Geographical Society
    • /
    • v.29 no.2
    • /
    • pp.117-136
    • /
    • 1994
  • Contemporary global space economy is so dynamic that any one specific structural force can not explain the whole dynamic processes or trajectories of spatial industrial development. The major purpose of this paper is extending the traditional notion of industrial districts to functioning and development of new industrial districts with relation to the development of high technology industries. Several dynamic forces, which are dominated in new industrial districts in the modern space economy, are incorporated in the formation and dynamic aspects of new industrial districts. Even though key forces governing Marshallian industrial district are localization of small firms, division of labor between firms, constructive cooperation, and industrial atmosphere, Marshall points out a possibility of growing importance of large firms and non-local networks in the districts with changes of external environments. Some of Italian industrial districts can be regarded as Marshallian industrial districts in broader context, but the role of local authorities or institutions and local embeddedness seem to be more important in the Italian industrial districts. More critical implication form the review of Marshallian industrial districts and Italian industrial districts is that the industrial districts are not a static concept but a dynamic one: small firm based industrial districts can be regarded as only a specific feature evolved over time. Dynamic aspects of new industrial districts are resulting from coexistence of contrasting forces governing the functioning and formation of the districts in contemporary global space economy. The contrasting forces governing new industrial districts are coexistence of flexible and mass production systems, local and global networks, local and non-local embeddedness, and small and large firms. Because of these coexistence of contrasting forces, there are various types of new industrial districts. Nine types of industrial districts are identified based on local/non-local networks and intensity of networks in both suppliers and customers linkages. The different types of new industrial districts are described by differences in production systems, embeddedness, governance, cooperation and competition, and institutional factors. Out of nine types of industrial districts, four types - Marshallian; suppliers hub and spoke; customers hub and spoke; and satellite - are regarded as distinctive new industrial districts and four additional types - advanced hub and spoke types (suppliers and customers) and mature satellites (suppliers and customers) - can be evolved from the distinctive types and may be regarded as hybrid types. The last one - pioneering high technology industrial district - can be developed from the advanced hub and spoke types and this type is a most advanced modern industrial district in the era of globalization and high technology. The dynamic aspects of the districts are related with the coexistence of the contrasting forces in the contemporary global space economy. However, the development trajectory is not a natural one and not all the industrial districts can develop to the other hybrid types. Traditionally, localization of industries was developed by historical chances. In the process of high technology industrial development in contemporary global space economy, however, policy and strategies are critical for the formation and evolution of new industrial districts. It needs formation of supportive tissues of institutions for evolution of dyamic pattern of high technology related new industrial districts. Some of the original distinctive types of new industrial districts can not follow the path or trajectory suggested in this paper and may be declined without advancing, if there is no formation of supportive social structure or policy. Provision of information infrastructure and diffusion of an entrepreneurship through the positive supports of local government, public institutions, universities, trade associations and industry associations are important for the evolution of the dynamic new industrial districts. Reduction of sunk costs through the supports for training and retraining of skilled labor, the formation of flexible labor markets, and the establishment of cheap and available telecommunication networks is also regarded as a significant strategies for dynamic progress of new industrial districts in the era of high technology industrial development. In addition, development of intensive international networks in production, technology and information is important policy issue for formation and evolution of the new industrial districts which are related with high technology industrial development.

  • PDF

A Study on the Effect of Using Sentiment Lexicon in Opinion Classification (오피니언 분류의 감성사전 활용효과에 대한 연구)

  • Kim, Seungwoo;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.1
    • /
    • pp.133-148
    • /
    • 2014
  • Recently, with the advent of various information channels, the number of has continued to grow. The main cause of this phenomenon can be found in the significant increase of unstructured data, as the use of smart devices enables users to create data in the form of text, audio, images, and video. In various types of unstructured data, the user's opinion and a variety of information is clearly expressed in text data such as news, reports, papers, and various articles. Thus, active attempts have been made to create new value by analyzing these texts. The representative techniques used in text analysis are text mining and opinion mining. These share certain important characteristics; for example, they not only use text documents as input data, but also use many natural language processing techniques such as filtering and parsing. Therefore, opinion mining is usually recognized as a sub-concept of text mining, or, in many cases, the two terms are used interchangeably in the literature. Suppose that the purpose of a certain classification analysis is to predict a positive or negative opinion contained in some documents. If we focus on the classification process, the analysis can be regarded as a traditional text mining case. However, if we observe that the target of the analysis is a positive or negative opinion, the analysis can be regarded as a typical example of opinion mining. In other words, two methods (i.e., text mining and opinion mining) are available for opinion classification. Thus, in order to distinguish between the two, a precise definition of each method is needed. In this paper, we found that it is very difficult to distinguish between the two methods clearly with respect to the purpose of analysis and the type of results. We conclude that the most definitive criterion to distinguish text mining from opinion mining is whether an analysis utilizes any kind of sentiment lexicon. We first established two prediction models, one based on opinion mining and the other on text mining. Next, we compared the main processes used by the two prediction models. Finally, we compared their prediction accuracy. We then analyzed 2,000 movie reviews. The results revealed that the prediction model based on opinion mining showed higher average prediction accuracy compared to the text mining model. Moreover, in the lift chart generated by the opinion mining based model, the prediction accuracy for the documents with strong certainty was higher than that for the documents with weak certainty. Most of all, opinion mining has a meaningful advantage in that it can reduce learning time dramatically, because a sentiment lexicon generated once can be reused in a similar application domain. Additionally, the classification results can be clearly explained by using a sentiment lexicon. This study has two limitations. First, the results of the experiments cannot be generalized, mainly because the experiment is limited to a small number of movie reviews. Additionally, various parameters in the parsing and filtering steps of the text mining may have affected the accuracy of the prediction models. However, this research contributes a performance and comparison of text mining analysis and opinion mining analysis for opinion classification. In future research, a more precise evaluation of the two methods should be made through intensive experiments.

Determination of Petroleum Aromatic Hydrocarbons in Seawater Using Headspace Solid-Phase Microextraction Coupled to Gas Chromatography/Mass Spectrometry (HS-SPME-GC/MS를 이용한 해수 내 유류계 방향족탄화수소 분석법)

  • An, Joon Geon;Shim, Won Joon;Ha, Sung Yong;Yim, Un Hyuk
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.17 no.1
    • /
    • pp.27-35
    • /
    • 2014
  • The headspace solid-phase microextraction (HS-SPME) followed by gas chromatography/mass spectrometry procedure has been developed for the simultaneous determination of petroleum aromatic hydrocarbons such as benzene, toluene, ethylbenzene and xylene isomers (BTEX) and polycyclic aromatic hydrocarbons (PAHs) in seawater. The advantages of SPME compared to traditional methods of sample preparation are ease of operation, reuse of fiber, portable system, minimal contamination and loss of the sample during transport and storage. SPME fiber, extraction time, temperature, stirring speed, and GC desorption time were key extraction parameters considered in this study. Among three kinds of SPME fibers, i.e., PDMS ($100{\mu}m$), CAR/PDMS ($75{\mu}m$), and PDMS/DVB ($65{\mu}m$), a $65{\mu}m$ PDMS/DVB fiber showed the most optimal extraction efficiencies covering molecular weight ranging from 78 to 202. Other extraction parameters were set up using $65{\mu}m$ PDMS/DVB. The final optimized extraction conditions were extraction time (60 min), extraction temperature (50), stirring speed (750 rpm) and GC desorption time (3 min). When applied to artificially contaminated seawater like water accommodated fraction, our optimized HS-SPME-GC/MS showed comparable performances with other conventional method. The proposed protocol can be an attractive alternative to analysis of BTEX and PAHs in seawater.

Korean Word Sense Disambiguation using Dictionary and Corpus (사전과 말뭉치를 이용한 한국어 단어 중의성 해소)

  • Jeong, Hanjo;Park, Byeonghwa
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.1-13
    • /
    • 2015
  • As opinion mining in big data applications has been highlighted, a lot of research on unstructured data has made. Lots of social media on the Internet generate unstructured or semi-structured data every second and they are often made by natural or human languages we use in daily life. Many words in human languages have multiple meanings or senses. In this result, it is very difficult for computers to extract useful information from these datasets. Traditional web search engines are usually based on keyword search, resulting in incorrect search results which are far from users' intentions. Even though a lot of progress in enhancing the performance of search engines has made over the last years in order to provide users with appropriate results, there is still so much to improve it. Word sense disambiguation can play a very important role in dealing with natural language processing and is considered as one of the most difficult problems in this area. Major approaches to word sense disambiguation can be classified as knowledge-base, supervised corpus-based, and unsupervised corpus-based approaches. This paper presents a method which automatically generates a corpus for word sense disambiguation by taking advantage of examples in existing dictionaries and avoids expensive sense tagging processes. It experiments the effectiveness of the method based on Naïve Bayes Model, which is one of supervised learning algorithms, by using Korean standard unabridged dictionary and Sejong Corpus. Korean standard unabridged dictionary has approximately 57,000 sentences. Sejong Corpus has about 790,000 sentences tagged with part-of-speech and senses all together. For the experiment of this study, Korean standard unabridged dictionary and Sejong Corpus were experimented as a combination and separate entities using cross validation. Only nouns, target subjects in word sense disambiguation, were selected. 93,522 word senses among 265,655 nouns and 56,914 sentences from related proverbs and examples were additionally combined in the corpus. Sejong Corpus was easily merged with Korean standard unabridged dictionary because Sejong Corpus was tagged based on sense indices defined by Korean standard unabridged dictionary. Sense vectors were formed after the merged corpus was created. Terms used in creating sense vectors were added in the named entity dictionary of Korean morphological analyzer. By using the extended named entity dictionary, term vectors were extracted from the input sentences and then term vectors for the sentences were created. Given the extracted term vector and the sense vector model made during the pre-processing stage, the sense-tagged terms were determined by the vector space model based word sense disambiguation. In addition, this study shows the effectiveness of merged corpus from examples in Korean standard unabridged dictionary and Sejong Corpus. The experiment shows the better results in precision and recall are found with the merged corpus. This study suggests it can practically enhance the performance of internet search engines and help us to understand more accurate meaning of a sentence in natural language processing pertinent to search engines, opinion mining, and text mining. Naïve Bayes classifier used in this study represents a supervised learning algorithm and uses Bayes theorem. Naïve Bayes classifier has an assumption that all senses are independent. Even though the assumption of Naïve Bayes classifier is not realistic and ignores the correlation between attributes, Naïve Bayes classifier is widely used because of its simplicity and in practice it is known to be very effective in many applications such as text classification and medical diagnosis. However, further research need to be carried out to consider all possible combinations and/or partial combinations of all senses in a sentence. Also, the effectiveness of word sense disambiguation may be improved if rhetorical structures or morphological dependencies between words are analyzed through syntactic analysis.