Search | Korea Science

Improving Hypertext Classification Systems through WordNet-based Feature Abstraction (워드넷 기반 특징 추상화를 통한 웹문서 자동분류시스템의 성능향상)

Roh, Jun-Ho;Kim, Han-Joon;Chang, Jae-Young
- The Journal of Society for e-Business Studies / v.18 no.2 / pp.95-110 / 2013
This paper presents a novel feature engineering technique that can improve conventional machine learning-based text classification systems. The proposed method extends the initial feature set by using hyperlink relationships in order to categorize hypertext web documents effectively. Web documents are connected to each other through hyperlinks, and in many cases hyperlinks exist among highly related documents. Such hyperlink relationships can be used to enhance the quality of the features that make up classification models. The basic idea of the proposed method is to generate a kind of abstracted concept feature consisting of a few raw feature words; for this, the method computes the semantic similarity between a target document and its neighbor documents by utilizing hierarchical relationships in the WordNet ontology. In developing classification models, the abstracted concept features are treated on an equal footing with other raw features, and they can play a significant role in developing more accurate classification models. Through extensive experiments with the Web-KB test collection, we show that the proposed methods outperform the conventional ones.
https://doi.org/10.7838/jsebs.2013.18.2.095
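
The abstraction idea in the abstract above can be illustrated with a minimal sketch. It assumes a tiny hand-made hypernym table in place of the real WordNet hierarchy; the table, the function names, the `CONCEPT:` prefix, and the `max_path` threshold are all hypothetical and are not taken from the paper.

```python
from collections import Counter

# Hypothetical toy hypernym table standing in for WordNet (child -> parent).
HYPERNYM = {
    "professor": "educator",
    "lecturer": "educator",
    "educator": "person",
    "student": "person",
    "person": "entity",
    "course": "activity",
    "seminar": "activity",
    "activity": "entity",
}

def ancestors(word):
    """Return the chain [word, parent, grandparent, ...] up to the root."""
    chain = [word]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain

def common_concept(a, b):
    """Lowest shared hypernym of a and b and the path length in edges, or (None, None)."""
    chain_a = ancestors(a)
    chain_b = ancestors(b)
    for dist_a, concept in enumerate(chain_a):
        if concept in chain_b:
            return concept, dist_a + chain_b.index(concept)
    return None, None

def abstract_features(target_words, neighbor_words, max_path=2):
    """Raw word counts plus one concept feature per close (target, neighbor) word pair."""
    features = Counter(target_words)
    for t in target_words:
        for n in neighbor_words:
            if t == n:
                continue
            concept, path = common_concept(t, n)
            if concept is not None and path <= max_path:
                features["CONCEPT:" + concept] += 1
    return features

# Target document words and words from a hyperlinked neighbor document (made up).
feats = abstract_features(["professor", "course"], ["lecturer", "seminar", "student"])
print(feats)
```

Here the concept features sit in the same counter as the raw words, which mirrors the abstract's statement that abstracted features are handled like any other feature.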

Robust Control of a 6-Link Electro-Hydraulic Manipulator using Parallel Feedforward Compensator (PFC보상기를 응용한 6축 전기 유압매니퓰레이터의 강인 제어)

안경관;정연오
- Journal of the Korean Society for Precision Engineering / v.20 no.3 / pp.89-96 / 2003
An electro-hydraulic manipulator using hydraulic actuators has many nonlinear elements, and its parameter fluctuations are greater than those of an electrically driven manipulator. It is therefore relatively difficult to realize trajectory control that is both stable and accurate for autonomous assembly tasks using hydraulic manipulators. In this report, we propose a two-degree-of-freedom control scheme that includes a parallel feedforward compensator (PFC), where the PFC plays a very important role in the stability of the proposed control system. Experimental results on the 6-link electro-hydraulic manipulator verify that stability and model-matching performance are improved by the proposed control method.

News Big Data Analysis of Media Companies related to Lifelong Education for the Disabled (장애인 평생교육 관련 언론사 뉴스 빅데이터 분석)

Kwon, Choong-Hoon
- Proceedings of the Korean Society of Computer Information Conference / 2022.01a / pp.183-184 / 2022
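This study analyzed news big data from media companies related to lifelong education for the disabled, using the BIGKinds system of the Korea Press Foundation. We extracted news articles related to 'lifelong education for the disabled' reported by a total of 54 media companies over the 20 years from January 1, 2000 to December 31, 2020. On this news big data, we carried out keyword trend analysis, construction of a language network map, and related-word analysis (presented as word clouds). The results of this study are expected to serve as basic data for policy-making research and empirical research (e.g., factors and effects of participation in lifelong education) related to lifelong education for the disabled.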

Personalized Web Search using Query based User Profile (질의기반 사용자 프로파일을 이용하는 개인화 웹 검색)

Yoon, Sung Hee
- Journal of the Korea Academia-Industrial cooperation Society / v.17 no.2 / pp.690-696 / 2016
Search engines that rely on morphological matching between the user query and web document content do not reflect individual interests. This research proposes a personalized web search scheme that returns results reflecting the user's query intent and personal preferences. The performance of personalized search depends on an effective user profiling strategy that accurately captures the user's personal interests. In this study, the user profiles are databases of topic words and customized weights based on recent user queries and the frequency of topic words in the click history. To determine the precise meaning of ambiguous queries and topic words, the strategy uses WordNet to calculate semantic relatedness to words in the user profile. The experiments were conducted by installing query expansion and re-ranking modules on general web search systems. The results showed 92% precision and 82% recall in the top 10 search results, indicating enhanced performance.
https://doi.org/10.5762/KAIS.2016.17.2.690
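
The profile-based re-ranking described above can be sketched roughly as follows. The weighting constants, function names, and sample data are hypothetical assumptions for illustration; the WordNet-based disambiguation step from the abstract is not modeled here.

```python
from collections import Counter

def build_profile(recent_queries, clicked_titles, query_weight=2.0, click_weight=1.0):
    """Weight each topic word by its occurrences in recent queries and clicked titles."""
    profile = Counter()
    for query in recent_queries:
        for word in query.lower().split():
            profile[word] += query_weight
    for title in clicked_titles:
        for word in title.lower().split():
            profile[word] += click_weight
    return profile

def rerank(results, profile):
    """Sort result titles by summed profile weight; ties keep the engine's order."""
    def score(title):
        return sum(profile.get(word, 0.0) for word in set(title.lower().split()))
    # sorted() is stable, so equally scored results stay in their original order.
    return sorted(results, key=score, reverse=True)

# Made-up user history and search results for the ambiguous query "jaguar".
profile = build_profile(
    recent_queries=["jaguar speed", "big cats habitat"],
    clicked_titles=["jaguar habitat in the amazon"],
)
ranked = rerank(
    ["jaguar car dealership", "jaguar habitat and diet", "jaguar xf review"],
    profile,
)
print(ranked)
```

Recent queries are weighted more heavily than clicks in this sketch only to show that the two signals can be combined; the paper's actual weighting is not given in the abstract.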

An Intelligent Marking System based on Semantic Kernel and Korean WordNet (의미커널과 한글 워드넷에 기반한 지능형 채점 시스템)

Cho Woojin;Oh Jungseok;Lee Jaeyoung;Kim Yu-Seop
- The KIPS Transactions:PartA / v.12A no.6 s.96 / pp.539-546 / 2005
Recently, as the number of Internet users has grown explosively, e-learning has become widespread, as has remote evaluation of intellectual capacity. However, only multiple-choice and/or objective tests have been applied in e-learning because of the difficulty of natural language processing. For fast and fair intelligent marking of short-essay answer papers, this work utilizes heterogeneous linguistic knowledge. First, we construct the semantic kernel from an untagged corpus. Then the answer papers of students and instructors are transformed into vector form. Finally, we evaluate the similarity between the papers by using the semantic kernel and decide whether an answer paper is correct or not based on the similarity values. For the construction of the semantic kernel, we used latent semantic analysis based on the vector space model. Further, we try to reduce the problem of information shortage by integrating Korean WordNet. For the construction of the semantic kernel, we collected 38,727 newspaper articles and extracted 75,175 index terms. In the experiment, a correlation coefficient of about 0.894 was obtained between the marking results of this system and those of human instructors.
https://doi.org/10.3745/KIPSTA.2005.12A.6.539
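
The kernel-based comparison of answer vectors described above can be sketched as a cosine-style similarity, $\mathrm{sim}(x, y) = x^{\top} K y / \sqrt{(x^{\top} K x)(y^{\top} K y)}$. The 3-term vocabulary and the kernel matrix `K` below are hypothetical; in the paper the kernel is derived from latent semantic analysis, which this sketch does not reproduce.

```python
import math

def kernel_inner(x, K, y):
    """Compute x^T K y for plain Python lists."""
    return sum(x[i] * K[i][j] * y[j] for i in range(len(x)) for j in range(len(y)))

def kernel_similarity(x, y, K):
    """Cosine-style similarity under kernel K; 0.0 if either vector has zero norm."""
    denom = math.sqrt(kernel_inner(x, K, x) * kernel_inner(y, K, y))
    return kernel_inner(x, K, y) / denom if denom > 0 else 0.0

# Hypothetical 3-term vocabulary: ["car", "automobile", "banana"].
# The off-diagonal 0.8 encodes that "car" and "automobile" are semantically close.
K = [
    [1.0, 0.8, 0.0],
    [0.8, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
model_answer = [1, 0, 0]       # instructor wrote "car"
student_synonym = [0, 1, 0]    # student wrote "automobile"
student_off_topic = [0, 0, 1]  # student wrote "banana"

sim_synonym = kernel_similarity(model_answer, student_synonym, K)
sim_off_topic = kernel_similarity(model_answer, student_off_topic, K)
print(sim_synonym, sim_off_topic)
```

With an identity kernel the two one-word answers would have similarity 0; the off-diagonal entry is what lets a synonym count as a near match, which is the role the abstract assigns to the semantic kernel.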

Vector Analysis on the Quick Torque Control of Induction Motors (유도전동기의 토크 속응제어법에 관한 벡터적해석)

Jeong, Seok-Kwon;Yang, Joo-Ho
- Journal of the Korean Society of Fisheries and Ocean Technology / v.31 no.4 / pp.393-401 / 1995
In this paper, a vector analysis of a novel voltage-controlled quick torque control method for induction motors (I.M.) is conducted. In conventional quick torque control methods, it was very difficult to obtain a step response of torque when the primary voltage was selected as the control input of the induction motor. To solve this problem, a new control method was developed using a new concept of pulse addition, which can realize a stepwise torque response with a specified settling time of $\Delta$. The new method was successfully confirmed through DSP (Digital Signal Processor) system-based experiments. However, the control mechanism was somewhat difficult to understand intuitively. The purpose of this paper is to provide a better understanding of the quick torque control mechanism using vector analysis.

A Low Density Parity Check Coding using the Weighted Bit-flipping Method (가중치가 부과된 Bit-flipping 기법을 이용한 LDPC 코딩)

Joh, Kyung-Hyun;Ra, Keuk-Hwan
- 전자공학회논문지 IE / v.43 no.4 / pp.115-121 / 2006
In this paper, we propose a method for data error checking and correction in channel transmission in a communication system. LDPC codes are used to minimize channel errors, with the VDSL system channel modeled as an AWGN channel. Because LDPC codes use low-density parity bits, the mathematical complexity is low and the related processing time is shortened. In addition, with iterative decoding, LDPC codes perform better than turbo codes for long code words. To correct errors better than conventional algorithms, the proposed algorithm assigns weights to errors concerning parity bits. The proposed weighted bit-flipping algorithm outperforms the conventional bit-flipping algorithm, and we observe a gain improvement of 1 dB.
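
The abstract does not spell out how the weights are assigned, so the following is only a sketch of one common reliability-weighted bit-flipping formulation, not necessarily the authors' algorithm. It assumes BPSK mapping (bit 0 to +1, bit 1 to -1), and the small Hamming-style parity-check matrix is a hypothetical stand-in for a real low-density matrix.

```python
def weighted_bit_flip(H, y, max_iters=20):
    """Decode soft values y (BPSK: bit 0 -> +1, bit 1 -> -1) against parity-check matrix H."""
    n_checks, n_bits = len(H), len(y)
    z = [1 if v < 0 else 0 for v in y]  # hard decisions
    # Reliability of each check: smallest |y| among the bits it involves.
    w = [min(abs(y[n]) for n in range(n_bits) if H[m][n]) for m in range(n_checks)]
    for _ in range(max_iters):
        # Syndrome: 1 marks a failed parity check.
        s = [sum(H[m][n] * z[n] for n in range(n_bits)) % 2 for m in range(n_checks)]
        if not any(s):
            break  # all parity checks satisfied
        # Flip metric: failed checks push a bit toward flipping, satisfied checks hold it.
        E = [sum((2 * s[m] - 1) * w[m] for m in range(n_checks) if H[m][n])
             for n in range(n_bits)]
        worst = max(range(n_bits), key=lambda n: E[n])
        z[worst] ^= 1
    return z

# Hypothetical 3x7 parity-check matrix (Hamming-style, used only for illustration).
H = [
    [1, 0, 1, 0, 1, 0, 1],
    [0, 1, 1, 0, 0, 1, 1],
    [0, 0, 0, 1, 1, 1, 1],
]
# All-zero codeword sent; noise has pushed the third sample slightly negative.
received = [0.9, 1.1, -0.2, 0.8, 1.0, 1.2, 0.7]
decoded = weighted_bit_flip(H, received)
print(decoded)
```

Unweighted bit flipping would count failed checks only; the weights let a low-magnitude (unreliable) sample be flipped in preference to a confident one that shares the same failed checks.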

A Model of Natural Language Information Retrieval Using Main Keywords and Sub-keywords (주 키워드와 부 키워드를 이용한 자연언어 정보 검색 모델)

Kang, Hyun-Kyu;Park, Se-Young
- The Transactions of the Korea Information Processing Society / v.4 no.12 / pp.3052-3062 / 1997
Information retrieval (IR) aims to retrieve relevant information that satisfies a user's information needs. However, a major role of IR systems is not just to generate sets of relevant documents but to help determine which documents are most likely to be relevant to the given requirements. Various attempts have been made in the recent past to use syntactic analysis methods to generate the complex constructions that are essential for content identification in various automatic text analysis systems. Unfortunately, it is known that methods based on syntactic understanding alone are not sufficiently powerful to produce complete analyses of arbitrary text samples. In this paper, we present a document ranking method based on two-level ranking. The first level is used to retrieve the documents, and the second level to reorder the retrieved documents. The main keywords used in the first level are defined as nouns and/or compound nouns that possess good document discrimination power. The sub-keywords used in the second level are defined as adjectives, adverbs, and/or verbs that are not main keywords, together with function words. An empirical study was conducted on a Korean encyclopedia with 23,113 entries and 161 Korean natural language queries collected from end users. 85.0% of the natural language queries contained sub-keywords. The two-level document ranking method provides a significant improvement in retrieval effectiveness over traditional ranking methods.
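
The two-level scheme above can be sketched as a retrieve-then-reorder pipeline. The hit-count scoring, the function name, and the sample documents are hypothetical; the abstract does not state the actual scoring functions, and part-of-speech tagging to separate main keywords from sub-keywords is assumed to have been done already.

```python
def two_level_rank(docs, main_keywords, sub_keywords):
    """docs: dict of doc_id -> text. Retrieve by main keywords, then reorder by sub-keywords."""
    main = {w.lower() for w in main_keywords}
    sub = {w.lower() for w in sub_keywords}
    scored = []
    for doc_id, text in docs.items():
        tokens = text.lower().split()
        main_hits = sum(1 for t in tokens if t in main)
        if main_hits == 0:
            continue  # level 1: documents without any main keyword are not retrieved
        sub_hits = sum(1 for t in tokens if t in sub)
        scored.append((sub_hits, main_hits, doc_id))
    # Level 2: reorder retrieved documents by sub-keyword hits, main hits break ties.
    scored.sort(key=lambda item: (item[0], item[1]), reverse=True)
    return [doc_id for _, _, doc_id in scored]

# Made-up collection and a query split into nouns (main) and verbs/adverbs (sub).
docs = {
    "d1": "the tiger hunts quickly at night",
    "d2": "tiger and cat facts the tiger is a big cat",
    "d3": "bananas ripen quickly",
}
ranking = two_level_rank(docs, main_keywords=["tiger", "cat"], sub_keywords=["hunts", "quickly"])
print(ranking)
```

In this toy run `d2` has more main-keyword hits, but `d1` moves ahead at the second level because it also matches the sub-keywords, while `d3` is never retrieved despite containing a sub-keyword.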

WellnessWordNet: A Word Net for Unconstrained Subjective Well-Being Monitoring Based on Unstructured Data and Contextual Polarity (웰니스워드넷: 비정형데이터와 상황적 긍부정성에 기반하여 주관적 웰빙 상태를 무구속적으로 모니터링하기 위한 워드넷 개발)

Song, Yeongeun;Nam, Suhyun;Kwon, Ohbyung
- Journal of Intelligence and Information Systems / v.22 no.3 / pp.1-21 / 2016
IT-based subjective well-being (SWB) services, a main part of wellness IT, should measure the SWB state of individuals in an unrestrained, cost-effective manner. The dictionaries for sentiment analysis available in the market may be useful for this purpose, but obtaining proper sentiment values using only words from the sentiment lexicon is impossible; therefore, a new dictionary including wellness vocabulary is needed. The existing sentiment dictionaries link only a single sentiment value to a single sentiment word, although sentiment values may vary depending on personal traits. In this study, we develop an extended version of the SenticNet sentiment dictionary dubbed WellnessWordNet. SenticNet is considered the best and most expressive among the already existing sentiment dictionaries. Using the information provided by SenticNet, we created a database including the wellness states (estimated values) of stress, depression, and anger to develop the WellnessWordNet system. The accuracy of the system was validated through actual tests with live subjects. This study is unique and unprecedented in that i) an extended sentiment dictionary, WellnessWordNet, is developed; ii) values for wellness state language are offered; and iii) different sentiment values, namely contextual polarity, for people of the same gender or age group are suggested.
https://doi.org/10.13088/jiis.2016.22.3.001

Color-related Query Processing for Intelligent E-Commerce Search (지능형 검색엔진을 위한 색상 질의 처리 방안)

Hong, Jung A;Koo, Kyo Jung;Cha, Ji Won;Seo, Ah Jeong;Yeo, Un Yeong;Kim, Jong Woo
- Journal of Intelligence and Information Systems / v.25 no.1 / pp.109-125 / 2019
As interest in intelligent search engines increases, various studies have been conducted to intelligently extract and utilize product-related features. In particular, when users search for goods in e-commerce search engines, the 'color' of a product is an important feature that describes the product. Therefore, it is necessary to deal with the synonyms of color terms in order to produce accurate results for users' color-related queries. Previous studies have suggested a dictionary-based approach to processing synonyms for color features. However, the dictionary-based approach has the limitation that it cannot handle unregistered color-related terms in user queries. To overcome this limitation of the conventional methods, this research proposes a model that extracts RGB values from an internet search engine in real time and outputs similar color names based on the designated color information. First, a color term dictionary was constructed for basic color search; it includes color names and the R, G, B values of each color from the Korean color standard digital palette program and the Wikipedia color list. The dictionary was made more robust by adding 138 color names, converted from English color names into Korean loanwords, with their corresponding RGB values. As a result, the final color dictionary includes a total of 671 color names and corresponding RGB values. The proposed method starts from the specific color that a user searched for. Then, the presence of the searched color in the built-in color dictionary is checked. If the color exists in the dictionary, its RGB values in the dictionary are used as the reference values of the searched color. If the searched color does not exist in the dictionary, the top-5 Google image search results for the searched color are crawled, and average RGB values are extracted from a certain middle area of each image. To extract the RGB values from images, a variety of approaches were attempted, since simply averaging the RGB values of the center area of the images has limits. As a result, clustering the RGB values in a certain area of the image and taking the average value of the cluster with the highest density as the reference values showed the best performance. Based on the reference RGB values of the searched color, the RGB values of all the colors in the previously constructed color dictionary are compared. Then a color list is created with colors within the range of ${\pm}50$ for each of the R, G, and B values. Finally, using the Euclidean distance between the above results and the reference RGB values of the searched color, the color with the highest similarity among up to five colors becomes the final outcome. To evaluate the usefulness of the proposed method, we performed an experiment in which 300 color names and their corresponding RGB values were obtained through questionnaires. They were used to compare the RGB values obtained from four different methods, including the proposed method. The average Euclidean distance in CIE-Lab using our method was about 13.85, a relatively low distance compared with 30.88 for the case using the synonym dictionary only and 30.38 for the case using the dictionary with the Korean synonym website WordNet. The variant of the proposed method without clustering showed an average Euclidean distance of 13.88, which implies that the DBSCAN clustering of the proposed method can reduce the Euclidean distance. This research suggests a new color synonym processing method based on RGB values that combines the dictionary method with a real-time synonym processing method for new color names. This method makes it possible to overcome the limit of the dictionary-based approach, which is a conventional synonym processing method. This research can contribute to improving the intelligence of e-commerce search systems, especially their color search feature.
https://doi.org/10.13088/jiis.2019.25.1.109
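
The final matching step described in the abstract (a ±50 window on each channel, then ranking by Euclidean distance and keeping up to five colors) can be sketched as below. The four-entry dictionary, the function name, and the reference value are hypothetical; the image crawling and DBSCAN clustering that produce the reference RGB values are not modeled.

```python
import math

def similar_colors(reference, dictionary, window=50, top_k=5):
    """Return up to top_k (name, distance) pairs with each of R, G, B within ±window."""
    candidates = []
    for name, rgb in dictionary.items():
        if all(abs(c - r) <= window for c, r in zip(rgb, reference)):
            candidates.append((name, math.dist(rgb, reference)))
    candidates.sort(key=lambda item: item[1])  # nearest color first
    return candidates[:top_k]

# Hypothetical miniature color dictionary (the paper's dictionary has 671 entries).
COLOR_DICT = {
    "red": (255, 0, 0),
    "crimson": (220, 20, 60),
    "tomato": (255, 99, 71),
    "navy": (0, 0, 128),
}
# Assumed reference RGB for an unregistered color term.
matches = similar_colors((230, 30, 50), COLOR_DICT)
names = [name for name, _ in matches]
print(matches)
```

The per-channel window acts as a cheap prefilter: "tomato" is dropped because its green channel differs by more than 50, before any distance is computed.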

Search Result 44, Processing Time 0.03 seconds
