• Title/Summary/Keyword: 주석기반 (annotation-based)


A Design of XML-Based Distributed MDR Retrieval System for Data Preparation (데이터준비를 위한 XML 기반의 분산 MDR 검색 시스템 설계)

  • Ko Sucbum;Youn Sungdae
    • Journal of Korea Multimedia Society / v.7 no.9 / pp.1329-1338 / 2004
  • The purpose of data mining is to extract multi-dimensional information from a large database. However, the only descriptive information we can obtain from a large database is the column name, data type, or simple comments attached to the columns of its tables. With such unstructured and scarce information, it is very difficult and time-consuming to collect and cleanse data by analyzing the purpose, characteristics and schema of each column during the data preparation step. To solve this problem, this paper proposes ways to reduce the time spent on the data preparation step in a relational database environment. Specifically, we propose useful elements to be considered during the data preparation step and organize these elements into an MDR (Metadata Registry), which is becoming the international standard ISO/IEC 11179. Finally, we propose an XML-based distributed MDR retrieval system that is convertible across heterogeneous systems and heterogeneous DBMSs.


Performance and Limitations of a Korean Sentiment Lexicon Built on the English SentiWordNet (영어 SentiWordNet을 이용하여 구축한 한국어 감성어휘사전의 성능 평가와 한계 연구)

  • Shin, Donghyok;Kim, Sairom;Cho, Donghee;Nguyen, Minh Dieu;Park, Soongang;Eo, Keonjoo;Nam, Jeesun
    • Annual Conference on Human and Language Technology (한국어정보학회:학술대회논문집) / 2016.10a / pp.189-194 / 2016
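  • This study, carried out as part of the MUSE project for building multilingual sentiment lexicons and sentiment-annotated corpora, examines the merits and limitations of building a Korean sentiment lexicon from SentiWordNet, a representative English sentiment lexicon. First, from the 117,659 entries of the English SentiWordNet, we extracted those with a positive or negative score of 0.5 or higher and translated them automatically with Google Translate. After removing entries that were left untranslated or were duplicated, and having linguistics experts classify the remainder by hand, we obtained 3,665 sentiment words. Even this set, however, contained a considerable number of disease names and other items that are hard to regard as pure sentiment words. When the lexicon was applied to corpora to identify sentiment words automatically, recall for positive and negative words was only 47.4% and 37.7% on a restaurant corpus, and 55.2% and 32.4% on an IT corpus. The F-measure was also low: 62.3% and 38.5% for positive and negative words on the restaurant corpus, and 65.5% and 44.6% on the IT corpus. A SentiWordNet-based lexicon therefore proved insufficient to serve as a sentiment lexicon. On this basis, we argue that building a Korean sentiment lexicon requires a systematic approach that takes the linguistic properties of Korean into account, and we introduce the SELEX sentiment lexicon, which is currently being supplemented and extended on the basis of the Korean electronic dictionary DECO.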


A UCP-based Model to Estimate the Software Development Cost (소프트웨어 개발 비용을 추정하기 위한 사용사례 점수 기반 모델)

  • Park, Ju-Seok;Chong, Ki-Won
    • The KIPS Transactions:PartD / v.11D no.1 / pp.163-172 / 2004
  • For software development projects that apply an object-oriented development methodology, research continues on UCP (Use Case Points) as a method for estimating development effort. Existing research proposes a linear model that calculates development effort by multiplying AUCP (Adjusted Use Case Points), which incorporates technical and environmental factors, by a constant. However, statistical models that estimate development effort from AUCP and UUCP (Unadjusted Use Case Points) have not been studied. This paper confirms that a linear regression model is inappropriate, because the development period increases sharply as software size increases. Moreover, applying the TCF (Technical Complexity Factor) and EF (Environmental Factor) during the UCP calculation can introduce errors in FP. This paper presents a non-linear regression model that does not consider the TCF and EF and that estimates development effort directly from UUCP using an exponential function. The exponential function was selected from among linear, logarithmic, polynomial, power, and exponential models through statistical evaluation of each.
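
The idea of estimating effort directly from UUCP with an exponential function can be illustrated with a minimal sketch. It assumes a model of the form effort = a * exp(b * UUCP) fitted by least squares on the logarithm of effort; the abstract states neither the exact functional form nor the fitting procedure, so both are assumptions. The function names and the data below are hypothetical and are not taken from the paper.

```python
import math

def fit_exponential(uucp, effort):
    """Fit effort = a * exp(b * uucp) by least squares on ln(effort).

    This is an assumed fitting procedure, not the one used in the paper.
    """
    n = len(uucp)
    if n < 2 or n != len(effort):
        raise ValueError("need at least two paired observations")
    logs = [math.log(e) for e in effort]  # effort values must be positive
    mean_x = sum(uucp) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in uucp)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(uucp, logs))
    b = sxy / sxx                      # slope of the log-linear fit
    a = math.exp(mean_y - b * mean_x)  # intercept mapped back from log space
    return a, b

def predict_effort(a, b, uucp):
    """Predicted effort for a project of the given UUCP size."""
    return a * math.exp(b * uucp)

# Hypothetical data generated from a known curve, used only to check the fit.
true_a, true_b = 500.0, 0.004
sizes = [100, 200, 300, 400, 500]
efforts = [true_a * math.exp(true_b * s) for s in sizes]
a, b = fit_exponential(sizes, efforts)
```

Because the data here are synthetic and noise-free, the fit recovers the generating parameters. With real project data the same procedure would yield estimates whose quality has to be judged statistically, as the abstract describes.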

Software Cost Estimation Model Based on Use Case Points by using Regression Model (회귀분석을 이용한 UCP 기반 소프트웨어 개발 노력 추정 모델)

  • Park, Ju-Seok;Yang, Hea-Sool
    • The Journal of the Korea Contents Association / v.9 no.8 / pp.147-157 / 2009
  • Research on UCP as a development effort estimation method for software projects that apply an object-oriented development methodology has continued in recent years. Current research proposes a linear model that estimates development effort by multiplying AUCP, which applies technical and environmental factors, by a constant. However, a non-linear regression model is more appropriate, because the development period increases exponentially as software size increases. In addition, because applying TCF and EF in the UCP calculation introduces FP errors, estimating size with AUCP is unrealistic. This paper addresses these problems of existing UCP-based research by leaving out TCF and EF, and presents models (linear, logarithmic, polynomial, power and exponential) that estimate development effort directly from UUCP. Among the non-linear models, the exponential model gives more accurate results than the current linear model. Therefore, after calculating the UUCP of the software system under development, the proposed model can be used to estimate development effort and hence the direct cost of development.

Modeling User Preference based on Bayesian Networks for Office Event Retrieval (사무실 이벤트 검색을 위한 베이지안 네트워크 기반 사용자 선호도 모델링)

  • Lim, Soo-Jung;Park, Han-Saem;Cho, Sung-Bae
    • Journal of KIISE:Computing Practices and Letters / v.14 no.6 / pp.614-618 / 2008
  • As multimedia data grows rapidly with the development of the Internet, efficient retrieval techniques tailored to individual users, based on analysis of such data, are required. However, the user modeling services provided by recent web sites are limited to text-based page configuration and recommendation retrieval. In this paper, we construct a user preference model with a Bayesian network to apply user modeling to video retrieval, and suggest a method that uses probabilistic reasoning. To do this, context information is defined for a real office environment, and video scripts acquired from installed cameras and manually annotated with the context information are used. The user's personal information, obtained from user input, is entered as evidence into the constructed Bayesian network, and the user's preference is inferred. The probability values produced by Bayesian network reasoning are used for retrieval, so that the system returns results suited to each user's preference. A usability test indicates that satisfaction with the results selected by the proposed model is higher than with a general retrieval method.

The Evaluation of the Thick Polycrystalline HgO and PbO Films Derived by Particle Sedimentation Method for the Mammographic Application (입자침전법을 이용한 다결정 산화수은과 산화납 필름의 방사선 유방촬영 장치 적용성 평가)

  • Noh, Si-Cheol;Park, Ji-Koon;Choi, Il-Hong;Jung, Hyoung-Jin;Kang, Sang-Sik;Jung, Bong-Jae
    • Journal of the Korean Society of Radiology / v.8 no.7 / pp.429-433 / 2014
  • In this study, the morphology and the x-ray quantum efficiency of mercury oxide (HgO) and lead oxide (PbO) sensors derived by the particle sedimentation method are discussed. To this end, we investigated the electrical characteristics and the x-ray quantum efficiency of HgO and PbO films of various thicknesses at mammographic x-ray energies. We developed a particle-in-binder sedimentation method for fabricating large-area polycrystalline films on transparent glass substrates coated with indium tin oxide. We are currently optimizing the growth method to improve the quantum efficiency, with the ultimate goal of obtaining a quantum efficiency close to that of a single crystal. Our future efforts will concentrate on optimizing large-area film growth techniques specifically for deposition on a-Si:H flat panel readout arrays.

Optimal Path Search Algorithm for Urban Applying Received Signal Strength on Satellite Communication Environment (위성통신 환경에서 전파수신감도를 활용한 도심지 최적경로탐색 알고리즘)

  • Park, No-Uk;Kim, Joo-Seok;Lim, Joo-Yoeng;Lim, Tae-Hyuk;Yoo, Chang-Hyun;Kwon, Kun-Sup;Kim, Kyung-Seok
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.12 no.6 / pp.189-197 / 2012
  • In this paper, we propose an optimal path search algorithm that applies the received signal strength between a mobile device and a satellite. Because common path search algorithms are based only on the shortest path, it is difficult for them to provide stable multimedia services to satellite mobile devices. The proposed algorithm provides a stable communication environment for satellite mobile devices based on received signal strength. In satellite communications, radio quality changes severely depending on the receiving environment. Therefore, accurate analysis of the characteristics of the receiving environment is very important for providing stable multimedia services over satellite communications. The causes of radio attenuation are atmospheric attenuation, vegetation attenuation and building attenuation. These factors were applied to analyze the received signal strength. The proposed algorithm can search for the optimal path in urban areas for stable satellite multimedia services.
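
One way to picture a path search that weighs signal quality alongside distance is sketched below. It is not the algorithm from the paper, whose details the abstract does not give. It assumes each road segment carries a length and a total attenuation in dB (atmospheric, vegetation and building losses combined), and it uses a hypothetical cost of length + alpha * attenuation inside a standard Dijkstra search. The graph, the weight `alpha` and the function name are illustrative only.

```python
import heapq

def best_path(graph, source, target, alpha=0.0):
    """Dijkstra search over edges given as (neighbor, length_m, attenuation_db).

    Edge cost = length_m + alpha * attenuation_db (an assumed weighting).
    With alpha = 0 this reduces to plain shortest-path search.
    """
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    visited = set()
    while heap:
        d, node = heapq.heappop(heap)
        if node in visited:
            continue
        visited.add(node)
        if node == target:
            break
        for nxt, length_m, atten_db in graph.get(node, []):
            cost = d + length_m + alpha * atten_db
            if cost < dist.get(nxt, float("inf")):
                dist[nxt] = cost
                prev[nxt] = node
                heapq.heappush(heap, (cost, nxt))
    if target not in dist:
        return None  # target unreachable
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    return path[::-1]

# Hypothetical street graph: the short route via B is heavily attenuated,
# while the longer route via C has a clearer view of the satellite.
graph = {
    "A": [("B", 100, 30.0), ("C", 150, 2.0)],
    "B": [("D", 100, 30.0)],
    "C": [("D", 150, 2.0)],
}
shortest = best_path(graph, "A", "D", alpha=0.0)
stable = best_path(graph, "A", "D", alpha=5.0)
```

With `alpha=0.0` the search returns the shortest route through B. With `alpha=5.0` the attenuation penalty outweighs the extra distance and the route through C is chosen.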

Evaluation of Structural Integrity of the ISO-based Moon Pool Type Diver Boats (ISO 기반 Moon Pool형 다이버 보트 구조 건전성 평가)

  • Kang, Byoung-mo;Oh, Woo-jun;Na, Hyun-ho;Choi, Ju-seok
    • Journal of the Korean Society of Marine Environment & Safety / v.24 no.5 / pp.597-603 / 2018
  • This study investigates the structural integrity of boats for divers, given increased demand for underwater and recreational use. We studied a small catamaran with a moon pool in the center of the hull, using the finite element method to calculate allowable stress based on the ISO rule. We computed the coefficients defined in ISO 12215-5 and TC118.1225-7, and determined the suitability of using the ISO standard and the Allowable Stress Design (ASD) method by applying longitudinal bending moment, torsional moment, and bottom slamming load. We also applied the Ultimate Strength Design Method (LFRD) using Finite Element Analysis (FEA). The results show that ships with a moon pool have structural integrity as designed in accordance with ISO and KR regulations.

A Novel Sender-Based TCP Congestion Control for Downward Vertical Handover (하향 수직 핸드오버 상황에서 송신자에 기반을 둔 TCP 혼잡 제어 기법)

  • Choi, Yeo-Min;Song, Joo-Seok
    • The Journal of Korean Institute of Communications and Information Sciences / v.33 no.6B / pp.430-439 / 2008
  • In this paper, we propose a sender-based TCP congestion control scheme for downward vertical handover (DVHO), in which a mobile node moves from a cellular network to a wireless LAN. DVHO can cause severe degradation of TCP throughput because it drastically changes link characteristics. In particular, TCP falsely invokes congestion control because of packet reordering, which results from the difference in link delay between the cellular link and the wireless LAN link. As a result, the congestion window is reduced and unnecessary retransmissions waste bandwidth. To solve these problems, we propose a method that uses the round-trip time estimated on the cellular link to process duplicate ACKs caused by reordering. Furthermore, the duplicate ACKs are used to control the congestion window size. Simulation results show that the proposed scheme solves these problems and performs better than TCP New Reno and nodupack.
