Search | Korea Science

Automatic Text Categorization Using Passage-based Weight Function and Passage Type (문단 단위 가중치 함수와 문단 타입을 이용한 문서 범주화)

Joo, Won-Kyun;Kim, Jin-Suk;Choi, Ki-Seok
- The KIPS Transactions:PartB
- /
- v.12B no.6 s.102
- /
- pp.703-714
- /
- 2005
Researches in text categorization have been confined to whole-document-level classification, probably due to lacks of full-text test collections. However, full-length documents availably today in large quantities pose renewed interests in text classification. A document is usually written in an organized structure to present its main topic(s). This structure can be expressed as a sequence of sub-topic text blocks, or passages. In order to reflect the sub-topic structure of a document, we propose a new passage-level or passage-based text categorization model, which segments a test document into several Passages, assigns categories to each passage, and merges passage categories to document categories. Compared with traditional document-level categorization, two additional steps, passage splitting and category merging, are required in this model. By using four subsets of Routers text categorization test collection and a full-text test collection of which documents are varying from tens of kilobytes to hundreds, we evaluated the proposed model, especially the effectiveness of various passage types and the importance of passage location in category merging. Our results show simple windows are best for all test collections tested in these experiments. We also found that passages have different degrees of contribution to main topic(s), depending on their location in the test document.
https://doi.org/10.3745/KIPSTB.2005.12B.6.703 인용 PDF KSCI

Designing Directory Structure for a SAN-Based Shared File System (SAN 기반 공유 파일 시스템을 위한 디렉토리 구조 설계)

김신우;이용규;김경배;신범주
- Proceedings of the Korea Multimedia Society Conference
- /
- 2001.11a
- /
- pp.503-507
- /
- 2001
최근 개발되고 있는 SAN 기반 리눅스 클러스터 파일 시스템들은 중앙에 파일 서버 없이 디스크를 공유하는 클라이언트들이 화이버 채널을 통하여 마치 파일 서버처럼 디스크에 자유롭게 접근할 수 있으므로, 유용성, 부하의 균형, 확장성 등에서 장점을 가진다. 본 논문에서는 ETRI에서 개발중인 SAN 기반 리눅스 클러스터 파일 시스템인 SANtopia를 위해 설계된 새로운 mode의 구조와 이 inode의 구조를 기반으로 확장 해싱(Extendible Hashing)을 이용한 새로운 디렉토리 구조의 설계에 대하여 기술하고,성능 평가를 통하여 제안된 방법의 우수성을 보인다.
PDF

Performance Measures of Marketing System in Quality of Life Study

안승호
- Proceedings of the Korean DIstribution Association Conference
- /
- 1997.07a
- /
- pp.81-100
- /
- 1997
현재 국내의 유통업계 전반에 걸친 대변혁은 유통업에 참여하고 기업뿐만 아니라 가치전달기능의 최종 목적지인 소비자, 유통정책을 수립하고 추진하는 정부, 유통체제의 일원인 제조업체 등 관련 이익집단에 유통 기능을 수행하는 마케팅에 대한 새로운 이해의 필요성을 강조하고 있다. 여기서 말하는 새로운 이채란 각 기업과 연관하여 파악된 마케팅 활동의 범주를 넘어 다수의 마케팅 연관 기관의 유기적 교류로 이루어진 마케팅 시스템과 사회의 연관 관계에 대한 이해로 정의 할 수 있을 것이다.(중략)
PDF

A Search-Result Clustering Method based on Word Clustering for Effective Browsing of the Paper Retrieval Results (논문 검색 결과의 효과적인 브라우징을 위한 단어 군집화 기반의 결과 내 군집화 기법)

Bae, Kyoung-Man;Hwang, Jae-Won;Ko, Young-Joong;Kim, Jong-Hoon
- Journal of KIISE:Software and Applications
- /
- v.37 no.3
- /
- pp.214-221
- /
- 2010
The search-results clustering problem is defined as the automatic and on-line grouping of similar documents in search results returned from a search engine. In this paper, we propose a new search-results clustering algorithm specialized for a paper search service. Our system consists of two algorithmic phases: Category Hierarchy Generation System (CHGS) and Paper Clustering System (PCS). In CHGS, we first build up the category hierarchy, called the Field Thesaurus, for each research field using an existing research category hierarchy (KOSEF's research category hierarchy) and the keyword expansion of the field thesaurus by a word clustering method using the K-means algorithm. Then, in PCS, the proposed algorithm determines the category of each paper using top-down and bottom-up methods. The proposed system can be used in the application areas for retrieval services in a specialized field such as a paper search service.
PDF KSCI

A Big Data Analysis by Between-Cluster Information using k-Modes Clustering Algorithm (k-Modes 분할 알고리즘에 의한 군집의 상관정보 기반 빅데이터 분석)

Park, In-Kyoo
- Journal of Digital Convergence
- /
- v.13 no.11
- /
- pp.157-164
- /
- 2015
This paper describes subspace clustering of categorical data for convergence and integration. Because categorical data are not designed for dealing only with numerical data, The conventional evaluation measures are more likely to have the limitations due to the absence of ordering and high dimensional data and scarcity of frequency. Hence, conditional entropy measure is proposed to evaluate close approximation of cohesion among attributes within each cluster. We propose a new objective function that is used to reflect the optimistic clustering so that the within-cluster dispersion is minimized and the between-cluster separation is enhanced. We performed experiments on five real-world datasets, comparing the performance of our algorithms with four algorithms, using three evaluation metrics: accuracy, f-measure and adjusted Rand index. According to the experiments, the proposed algorithm outperforms the algorithms that were considered int the evaluation, regarding the considered metrics.
https://doi.org/10.14400/JDC.2015.13.11.157 인용 PDF KSCI

Hierarchical Automatic Classification of News Articles based on Association Rules (연관규칙을 이용한 뉴스기사의 계층적 자동분류기법)

Joo, Kil-Hong;Shin, Eun-Young;Lee, Joo-Il;Lee, Won-Suk
- Journal of Korea Multimedia Society
- /
- v.14 no.6
- /
- pp.730-741
- /
- 2011
With the development of the internet and computer technology, the amount of information through the internet is increasing rapidly and it is managed in document form. For this reason, the research into the method to manage for a large amount of document in an effective way is necessary. The conventional document categorization method used only the keywords of related documents for document classification. However, this paper proposed keyword extraction method of based on association rule. This method extracts a set of related keywords which are involved in document's category and classifies representative keyword by using the classification rule proposed in this paper. In addition, this paper proposed the preprocessing method for efficient keywords creation and predicted the new document's category. We can design the classifier and measure the performance throughout the experiment to increase the profile's classification performance. When predicting the category, substituting all the classification rules one by one is the major reason to decrease the process performance in a profile. Finally, this paper suggested automatically categorizing plan which can be applied to hierarchical category architecture, extended from simple category architecture.
https://doi.org/10.9717/kmms.2011.14.6.730 인용 PDF KSCI

Latent causal inference using the propensity score from latent class regression model (잠재범주회귀모형의 성향점수를 이용한 잠재변수의 원인적 영향력 추론 연구)

Lee, Misol;Chung, Hwan
- The Korean Journal of Applied Statistics
- /
- v.30 no.5
- /
- pp.615-632
- /
- 2017
Unlike randomized trial, statistical strategies for inferring the unbiased causal relationship are required in the observational studies. The matching with the propensity score is one of the most popular methods to control the confounders in order to evaluate the effect of the treatment on the outcome variable. Recently, new methods for the causal inference in latent class analysis (LCA) have been proposed to estimate the average causal effect (ACE) of the treatment on the latent discrete variable. They have focused on the application study for the real dataset to estimate the ACE in LCA. In practice, however, the true values of the ACE are not known, and it is difficult to evaluate the performance of the estimated the ACE. In this study, we propose a method to generate a synthetic data using the propensity score in the framework of LCA, where treatment and outcome variables are latent. We then propose a new method for estimating the ACE in LCA and evaluate its performance via simulation studies. Furthermore we present an empirical analysis based on data form the 'National Longitudinal Study of Adolescents Health,' where puberty as a latent treatment and substance use as a latent outcome variable.
https://doi.org/10.5351/KJAS.2017.30.5.615 인용 PDF KSCI

Dynamic Classification of Categories in Web Search Environment (웹 검색 환경에서 범주의 동적인 분류)

Choi Bum-Ghi;Lee Ju-Hong;Park Sun
- Journal of KIISE:Software and Applications
- /
- v.33 no.7
- /
- pp.646-654
- /
- 2006
Directory searching and index searching methods are two main methods in web search engines. Both of the methods are applied to most of the well-known Internet search engines, which enable users to choose the other method if they are not satisfied with results shown by one method. That is, Index searching tends to come up with too many search results, while directory searching has a difficulty in selecting proper categories, frequently mislead to false ones. In this paper, we propose a novel method in which a category hierarchy is dynamically constructed. To do this, a category is regarded as a fuzzy set which includes keywords. Similarly extensible subcategories of a category can be found using fuzzy relational products. The merit of this method is to enhance the recall rate of directory search by expanding subcategories on the basis of similarity.
PDF KSCI

Are We Really Open to Creativity?: Elementary Gifted Students' Perceptions on Anti-Creativity Bias (우리는 정말 새로운 것에 열려 있는가?: 초등영재들이 인식하는 반창의성 편향)

Lee, Taehee;Han, Ki-Soon
- Journal of Gifted/Talented Education
- /
- v.25 no.2
- /
- pp.321-337
- /
- 2015
The purpose of the present study is to examine elementary gifted students' perceptions on bias against creativity utilizing concept mapping approach. Twelve elementary gifted students participated in the group brainstorming and produced 55 final statements. Based on these statements, the multi-dimensional scale and hierarchial cluster analysis using dissimilarity matrix were performed. Average stress value was .30 which is appropriate for a two-dimensional concept mapping study. In addition, a questionnaire survey using likert 6 points scale was carried out targeting 132 elementary gifted students to analyze the degree of sympathy on their anti-creativity bias perception. The findings are as follow: First, four categories were concluded dividing gifted students' perceptions on bias against creativity from the hierarchial cluster analysis with X-Y coordinate matrix, these were 'Contradictory attitudes to creativity', 'Low evaluation for creativity', 'Forced to predetermined rules and ideas', and 'Aversion to new things'. Second, elementary gifted students were sympathetic to the order 'Forced to predetermined rules and ideas'(M=4.16), 'Aversion to new things'(M=3.68), 'Contradictory attitudes to creativity'(M=3.55) and 'Low evaluation for creativity'(M=3.30). This study aims to examine, analyze and categorize various relevant factors related to elementary gifted students' perceptions on bias against creativity. Implications of the study related to the present and future creative education were discussed in depth.
https://doi.org/10.9722/JGTE.2015.25.2.321 인용 PDF KSCI

산업용 로보트의 제어에 관하여

변증남
- Journal of the KSME
- /
- v.20 no.2
- /
- pp.123-130
- /
- 1980
산업용로보트는 보통의 자동 기계와는 달리 상대하는 대상물과의 위치적 관계가 중요하여, 시 시각각으로 변화하는 대상물의 위치정보를 로보트 자체의 구동에 반영하여야 한다. 따라서 종 래의 써보 메카니즘(Servomechanism)의 제어 기술에서 보다 고려되어야할 사항이 많으며 제어 방법자체도 기능 제어(Intelligent Control) 분야라는 새로운 범주에 속하여 활발한 연구가 되고 있다. 본문에서는 이러한 산업용로보트의 제어기술에 관하여 기술하기로 한다.
PDF

Search Result 395, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)