Search | Korea Science

Document Clustering based on Level-wise Stop-word Removing for an Efficient Document Searching (효율적인 문서검색을 위한 레벨별 불용어 제거에 기반한 문서 클러스터링)

Joo, Kil Hong;Lee, Won Suk
- The Journal of Korean Association of Computer Education
- /
- v.11 no.3
- /
- pp.67-80
- /
- 2008
Various document categorization methods have been studied to provide a user with an effective way of browsing a large scale of documents. They do compares set of documents into groups of semantically similar documents automatically. However, the automatic categorization method suffers from low accuracy. This thesis proposes a semi-automatic document categorization method based on the domains of documents. Each documents is belongs to its initial domain. All the documents in each domain are recursively clustered in a level-wise manner, so that the category tree of the documents can be founded. To find the clusters of documents, the stop-word of each document is removed on the document frequency of a word in the domain. For each cluster, its cluster keywords are extracted based on the common keywords among the documents, and are used as the category of the domain. Recursively, each cluster is regarded as a specified domain and the same procedure is repeated until it is terminated by a user. In each level of clustering, a user can adjust any incorrectly clustered documents to improve the accuracy of the document categorization.
PDF

Phonological Characteristics of Russian Nasal Consonants (러시아어 비음의 음운적 특성)

Kim, Shin-Hyo
- Cross-Cultural Studies
- /
- v.39
- /
- pp.381-406
- /
- 2015
Russian nasal consonants / m /, / n / have a feature value not only [+consonant] in common with obstruents, but also [+sonorant] in common with vowels. Nasal / m /(bi-labial) and / n /(dental) have the same place of articulation but different manner of articulation. The feature value of / m / is [+cons, +son, +nas, +ant, -cor, -high, -low, -back, -cont, -del, rel, -strid, +voic], and that of / n / is [+cons, +son, +nas, +ant, +cor, -high, -low, -back, -cont, -del, rel, -strid, + voic]. There is a difference in feature [cor] value of / m / and / n /. In this study it is confirmed that it is a fact that the Russian nasal consonants behave differently from the other consonants in each phonological phenomenon due to their phonological characteristics. The preceding voiced obstruent is changed to an unvoiced one in a process where the last voiceless obstruent in the consonant cluster ' voiced obstruent + nasal /m/ + voiceless obstruent' skips the nasal consonant and spreads its feature value to the preceding voiced obstruent transparently because of the feature [+sonorant] of the nasal consonant. The coronal nasal /n/ participates in a palatalization with the following palatal actively and palatalize preceding plain consonants passively because of markedness hierarchy such as 'Velar > Labial > Coronal'. But the labial nasal /m/ is palatalized with the following velar palatal actively and participates in a palatalization with the following coronal palatal passively. This result helps us confirm the phonological difference of /m/ and /n/ in a palatalization. When the a final consonant is nasal, the unvoicing phenomenon of a final consonant doesn't occur. In such a case as cluster 'obstruent + nasal' the feature value [voiced] of the preceding obstruent doesn't change, but the following nasal can assimilate into the preceding obstruent. When continuing the same nasals / -nn- / in a consonant cluster, the feature value [+cont] of a weak position leads the preceding nasal / n / to be changed into [-cont] / l /. Through the analysis of the frequency of occurrences of consonants in syllabic onsets and codas that should observe the 'Sonority Sequence Principle', the sonority hierarchy of nasal consonants has been confirmed. In a diachronic perspective following nasal / m /, / n / there is a loss of the preceding labial stop and dental stop. But in clusters with the velar stop+nasal, the two-component cluster has been kept phonetically intact.

A Neural Network for Concept Learning : Recognitron (개념 학습에 의한 신경 회로망 컴퓨터)

Lee, Ki-Han;Whang, Hee-Yoong;Kim, Choon-Suk
- Proceedings of the KIEE Conference
- /
- 1989.07a
- /
- pp.495-499
- /
- 1989
Concept is the set of selected neurons in a stable state of a neurel network. The Recognitron uses a parallel feedback structure to support concept learning. A number of clusters can exist in response to a given input, each of which make up a selective neuron. There are supervised and unsupervised learnig methods in concept teaming. In this paper, we have chosen unsupervised learning. Also, a new concept called relaxational learning has been introduced to stop runaway weights
PDF

The Role of Linguistic Knowledge in the Perception of English Stops after /s/

Kim, Dae-Won
- Speech Sciences
- /
- v.3
- /
- pp.71-82
- /
- 1998
Five sets of nonsense acoustical stimuli {$[sp{\varepsilon},st{\varepsilon},sk{\varepsilon}],\;[p{\varepsilon},t{\varepsilon},k{\varepsilon}],\;[sb{\varepsilon},sd{\varepsilon},sg{\varepsilon}],\;[b{\varepsilon},d{\varepsilon},g{\varepsilon}],\;['{\varepsilon}b{\varepsilon},'{\varepsilon}d{\varepsilon},'{\varepsilon}g{\varepsilon}]$} were presented for identification of English stops to native speakers of English, Chinese, and Korean. The English speakers perceived stops after /s/ as /p, t, k/; in other contexts as /b, d, g/. In the languages where other distinctions exist, however, the evaluation was different. The results suggest that in English the cue for stops after /s/ was syllable structure constraint: After initial /s/ always /p, t, k/ follow; the cue for the initial stops was aspiration. On the basis of the results, it was concluded that in English we should classify the unaspirated voiceless stops in initial /s/-stop clusters into the phoneme where [$p^{h},t^{h},k^{h}$] are in, and that perception is not only language specific but also context specific.
PDF

An Effective Increment리 Content Clustering Method for the Large Documents in U-learning Environment (U-learning 환경의 대용량 학습문서 판리를 위한 효율적인 점진적 문서)

Joo, Kil-Hong;Choi, Jin-Tak
- Journal of the Korea Computer Industry Society
- /
- v.5 no.9
- /
- pp.859-872
- /
- 2004
With the rapid advance of computer and communication techonology, the recent trend of education environment is edveloping in the ubiquitous learning (u-learning) direction that learners select and organize the contents, time and order of learning by themselves. Since the amount of education information through the internet is increasing rapidly and it is managed in document in an effective way is necessary. The document clustering is integrated documents to subject by classifying a set of documents through their similarity among them. Accordingly, the document clustering can be used in exploring and searching a document and it can increased accuracy of search. This paper proposes an efficient incremental clustering method for a set of documents increase gradually. The incremental document clustering algorithm assigns a set of new documents to the legacy clusters which have been identified in advance. In addition, to improve the correctness of the clustering, removing the stop words can be proposed.
PDF

An Effective Incremental Text Clustering Method for the Large Document Database (대용량 문서 데이터베이스를 위한 효율적인 점진적 문서 클러스터링 기법)

Kang, Dong-Hyuk;Joo, Kil-Hong;Lee, Won-Suk
- The KIPS Transactions:PartD
- /
- v.10D no.1
- /
- pp.57-66
- /
- 2003
With the development of the internet and computer, the amount of information through the internet is increasing rapidly and it is managed in document form. For this reason, the research into the method to manage for a large amount of document in an effective way is necessary. The document clustering is integrated documents to subject by classifying a set of documents through their similarity among them. Accordingly, the document clustering can be used in exploring and searching a document and it can increased accuracy of search. This paper proposes an efficient incremental cluttering method for a set of documents increase gradually. The incremental document clustering algorithm assigns a set of new documents to the legacy clusters which have been identified in advance. In addition, to improve the correctness of the clustering, removing the stop words can be proposed and the weight of the word can be calculated by the proposed TF$\times$NIDF function.
https://doi.org/10.3745/KIPSTD.2003.10D.1.057 인용 PDF KSCI

Perception of the English Epenthetic Stops by Korean Listeners

Han, Jeong-Im
- Speech Sciences
- /
- v.11 no.1
- /
- pp.87-103
- /
- 2004
This study investigates Korean listeners' perception of the English stop epenthesis between the sonorant and fricative segments. Specifically this study investigates 1) how often English epenthetic stops are perceived by native Korean listeners, given the fact that Korean does not allow consonant clusters in codas; and 2) whether perception of the epenthetic stops, which are optional phonetic variations, not phonemes, could be improved without any explicit training. 120 English non-words with a mono-syllable structure of CVC1C2, where C1=/m, n, $\eta$, 1/, and C2=/s, $\theta$, $\int$/, were given to two groups of native Korean listeners, and they were asked to detect the target stops such as [p], [t], and [k]. The number of their responses were computed to determine how often listeners succeed in recovering the string of segments produced by the native English speaker. The results of the present study show that English epenthetic stops are poorly identified by native Korean listeners with low English proficiency, even in the case where stimuli with strong acoustic cues are provided with, but perception of epenthetic stops is closely related with listeners' English proficiency, showing the possibility of the improvement of perception. It further shows that perception of epenthetic stops shows asymmetry between coronal and non-coronal consonants.
PDF

A Study on development of Innovational Cluster for Knowledge Management in Busan (부산지역 지식경영을 위한 혁신클러스터 모델 구축에 관한 연구)

Jeong, Hyung-Il;Bang, Kwuen-Soo;Kim, Jong-Duk
- Management & Information Systems Review
- /
- v.29 no.4
- /
- pp.169-186
- /
- 2010
This study aims to reveal the ways to sharpen the edges of Korean companies through the relativity analysis between knowledge management and innovational cluster in environmental changes in resent Busan. That is, according to the knowledge management approach, the methods and directions of strengthening industrial competition were established, while the strategy of innovational clusters was suggested as a way of expanding and encouraging knowledge management. The key words of innovational cluster are in this research are the framework of Cluster theory, the importance of innovational cluster, and the change of managerial strategy paradigm. This study provide the several implication for the practice of knowledge management and the researchers. Based on these theories of knowledge management and industrial clusters, their close relationships were analyzed. As a result, industrial clusters were found to be effectively utilized to enlarge and deepen knowledge management. In addition, this suggests the efficient operation guideline of knowledge management. this study indicates both knowledge and innovational cluster should be operated and handled together in the managerial strategy. but this research has limitations in generaling the study result because it collects data from local firms only in Busan.
PDF

Mobile Sink Data Gathering through Clustering (클러스터링을 통한 모바일 싱크 데이터 수집)

Park, Jang-Su;Ahn, Byoung-Chul
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.46 no.5
- /
- pp.79-85
- /
- 2009
A sink node and its neighbor nodes spend more energy than other nodes since a stationary sink node collects data from wireless sensor networks(WSNs). For larger WSNs, the unbalanced energy of nodes causes the operation of WSNs to stop rapidly. This paper proposes a data gathering method by adapting the mobile sink to prolong the life time of large WSNs. After partitioning a network into several clusters, a mobile sink visits each cluster and collects data from it. An efficient algorithm is proposed to improve the energy efficiency by delivering the message from the mobile sink to the cluster head as well as to reduce the data gathering delay, which is the disadvantage of the mobile sink. Also, The algorithm is analyzed for the energy consumption and the data gathering delay. The validity of the ananlysis result is confirmed by the simulation.
PDF KSCI

QoS Guarantee in Partial Failure of Clustered VOD Server (클러스터 VOD 서버의 부분적 장애에서 QoS 보장)

Lee, Joa-Hyoung;Jung, In-Bum
- The KIPS Transactions:PartC
- /
- v.16C no.3
- /
- pp.363-372
- /
- 2009
For large scale VOD service, cluster servers are spotlighted to their high performance and low cost. A cluster server usually consists of a front-end node and multiple back-end nodes. Though increasing the number of back-end nodes can result in the more QoS streams for clients, the possibility of failures in back-end nodes is proportionally increased. The failure causes not only the stop of all streaming service but also the loss of the current playing positions. In this paper, when a back-end node becomes a failed state, the recovery mechanisms are studied to support the unceasing streaming service. For the actual VOD service environment, we implement a cluster-based VOD servers composed of general PCs and adopt the parallel processing for MPEG movies. From the implemented VOD server, a video block recovery mechanism is designed on parity algorithms. However, without considering the architecture of cluster-based VOD server, the application of the basic technique causes the performance bottleneck of the internal network for recovery and also results in the inefficiency CPU usage of back-end nodes. To address these problems, we propose a new failure recovery mechanism based on the pipeline computing concept.
https://doi.org/10.3745/KIPSTC.2009.16-C.3.363 인용 PDF KSCI

Search Result 21, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)