Search | Korea Science

Cho, Shi-Won;Lee, Dong-Wook
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.58 no.3
- /
- pp.632-638
- /
- 2009
Probabilistic Latent Semantic Analysis has many applications in information retrieval and filtering, natural language processing, machine learning from text, and in related areas. In this paper, we propose an algorithm using weighted Probabilistic Latent Semantic Analysis Model to find the contextual phrases and opinions from documents. The traditional keyword search is unable to find the semantic relations of phrases, Overcoming these obstacles requires the development of techniques for automatically classifying semantic relations of phrases. Through experiments, we show that the proposed algorithm works well to discover semantic relations of phrases and presents the semantic relations of phrases to the vector-space model. The proposed algorithm is able to perform a variety of analyses, including such as document classification, online reputation, and collaborative recommendation.
PDF KSCI

Choe, Yong-Seok;Sin, Ji-Ae;Choe, Gi-Seon;Kim, Gi-Tae;Lee, Sang-Tae
- Proceedings of the Korean Society for Emotion and Sensibility Conference
- /
- 2008.10a
- /
- pp.90-93
- /
- 2008
In natural language, it is common that repetitive constituents in an expression are to be left out and it is necessary to figure out the constituents omitted at analyzing the meaning of the sentence. This paper is on recognition of boundaries of parallel noun phrases by figuring out constituents omitted. Recognition of parallel noun phrases can greatly reduce complexity at the phase of sentence parsing. Moreover, in natural language information retrieval, recognition of noun with modifiers can play an important role in making indexes. We propose an unsupervised probabilistic model that identifies parallel cores as well as boundaries of parallel noun phrases conjoined by a conjunctive particle. It is based on the idea of swapping constituents, utilizing symmetry (two or more identical constituents are repeated) and reversibility (the order of constituents is changeable) in parallel structure. Semantic features of the modifiers around parallel noun phrase, are also used the probabilistic swapping model. The model is language-independent and in this paper presented on parallel noun phrases in Korean language. Experiment shows that our probabilistic model outperforms symmetry-based model and supervised machine learning based approaches.
PDF

Park, Kyung-Soo;Shin, Soo-Hwan;Lee, Jane
- Journal of the Ergonomics Society of Korea
- /
- v.25 no.4
- /
- pp.41-49
- /
- 2006
In a decision making process, the ambiguity of qualitative probabilistic expressions may result in a wrong conclusion. For this reason there had been many studies of quantifying qualitative probabilistic expressions in English-speaking countries. In this research, quantification of Korean qualitative probabilistic expressions is conducted through 4-step questionnaires. The numerical data of 78 verbal phrases were collected in the first questionnaire and classified in two categories (i.e., uncertainty and frequency). In each category, qualitative probabilistic expressions were divided into eleven groups according to the similarity of the numerical values. In the second questionnaire, subjects selected a representative expression for each group, which totaled 11. In the third questionnaire each subject was asked to rank eleven expressions from 1 to 11 with 1 indicating the highest probability. At last, subjects conducted pairwise comparisons to obtain relative weights, which are used to convert into the numerical probability scale.
https://doi.org/10.5143/JESK.2006.25.4.041 인용 PDF KSCI

Choi, Yong-Seok;Shin, Ji-Ae;Choi, Key-Sun
- Journal of KIISE:Software and Applications
- /
- v.35 no.2
- /
- pp.128-136
- /
- 2008
Recognition of parallel structure at early stage of sentence parsing can reduce the complexity of parsing. In this paper, we propose an unsupervised language-independent probabilistic model for recongition of parallel noun structures. The proposed model is based on the idea of swapping constituents, which replies the properties of symmetry (two or more identical constituents are repeated) and of reversibility (the order of constituents is inter-changeable) in parallel structures. The non-symmetric patterns that cannot be captured by the general symmetry rule are resolved additionally by the modifier information. In particular this paper shows how the proposed model is applied to recognize Korean parallel noun phrases connected by "wa/kwa" particle. Our model is compared with other models including supervised models and performs better on recongition of parallel noun phrases.
PDF KSCI