Search | Korea Science

Moon, Seonghyeon;Chung, Sehwan;Chi, Seokho
- KSCE Journal of Civil and Environmental Engineering Research
- /
- v.38 no.4
- /
- pp.595-599
- /
- 2018
Sufficient understanding of oversea construction market status is crucial to get profitability in the international construction project. Plenty of researchers have been considering the news article as a fine data source for figuring out the market condition, since the data includes market information such as political, economic, and social issue. Since the text data exists in unstructured format with huge size, various text-mining techniques were studied to reduce the unnecessary manpower, time, and cost to summarize the data. However, there are some limitations to extract the needed information from the news article because of the existence of various topics in the data. This research is aimed to overcome the problems and contribute to summarization of market status by performing topic modeling with Latent Dirichlet Allocation. With assuming that 10 topics existed in the corpus, the topics included projects for user convenience (topic-2), private supports to solve poverty problems in Africa (topic-4), and so on. By grouping the topics in the news articles, the results could improve extracting useful information and summarizing the market status.
https://doi.org/10.12652/Ksce.2018.38.4.0595 인용 PDF KSCI

Goo, Juna;Kim, Kyunga
- The Korean Journal of Applied Statistics
- /
- v.27 no.7
- /
- pp.1207-1217
- /
- 2014
2011 Korean Economic Census is the first economic census in Korea, which contains text data on menus served by Korean-food restaurants as well as structured data on characteristics of restaurants including area, opening year and total sales. In this paper, we applied text mining to the text data and investigated statistical and technical issues and characteristics of Korean text mining. Pork belly roast was the most popular menu across provinces and/or restaurant types in year 2010, and the number of restaurants per 10000 people was especially high in Kangwon-do and Daejeon metropolitan city. Beef tartare and fried pork cutlet are popular menus in start-up restaurants while whole chicken soup and maeuntang (spicy fish stew) are in long-lived restaurants. These results can be used as a guideline for menu development to restaurant owners, and for government policy-making process that lead small restaurants to choose proper menus for successful business.
https://doi.org/10.5351/KJAS.2014.27.7.1207 인용 PDF KSCI

Yoo, Han-mook;Kim, Han-joon;Chang, Jae-young
- Journal of KIISE
- /
- v.44 no.11
- /
- pp.1236-1243
- /
- 2017
In this paper, we propose a novel way of producing keyword networks, named LSI-based ClusterTextRank, which extracts significant key words from a set of clusters with a mutual information metric, and constructs an association network using latent semantic indexing (LSI). The proposed method reduces the dimension of documents through LSI, decomposes documents into multiple clusters through k-means clustering, and expresses the words within each cluster as a maximal spanning tree graph. The significant key words are identified by evaluating their mutual information within clusters. Then, the method calculates the similarities between the extracted key words using the term-concept matrix, and the results are represented as a keyword association network. To evaluate the performance of the proposed method, we used travel-related blog data and showed that the proposed method outperforms the existing TextRank algorithm by about 14% in terms of accuracy.
https://doi.org/10.5626/JOK.2017.44.11.1236 인용 KSCI

이용주
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06c
- /
- pp.303-309
- /
- 1994
최근의 음성연구의 관신은 낭독음성에서 자유발화음성으로 옮겨가고 있다. 본고에서는 자유발화음성을 대상으로한 음성번역 및 대화시스템의 연구동향과 함께 자유발화의 음성 및 텍스트코퍼스 구축을 위한 몇몇 사항들을 살펴보고, 필자들이 현재 수집중인 코퍼스의 예를 소개한다.
PDF