A Proposal of a Keyword Extraction System for Detecting Social Issues (사회문제 해결형 기술수요 발굴을 위한 키워드 추출 시스템 제안)

  • Jeong, Dami;Kim, Jaeseok;Kim, Gi-Nam;Heo, Jong-Uk;On, Byung-Won;Kang, Mijung
    • Journal of Intelligence and Information Systems, v.19 no.3, pp.1-23, 2013
  • The existing approach to discovering significant social issues that modern society urgently needs to solve, such as unemployment, economic crisis, and social welfare, is for researchers to collect opinions from professional experts and scholars through online or offline surveys. However, this method is not always effective. Because of the expense, a large number of survey replies is seldom gathered. In some cases, it is also hard to find professionals who deal with a specific social issue. Thus, the sample set is often small and may be biased. Furthermore, several experts may reach totally different conclusions about the same social issue, because each expert has a subjective point of view and a different background. In such cases, it is considerably hard to figure out what the current social issues are and which of them are really important. To overcome these shortcomings, in this paper we develop a prototype system that semi-automatically detects keywords representing social issues and problems from about 1.3 million news articles published by about 10 major domestic news outlets in Korea from June 2009 until July 2012. Our proposed system consists of (1) collecting news articles and extracting their texts, (2) identifying only the news articles related to social issues, (3) analyzing the lexical items of Korean sentences, (4) finding a set of topics regarding social keywords over time based on probabilistic topic modeling, (5) matching relevant paragraphs to a given topic, and (6) visualizing social keywords for easy understanding. In particular, we propose a novel matching algorithm relying on generative models. The goal of the matching algorithm is to find the paragraphs that best match each topic.
Technically, using a topic model such as Latent Dirichlet Allocation (LDA), we can obtain a set of topics, each of which has relevant terms and their probability values. In our problem, given a set of text documents (e.g., news articles), LDA produces a set of topic clusters, and each topic cluster is then labeled by human annotators, where each topic label stands for a social keyword. For example, suppose there is a topic (e.g., Topic1 = {(unemployment, 0.4), (layoff, 0.3), (business, 0.3)}) and a human annotator labels Topic1 as "Unemployment Problem". From this label alone, it is difficult to understand what happened regarding the unemployment problem in our society. In other words, by looking only at social keywords, we have no idea of the detailed events occurring in our society. To tackle this matter, we develop a matching algorithm that computes the probability value of a paragraph given a topic, relying on (i) the topic terms and (ii) their probability values. Given a set of text documents, we segment each text document into paragraphs. In the meantime, using LDA, we extract a set of topics from the text documents. Based on our matching process, each paragraph is assigned to the topic that it best matches. As a result, each topic has several best-matched paragraphs. For instance, suppose there is a topic (e.g., Unemployment Problem) and its best-matched paragraph (e.g., "Up to 300 workers lost their jobs in XXX company at Seoul"). In this case, we can grasp the detailed information behind the social keyword, such as "300 workers", "unemployment", "XXX company", and "Seoul". In addition, our system visualizes social keywords over time. Therefore, through our matching process and keyword visualization, most researchers will be able to detect social issues easily and quickly.
Through this prototype system, we have detected various social issues appearing in our society, and our experimental results show the effectiveness of our proposed methods. Note that our proof-of-concept system is available at http://dslab.snu.ac.kr/demo.html.
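
The paragraph-to-topic matching step described in this abstract could be sketched roughly as follows. This is only an illustrative sketch, not the authors' algorithm: the abstract says the score relies on topic terms and their probability values, and the length-normalized log-likelihood, the `SMOOTHING` constant, the whitespace tokenizer, and the second topic are all assumptions introduced here.

```python
import math

# Hypothetical labeled topics (term -> probability). The first one mirrors
# the Topic1 example in the abstract; the second is invented for contrast.
TOPICS = {
    "Unemployment Problem": {"unemployment": 0.4, "layoff": 0.3, "business": 0.3},
    "Social Welfare": {"welfare": 0.5, "pension": 0.3, "care": 0.2},
}

# Assumed probability for words that are not among a topic's terms.
SMOOTHING = 1e-3


def score(tokens, topic_terms):
    """Length-normalized log-likelihood of the tokens under one topic."""
    if not tokens:
        return float("-inf")
    total = sum(math.log(topic_terms.get(tok, SMOOTHING)) for tok in tokens)
    return total / len(tokens)


def best_topic(paragraph, topics=TOPICS):
    """Return the label of the topic that the paragraph matches best."""
    tokens = paragraph.lower().split()  # naive tokenizer, an assumption
    return max(topics, key=lambda label: score(tokens, topics[label]))


if __name__ == "__main__":
    print(best_topic("layoff wave raises unemployment at business"))
```

A real pipeline would take the topic-term probabilities from a fitted LDA model and use a Korean morphological analyzer instead of `split()`.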

Characteristic on the Layout and Semantic Interpretation of Chungryu-Gugok, Dongaksan Mountain, Gokseong (곡성 동악산 청류구곡(淸流九曲)의 형태 및 의미론적 특성)

  • Rho, Jae-Hyun;Shin, Sang-Sup;Huh, Joon;Lee, Jung-Han;Han, Sang-Yub
    • Journal of the Korean Institute of Traditional Landscape Architecture, v.32 no.4, pp.24-36, 2014
  • This study investigated the semantic value and the layout of Cheongryu Gugok in Dorimsa Valley, which shows a high level of completeness and scenic preservation value among the three gugoks distributed around Mt. Dongak in Gokseong. The results are as follows. The area around Cheongryu Gugok is a case where gugok culture, which was enjoyed as a model of Neo-Confucian culture, was actually applied to bedrock scenery such as waterfalls, riversides, ponds, and flatland along a beautiful valley, and it is an outstanding scenic site, described in an 1872 local map of Gokseong-hyeon as "Samnam Jeil Amban Gyeryu Cheongryu-dong(三南第一巖盤溪流 淸流洞: Cheongryu-dong, the best bedrock stream in the Samnam area)." Cheongryu Gugok, which is distinguished by the seasonal scenery and epigrams set along both a land route and a waterway, was probably established under the lead of Sun-tae Jeong(丁舜泰, ?~1916) and Byeong-sun Cho(曺秉順, 1876~1921) before 1916, during the Japanese colonial period. However, because a number of Janggugiso of ancient sages, such as political activists, Buddhist leaders, and Neo-Confucian scholars, have been established there, it is presumed to have been used long before as a hermit site and scenic site visited by masters. Cheongryu Gugok lies on the bedrock floor of Dorimsa Valley, a mountain-type stream, with a total length of 1.2km and an average gok(曲) length of 149m, which appears to be shorter than other gugoks in Korea. The rock writings of the three gugoks on Mt. Dongak, such as Cheongryu Gugok, which was the only one verified in the Jeonnam area, total 165, which is judged to be the largest concentration of rock writings in the nation.
In particular, an analysis of the rock writings at 112 places in Cheongryu Gugok showed 49 pieces (43.8%) of epigrams with the meaning of 'moral training', 21 pieces (18.8%) on human life, 16 pieces (14.2%) on seasonal scenery, and 12 pieces (10.6%) of Janggugiso such as Jangguchur, while poem verses accounted for six cases (3.6%). Sweyeonmun(鎖烟門), the first gok of the land route, and Jesiinganbyeolyucheon(除是人間別有天), the ninth gok of the waterway, correspond to the Hongdanyeonse(虹斷烟鎖) of the first gok and the Jesiinganbyeolyucheon of the ninth gok established in Jaecheon, Chungbuk by Se-hwa Park(朴世和, 1834~1910), so the gugok names are inferred to have the same origin. In addition, the Daeeunbyeong(大隱屛) of the sixth gok of the land route corresponds to the seventh gok of Chu Hsi's Wuyi-Gugok, which is acknowledged as the basis for Gugok Wollim, and the rock writings and stonework of 'Amseojae(巖棲齋)' and 'Pogyeongjae(抱經齋)' between the seventh gok and eighth gok are traces comparable with the Wuyi Jeongsa(武夷精舍) placed below Eunbyeon-bong of Wuyi Gugok, and are understood to have been the activity base of the Giho Sarim(畿湖士林) in Cheongryu-dong. The rock writings in the Mt. Dongak area, including famous sayings by masters such as Sunsaeuhje(鮮史御帝, Emperor Gojong), Bogahyowoo(保家孝友, Emperor Gojong), Manchunmungywol(萬川明月, King Jeongjo), Biryeobudong(非禮不動, Chongzhen Emperor of the Ming Dynasty), Samusa(思無邪, Euijong of the Ming Dynasty), Baksechungpwoong(百世淸風, Chu Hsi), and Chungryususuk-Dongakpungkyung(淸流水石 動樂風景, Heungseon Daewongun), can be said to be a repository of semantic and symbolic cultural scenery, rather than only an expression of Confucian aesthetics. In addition, Cheongryu Gugok is notable as a cluster of cultural scenery of the three religions of Confucianism, Buddhism, and Taoism, where the Confucian value system, Buddhist concepts, and Taoist concepts coexist for mind training and cultivation.
Cheongryu Gugok has a semantic feature and spatial character as a base of the historical and cultural struggle carried by the anti-Japanese spirit. This spirit was conceived in the process of establishing and using the site to promote the spirit of learning, loyalty to the Emperor and expulsion of barbarians, and the inspiration of anti-Japanese forces, by the Confucian scholar class at the end of the Joseon era, which inherited the sense of Dotong(道統) of Neo-Confucianism and is represented by Ik-hyun Choi(崔益鉉, 1833~1906), Woo Jeon(田愚, 1841~1922), Woo-man Gi(奇宇萬, 1846~1916), Byung-sun Song(宋秉璿, 1836~1905), and Hyeon Hwang(黃玹, 1855~1910).

Clustering Method based on Genre Interest for Cold-Start Problem in Movie Recommendation (영화 추천 시스템의 초기 사용자 문제를 위한 장르 선호 기반의 클러스터링 기법)

  • You, Tithrottanak;Rosli, Ahmad Nurzid;Ha, Inay;Jo, Geun-Sik
    • Journal of Intelligence and Information Systems, v.19 no.1, pp.57-77, 2013
  • Social media has become one of the most popular media in web and mobile applications. In 2011, social networks and blogs were still the top destination of online users, according to a study from the Nielsen Company. In that study, nearly 4 in 5 active users visited social networks and blogs. Social network and blog sites rule Americans' Internet time, accounting for 23 percent of time spent online. Facebook is the social network on which U.S. internet users spend more time than on other services such as Yahoo, Google, AOL Media Network, Twitter, LinkedIn and so on. As a recent trend, most companies promote their products on Facebook by creating a "Facebook Page" that refers to a specific product. The "Like" option allows users to subscribe to a page and receive updates on their interests from it. Film makers, who produce many films around the world, also market and promote their films by exploiting the advantages of the "Facebook Page". In addition, a great number of streaming service providers allow users to subscribe to their services to watch and enjoy movies and TV programs. Users can instantly watch movies and TV programs over the internet on PCs, Macs and TVs. Netflix alone, as the world's leading subscription service, has more than 30 million streaming members in the United States, Latin America, the United Kingdom and the Nordics. As a matter of fact, a million movies and TV programs of different genres are offered to subscribers. However, users need to spend a lot of time to find the right movies related to their genre of interest. In recent years, many researchers have proposed methods to improve the prediction of ratings or preferences in order to recommend the most related items, such as books, music or movies, to the target user or to a group of users who share an interest in particular items.
One of the most popular methods for building a recommendation system is traditional Collaborative Filtering (CF). The method computes the similarity between the target user and other users, who are then clustered by shared interest in items according to the items they have rated. The method then predicts which other items from the same group of users to recommend to the users in that group. There are many kinds of items that could be studied for recommendation, such as books, music, movies, news, videos and so on. However, in this paper we focus only on movies as the items to recommend to users. In addition, there are many challenges for the CF task. The first is the "sparsity problem", which occurs when there is not enough user preference information. The resulting recommendation accuracy is lower than for neighbors who have a large number of ratings. The second is the "cold-start problem", which occurs whenever new users or items are added to the system, each of which has no ratings or only a few ratings. For instance, no personalized predictions can be made for a new user without any ratings on record. In this research, we propose a clustering method based on users' genre interest extracted from a social network service (SNS) and on the users' movie rating information in our system to solve the "cold-start problem." Our proposed method clusters the target user together with other users by combining the users' genre interest and the rating information. A huge amount of interesting and useful user information is available from the Facebook Graph, and we can extract information from the "Facebook Pages" that users have "Liked". Moreover, we use the Internet Movie Database (IMDb) as the main dataset. IMDb is an online database that consists of a large amount of information related to movies and TV programs, including actors.
This dataset is used not only to provide movie information in our Movie Rating System, but also as a resource to provide genre information for the movies extracted from the "Facebook Page". First, the user must log in to the Movie Rating System with their Facebook account, and at the same time our system collects the user's genre interest from the "Facebook Page". We conducted many experiments to see how our method performs and to compare it with other methods. First, we compared our proposed method with the others in the case of normal recommendation to see how our system improves the recommendation result. Then we tested the method in the case of the cold-start problem. Our experiments show that our method outperforms the other methods in both cases.
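
The idea of placing a new user in a group by genre interest and then borrowing that group's ratings could be sketched as follows. This is a simplified illustration, not the method evaluated in the paper: a cosine-similarity threshold stands in for the clustering step, and the genre axes, user data, threshold value, and function names are all hypothetical.

```python
import math

# Hypothetical genre axes; each user vector holds interest in these genres.
GENRES = ["action", "comedy", "drama"]

# Hypothetical existing users: genre-interest vector (e.g. derived from liked
# pages) plus the movie ratings they have already given.
USERS = {
    "u1": {"genres": [0.9, 0.1, 0.0], "ratings": {"MovieA": 5, "MovieB": 2}},
    "u2": {"genres": [0.8, 0.2, 0.0], "ratings": {"MovieA": 4}},
    "u3": {"genres": [0.0, 0.3, 0.7], "ratings": {"MovieA": 1, "MovieB": 5}},
}


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def cluster_for(genre_vector, users=USERS, threshold=0.8):
    """Ids of users whose genre interest is close to the given vector.

    The threshold is an assumed stand-in for a real clustering algorithm.
    """
    return [uid for uid, u in users.items()
            if cosine(genre_vector, u["genres"]) >= threshold]


def predict(genre_vector, movie, users=USERS):
    """Mean rating of the movie among similar users, or None if unavailable."""
    members = cluster_for(genre_vector, users)
    ratings = [users[uid]["ratings"][movie] for uid in members
               if movie in users[uid]["ratings"]]
    return sum(ratings) / len(ratings) if ratings else None


if __name__ == "__main__":
    new_user = [1.0, 0.0, 0.0]  # no ratings yet, only genre interest
    print(cluster_for(new_user), predict(new_user, "MovieA"))
```

Because the prediction depends only on the genre vector, it works for a user with no ratings at all, which is the cold-start case the abstract targets.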

The Characteristics and Performances of Manufacturing SMEs that Utilize Public Information Support Infrastructure (공공 정보지원 인프라 활용한 제조 중소기업의 특징과 성과에 관한 연구)

  • Kim, Keun-Hwan;Kwon, Taehoon;Jun, Seung-pyo
    • Journal of Intelligence and Information Systems, v.25 no.4, pp.1-33, 2019
  • Small and medium-sized enterprises (hereinafter SMEs) are already at a competitive disadvantage compared to large companies with more abundant resources. Manufacturing SMEs not only need a lot of information for new product development in order to sustain growth and survive, but also seek networking to overcome their resource limitations; however, they face constraints due to their size. In a new era in which connectivity increases the complexity and uncertainty of the business environment, SMEs are increasingly urged to find information and solve networking problems. To address these problems, government-funded research institutes have an important role and duty in resolving the information asymmetry problem of SMEs. The purpose of this study is to identify the differentiating characteristics of SMEs that utilize the public information support infrastructure provided to SMEs to enhance their innovation capacity, and how this infrastructure contributes to corporate performance. We argue that an infrastructure for providing information support to SMEs is needed as part of the effort to strengthen the role of government-funded institutions; in this study, we specifically identify the target of such a policy and empirically demonstrate the effects of such policy-based efforts. Our goal is to help establish strategies for building the information supporting infrastructure. To achieve this purpose, we first classified the characteristics of SMEs that have been found to utilize the information supporting infrastructure provided by government-funded institutions. This allows us to verify whether selection bias appears in the analyzed group, which helps us clarify the interpretative limits of our study results.
Next, we performed mediator and moderator effect analysis for multiple variables to analyze the process through which the use of the information supporting infrastructure led to an improvement in external networking capabilities and resulted in enhanced product competitiveness. This analysis helps identify the key factors we should focus on when offering indirect support to SMEs through the information supporting infrastructure, which in turn helps us more efficiently manage research related to SME support policies implemented by government-funded institutions. The results of this study are as follows. First, SMEs that used the information supporting infrastructure were found to differ significantly in size from domestic R&D SMEs, but there was no significant difference in the cluster analysis that considered various variables. Based on these findings, we confirmed that SMEs that use the information supporting infrastructure are larger and include a relatively higher share of companies that transact to a greater degree with large companies, compared to the general group of SMEs. We also found that the companies that already receive support from the information infrastructure include a high concentration of companies that need collaboration with government-funded institutions. Second, among the SMEs that use the information supporting infrastructure, we found that increasing external networking capabilities contributed to enhancing product competitiveness; this was not a direct effect, but an indirect contribution made by increasing open marketing capabilities. In other words, this was the result of an indirect-only mediator effect.
Also, the number of times the company received additional support in this process through mentoring related to information utilization was found to have a mediated moderator effect on improving external networking capabilities and in turn strengthening product competitiveness. The results of this study provide several insights that will help establish policies. KISTI's information support infrastructure may lead to the conclusion that marketing is already well underway, but it intentionally supports groups that are able to achieve good performance. As a result, the government should set clear priorities on whether to support underdeveloped companies or to help companies perform better. Through our research, we have identified how public information infrastructure contributes to product competitiveness. From this, we can draw some policy implications. First, the public information support infrastructure should be able to enhance the ability to interact with or find the expert who provides the required information. Second, if the utilization of the public information support (online) infrastructure is effective, it is not necessary to continuously provide informational mentoring, which is a parallel offline support. Rather, offline support such as mentoring should be used as an appropriate device for monitoring abnormal symptoms. Third, SMEs should improve their ability to utilize the infrastructure, because the effect of enhancing networking capacity through the public information support infrastructure, and of enhancing product competitiveness through it, appears in most types of companies rather than in specific SMEs.

9 Provinces and 5 Secondary Capitals, Myeong-ju(Haseo-ju) - Revolve Around Urban Structure - (구주오소경과 명주(하서주) - 그 도시구조를 중심으로 -)

  • Takahumi, Yamada
    • Korean Journal of Heritage: History & Science, v.45 no.2, pp.20-37, 2012
  • After the withdrawal of the military troops of the Chinese Tang dynasty in the 18th year of King Moon-moo's reign(678), the Silla Kingdom effectively unified the Korean peninsula and divided the territory into 9 states, benchmarking China's local administration system. It established local administrative units by deploying secondary capitals, counties and prefectures in the nine states. The so-called "9 Provinces and 5 Secondary capitals" constitute this local administration system. The provinces can be compared to the current provinces of the Republic of Korea(hereinafter Korea), and the secondary capitals to megalopolises. According to a chapter of the Samkuksaki(三國史記) recording the achievements of King Kyoungdeok in December of the 16th year of his reign(757), the local administrative units amounted to 5 secondary capitals, 117 counties and 293 prefectures. Many points remain ambiguous, since the locations of the castles of the provinces and secondary capitals and the structures of the cities have never been examined, because research on local cities within the 9 Provinces and 5 Secondary capitals of the Unified Silla Kingdom has centered on historical literature only. Research on restoring the structures of cities from an archeological perspective is limited to the study of Taewoo Park("A study on the local cities in the Unified Kingdom Age" 1987) and that of the author("A study on the restoration of planned cities for the Unified Silla Kingdom in terms of the structures and realities of the castles in the 9 Provinces and 5 Secondary capitals" 2009). Gangneung city in Gangwon province was originally the ancient nation of Ye(濊) and was called Haseoryang(河西良) under the Goguryeo Kingdom. According to the "Samkuksaki", it evolved from Haseoju(河西州) into a secondary capital in the 8th year of King Seonduk(639).
Afterwards, it was renamed Myeongju(溟洲) in the 16th year of King Kyoungduk(757), and several other names were given to it after the Goryo dynasty. Taewoo Park claims that it can be identified with a sanctuary remaining in Myoungjudong because of the vestige of a bare castle there, but this cannot be ascertained due to the ongoing urbanization. Also, Kwandong University suggests regarding the Myeongju mountain castle, located 3 km southwest of the center of Gangwon city, as the command post of the state. The author has restored the area as a city composed of villages within a lattice framework, like Silla's Keumkyoung and many other cities. The structure is described next. The downtown of Gangneung is situated on flat terrain on the west bank of the Namdaecheon stream, which flows southwest to northeast through the inner area of the city. Although no hill in the vicinity is notably higher than the others, hills are continuously linked east to west along the northern area of the downtown, and the flat terrain is not large, with a maximum width of about 1 km. Currently, urbanization is proceeding into the inner portion of Gangneung city, the lands in all directions from the hub of Gangneung station have been readjusted, and thus the previous land zoning has almost disappeared. However, referring to the topographic map drawn during Japanese colonial rule, it can be confirmed that land zoning following a lattice framework with a side length of 190m left its vestige over about 0.8 km northwest to southeast and about 1.7 km northeast to southwest in the vicinity of Okcheondong, Imdangdong, Geumhakdong, Myeongjudong, and other districts that comprise the hub of the downtown. The land-zoning vestige within the lattice framework, compared to other cases related to the '9 states and 5 secondary capitals', is very likely to be that of the Unified Silla Kingdom.
Why the side of a lattice block is 190m long, as opposed to the 140m or 160m sides of Silla's Geumkyoung and other cities, is an item for future study. The baseline direction for zoning the lands is tilted approximately 37.5 degrees west of the northwest to southeast axis in accordance with the topographic features. This seems to result from the direction of the Namdaecheon and the geographic constraints of the hills in the north. At a minimum, a rectangular zoned area of 4 Pangs(坊) on the northwest to southeast side by 7 Pangs(坊) on the northeast to southwest side has been restored within the lattice framework. Alternatively, considering the extent of the existing zoned lands in the lattice framework and adding one more Pang(坊) to each side, the size could have been 5 Pangs(坊) on the northwest to southeast side by 8 Pangs(坊) on the northeast to southwest side(950m on the northwest to southeast side by 1,520m on the northeast to southwest side). The overall shape is rectangular, but land zoning reminiscent of rebuilt roads(red phoenix road) like those of Jang-an castle(長安城) of the Chinese Tang dynasty or Pyoungseong castle(平城城) in Japan cannot be confirmed. Although no archeological survey of the vestige of Myeongju has been made yet, some of the roof tiles and earthenware excavated at local administrative office sites and at Gangneung's town castle of the Joseon dynasty, inside the area assumed to contain the municipal vestiges, can be dated back to the Unified Silla Kingdom age. Also, all of the building sites of the local administrative offices of the Joseon dynasty show a large slant in azimuth.
This is circumstantial evidence that land zoning based on the lattice framework once existed in Gangneung and was inherited. Also, having reviewed the eave-end roof tiles found in this stronghold, the author does not deny that the Myeongju mountain castle was once the command post. The ancient municipal castles in the Korean peninsula are composed of castles on flat terrain as well as in hilly areas, as with the cluster of strongholds such as the Myounghwal, Namhan, and Seohyoung mountain castles built around the municipal castle of Geumkyoung, which was based on a lattice framework. Considering that mountain castles are also spread in the vicinity of the municipal vestiges in other cities of the 9 states and 5 secondary capitals, it is estimated that Myeongju functioned as a command post incorporating a city on the flat terrain and castles on the hills.
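
The two restored grid sizes quoted in this abstract follow from multiplying the number of Pang(坊) blocks by the 190m block side. The short sketch below reproduces that arithmetic. It assumes the extent is simply blocks times side length, with no separate allowance for road widths, and the function and constant names are hypothetical.

```python
# Side length of one lattice block (Pang), as given in the abstract.
BLOCK_SIDE_M = 190


def grid_extent(blocks_nw_se, blocks_ne_sw, side_m=BLOCK_SIDE_M):
    """Return (NW-SE length, NE-SW length) in meters for a block grid.

    Assumes extent = number of blocks * block side, ignoring road widths.
    """
    return blocks_nw_se * side_m, blocks_ne_sw * side_m


if __name__ == "__main__":
    print(grid_extent(4, 7))  # minimal restoration: 4 x 7 Pangs
    print(grid_extent(5, 8))  # expanded restoration: 5 x 8 Pangs
```

The 5 x 8 case gives 950m by 1,520m, matching the figures stated in the abstract; the 4 x 7 case gives 760m by 1,330m under the same assumption.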