
A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model

  • 조원진;노상규;윤지영;박진수
    • Asia Pacific Journal of Information Systems / Vol. 21, No. 1 / pp. 103-122 / 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful. For this reason, some online documents are accompanied by a list of keywords specified by the authors in an effort to guide users by facilitating the filtering process. In this way, a set of keywords is often considered a condensed version of the whole document and therefore plays an important role in document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could also benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, implementation is the obstacle: manually assigning keywords to all documents is a daunting, even impractical, task because it is extremely tedious and time-consuming and requires a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are two main approaches to achieving this aim: the keyword assignment approach and the keyword extraction approach. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given vocabulary, and the aim is to match its terms to the texts. In other words, the keyword assignment approach seeks to select the words from a controlled vocabulary that best describe a document. 
Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. In the latter approach, on the other hand, the aim is to extract keywords according to their relevance in the text, without a prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted using supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document into positive or negative examples. Several systems, such as Extractor and Kea, were developed using the keyword extraction approach. The most indicative words in a document are selected as keywords for that document; as a result, keyword extraction is limited to terms that appear in the document. Therefore, keyword extraction cannot generate implicit keywords that are not included in a document. According to the experimental results of Turney, about 64% to 90% of the keywords assigned by the authors can be found in the full text of an article. Conversely, this means that 10% to 36% of the keywords assigned by the authors do not appear in the article and so cannot be generated by keyword extraction algorithms. Our preliminary experiment also shows that 37% of the keywords assigned by the authors are not included in the full text. This is why we have decided to adopt the keyword assignment approach. In this paper, we propose a new approach for automatic keyword assignment, namely IVSM (Inverse Vector Space Model). The model is based on the vector space model, which is a conventional information retrieval model that represents documents and queries as vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets. 
The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: an IVSM system for a Web-based community service and a stand-alone IVSM system. The first system is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers, and it has been tested on a number of academic papers, including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. According to our experiment, the precisions of IVSM applied to the Web-based community service and to academic journals were 0.75 and 0.71, respectively. The performance of both systems is much better than that of baseline systems that generate keywords based on simple probability. IVSM also shows performance comparable to that of Extractor, a representative system of the keyword extraction approach developed by Turney. As the number of electronic documents increases, we expect that the IVSM proposed in this paper can be applied to many electronic documents in Web-based communities and digital libraries.
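
The five steps listed in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the abstract does not say how keyword weights are obtained, so the `KEYWORD_PROFILES` term weights, the `tokenize` helper, and the `top_k` cutoff are all hypothetical assumptions made for the example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Step 2 (assumed form): lowercase the text and keep alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def vector_length(weights):
    """Steps 1 and 3: Euclidean length of a sparse term-weight vector."""
    return math.sqrt(sum(w * w for w in weights.values()))


def cosine(a, b):
    """Step 4: cosine similarity between two sparse vectors (dicts)."""
    len_a, len_b = vector_length(a), vector_length(b)
    if len_a == 0 or len_b == 0:
        return 0.0
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    return dot / (len_a * len_b)


def assign_keywords(document, keyword_profiles, top_k=2):
    """Step 5: return the top_k keywords with the highest nonzero similarity."""
    doc_vector = Counter(tokenize(document))  # raw term frequencies
    scored = [(cosine(doc_vector, profile), keyword)
              for keyword, profile in keyword_profiles.items()]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [keyword for score, keyword in scored[:top_k] if score > 0]


# Hypothetical term weights per candidate keyword. In practice these would
# presumably be learned from documents that already carry keywords.
KEYWORD_PROFILES = {
    "information retrieval": {"query": 2.0, "search": 2.0, "documents": 1.0},
    "logistics": {"port": 2.0, "shipping": 2.0, "cargo": 1.0},
    "text mining": {"documents": 1.0, "terms": 2.0, "clustering": 1.0},
}

doc = "Search engines return documents for a query; users search again."
print(assign_keywords(doc, KEYWORD_PROFILES))
# → ['information retrieval', 'text mining']
```

Note that neither assigned keyword occurs verbatim in the sample text, which is the property the abstract attributes to keyword assignment as opposed to keyword extraction.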

NFC-based Smartwork Service Model Design

  • 박아름;강민수;전정호;이경전
    • 지능정보연구 (Journal of Intelligence and Information Systems) / Vol. 19, No. 2 / pp. 157-175 / 2013
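  • This study presents an NFC-based smart worker networking service model and an NFC-based space management service model that can support and extend existing smartwork models such as telecommuting, smartwork centers, and mobile offices. Telecommuting and remote work were originally introduced to raise employee productivity, but negative opinions about the effectiveness of telecommuting have recently been raised, citing reduced productivity, difficulty in collaboration, and difficulty in managing attendance, and some companies are abolishing telecommuting. This paper therefore presents an NFC-based communication/SNS service model that can raise work productivity by supporting collaboration and communication among employees. It also proposes a location-based, real-time recruiting and job-seeking service model using NFC technology. Through partnerships with existing sharing-economy sites, this service model lets users touch an NFC tag and then find the personnel they need on a sharing-economy site, or offer their own skills or talents for free or for a fee, so that human resources are used efficiently. The NFC-based communication/SNS service model is characterized by low deployment cost, the ability to provide employees' location information, and the ability to accumulate knowledge. The NFC-based space management service model supports work not only in the office spaces where smartwork has mainly been carried out but also outside conventional workspaces, such as field sites and cafes; it is a service model focused on spatial expansion. This service model is characterized by low deployment cost, the ability to provide personalized services, the ability to build the system outside the company, and the ability to identify the location of employees inside and outside the company. To design these smartwork service models, this paper presents scenarios, examines the business model process and the roles and benefits of stakeholders, and compares the models with existing services to derive points of differentiation and implications. The service models presented in this paper do not replace existing service models but support and extend them, using NFC as a recognition technology so that companies can build more flexible smartwork systems. Smartwork has so far been implemented mainly by large companies, but the NFC-based smartwork service model is expected to broaden the range of companies that adopt smartwork and the range of members who can benefit from smartwork programs.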

Burqanism from the Origin of the Pastoral Nomadic Koryo Region and the Vision of Korean Livestock Farming

  • 주채혁
    • 한국초지조사료학회지 (Journal of the Korean Society of Grassland and Forage Science) / Vol. 25, No. 1 / pp. 71-82 / 2005
  • Khori(高麗) refers to the Chaabog(reindeer) that live on lichens(蘚) on Mt. Soyon(鮮), whose pastures lie on the cold and dry plateau of North Eurasia. Thus, the region of origin of the Khori or Koguryo, the ancestors of the reindeer-herding pastoral nomads(馴鹿 遊牧民), can be said to be the Steppe-Taiga-Tundra pastoral areas of North Eurasia and North America. When the pastoral nomads moved on to the great mountain(大山) zone stretching from the Jangbaek(長白) to the Baekdu(白頭) Mountains, they could have come into contact with the pastoral farmers or agricultural farmers living there, and they became farmers who remained on agricultural farms. They were the Koryo people, the ancestors of Korea. Staying in one place, they gradually forgot the origin of their reindeer-herding pastoral nomadic history in the area northwest of Mt. Soyon, the small mountain(小山) zone of the Steppe-Taiga-Tundra pastoral areas. In other words, they lost their identity as reindeer-herding pastoral nomads when they entered the agricultural area after leaving the pastoral area. However, since their basic genes had already formed when they lived on the cold and dry plateau of North Eurasia, it is possible to study their pastoral nomadic history focusing on 'the minority living in the broad area(廣域少數)' by utilizing highly advanced biotechnological science, focusing on genes and information technology innovation, and removing various past hindrances to research. Therefore, it is not so difficult to restore the reindeer-herding pastoral nomadic history of the Koguryo(高句麗) people and secure their pastoral nomadic identity, toward which the first steps have already been taken. The Eurasian continent and the Korean peninsula, especially the cold and dry plateau of North Eurasia and the Korean peninsula, have been closely related to each other ecologically and historically. They can never be treated as separate spaces. 
The Eurasian continent lies horizontally from east to west, and thus the continent forms an isothermal zone. Also, ever since people began producing their own food, it has been relatively easy for them to move with their technology to other places, owing to the pastoral nomadic characteristic of mobility. Unlike the Chungyen(中原) region, western Asia and the regions covering Siberia, Manchuria, and the Korean peninsula, where the food production revolution first took place, were connected to the Mongolian lichens route(蘚苔之路: Ni, ukinii jam) and the steppe roads. Although the ecological conditions of nature have changed somewhat over a long history, it was natural for the many tribes in North Asia living on the largest Steppe-Taiga-Tundra area in the world to have believed 'the legends related to animals in relation to their founders and ancestors(獸祖傳說)'. Assuming that Siberian tigers and the tigers living on Mt. Baekdu were connected ecologically and genetically because of the ecological characteristics of the animals and their migration from plateau to plateau, we would suspect that the Chosun(朝鮮) tribe living on Mt. Baekdu was ethnically and culturally more closely connected to the farther removed Ural-Altai tribes that lived on the cold and dry plateau region than to the Han(漢) tribe who lived in Chungyen(中原), which was close to Mt. Baekdu. Further evidence is the structure of the Korean language, which has the form 'Subject + Object + Verb' and is assumed to have originated from the speedy lifestyle of the reindeer-herding pastoral nomads. The structure is quite different from that of the Han(漢) language, which is based on agricultural life. Also, it is natural for reindeer-riding, reindeer-herding pastoral nomads or horse-riding, sheep-herding pastoral nomads(騎馬, 羊遊牧民) to have held military and political power over the region and eventually to have established an ancient pastoral nomadic empire in the process of their conquest of agricultural regions. 
The stages for founding global empires in the history of mankind may be largely divided into two, in terms of ecological conditions and occupations: the steppes and the oceans. The steppe-based empires were established on the basis of horse-handling skills and the ability to shoot arrows while riding, along with the use of iron ware in the 8th century BC. The steppe-based empires became the foundation for the oceanic empires, which could be established through the use of warships and warship guns from the 15th century onward. Based on those facts, we know that Chosun, Puyo(夫餘), and Koguryo are the products of a developmental process of pastoral nomadic empires on the steppes. We may find the pastoral nomadic identity of the Koguryo more easily than we expect when we trace the origins and history of the Korean tribe living in the pastures located northwest of Mt. Jangbaek by focusing on pastoral nomadic mobility and organization, just as we have investigated the historic origins of Anglo-Saxons in America by focusing on the times before the 15th century. In the process, we should keep in mind that English culture originated from the Industrial Revolution and was directly delivered to the American continent, although America was far from England and was not an intermediate point on long sojourns either. Further, American culture later came back to England in a more advanced form. The most important task at present is to lead Koreans to look back on their own history with a freer way of thinking and with diverse, profound, and sharp insight, setting aside the existing conventional understanding that is entangled with the complicated interests of the Korean people and other countries. The meanings of Chosun, Khori, and Solongos have been interpreted arbitrarily, without any historical evidence, by scholars who followed the conventional tradition of fixed-minded aristocrats in an agricultural society. 
If the Siberian cultural properties of the stone age, the earthenware age, the bronze age, and the iron age are analyzed in such a way, archaeological discovery will never be able to contribute to the restoration of the Koguryo's pastoral nomadic identity. One should move beyond the error of interpreting the cultural properties discovered in the pastoral nomadic regions as undifferentiated from those of agricultural regions and reading them all from the agricultural point of view. More careful attention is required in the interpretation of the cultural properties of ancient Korean empires, which seem to have been formed through the mutual interaction of pastoral nomadic and agricultural cultures. It is also necessary to sever the conventional recognition chain of 'reverse-genes', which has placed more weight on agricultural properties than on pastoral nomadic ones, since their settlement on agricultural farms came after the establishment of their ancient pastoral nomadic empires. There is no reason at all to prioritize stoneware, earthenware, bronze ware, and iron ware over wooden ware(木器) and other ware made of animal skins(皮器) or bones and horns(骨角器) in analyzing the history of the regions of reindeer or sheep pastures. Reading ancient Korean history from the perspective of pastoral nomadic history, one strongly feels the instinctive urge to return to the natural 'mother place'. The reindeer-herding pastoral nomadic identity of the Koguryo people, which has accumulated in volumes in their genes, lain hidden deep inside, and interacted organically, could be reborn with Burqanism(Burqan refers to 不咸 in Chinese), which was their religion by birth and was symbolized by the red willow(紅柳=不咸). 
The mother place of the Koguryo people is the endless, vast green pastures of North Eurasia and North America, where we anticipate the development of Korean livestock farming following the inherent properties in the genes of the reindeer-herding pastoral nomads, the ancestors of Koreans. We anticipate that the place will be a core resource that could contribute to the development of the life of living creatures following the inherent properties of their genes and biotechnological factors. In other words, biotechnology used in the search for clues to human well-being could be the fruit of the Burqanism of the Koguryo people and of the globalization of Korean livestock farming. It was the Chosun farmers in China, who came from the vast nomadic reindeer pastures of North Eurasia, who resolved the food problem of a billion Chinese people with lowland paddy rice seeds(水稻) by transforming Heilongjiang Province(黑龍江省) into an ocean of lowland paddy rice fields(水田). Even Mao Tse-tung(毛澤東) could not resolve the food problem through decades of revolutionary campaigns. Today is the very time that requires the development of special livestock farming following the inherent properties of the ancient Korean reindeer-herding pastoral nomads, who respected the dignity of life on the cold and dry plateau of North Eurasia and the American continent. I suggest that research should start from the pastures of the Dariganga Steppe in East Mongolia, which was the homeland of Hanwoo(韓牛) and the central horse-herding steppe(牧馬場) of Chingis Khan's Mongolia. The Dariganga Steppe has an abundant natural environment for pastoral nomadic living; however, the quality of life of the pastoral nomads there is still low. 
I suggest that we Koreans, the descendants of the Koguryo, should take the first steps in our livestock farming business project and in developing the Northern nomadic pastures here, at the pastures of the Dariganga Steppe, which was the Mongolian core place of state-of-the-art technology for military weapons.