A New Approach to Automatic Keyword Generation Using Inverse Vector Space Model (키워드 자동 생성에 대한 새로운 접근법: 역 벡터공간모델을 이용한 키워드 할당 방법)

  • Cho, Won-Chin;Rho, Sang-Kyu;Yun, Ji-Young Agnes;Park, Jin-Soo
    • Asia pacific journal of information systems / v.21 no.1 / pp.103-122 / 2011
  • Recently, numerous documents have been made available electronically. Internet search engines and digital libraries commonly return query results containing hundreds or even thousands of documents. In this situation, it is virtually impossible for users to examine complete documents to determine whether they might be useful. For this reason, some on-line documents are accompanied by a list of keywords specified by the authors in an effort to guide users by facilitating the filtering process. In this way, a set of keywords is often considered a condensed version of the whole document and therefore plays an important role in document retrieval, Web page retrieval, document clustering, summarization, text mining, and so on. Since many academic journals ask the authors to provide a list of five or six keywords on the first page of an article, keywords are most familiar in the context of journal articles. However, many other types of documents could also benefit from the use of keywords, including Web pages, email messages, news reports, magazine articles, and business papers. Although the potential benefit is large, the implementation itself is the obstacle: manually assigning keywords to all documents is a daunting, even impractical, task because it is extremely tedious and time-consuming and requires a certain level of domain knowledge. Therefore, it is highly desirable to automate the keyword generation process. There are two main approaches to achieving this aim: the keyword assignment approach and the keyword extraction approach. Both approaches use machine learning methods and require, for training purposes, a set of documents with keywords already attached. In the former approach, there is a given vocabulary, and the aim is to match its terms to the texts. In other words, the keyword assignment approach seeks to select the words from a controlled vocabulary that best describe a document. 
Although this approach is domain dependent and is not easy to transfer and expand, it can generate implicit keywords that do not appear in a document. In the latter approach, on the other hand, the aim is to extract keywords according to their relevance in the text without a prior vocabulary. In this approach, automatic keyword generation is treated as a classification task, and keywords are commonly extracted using supervised learning techniques. Thus, keyword extraction algorithms classify candidate keywords in a document into positive or negative examples. Several systems, such as Extractor and Kea, were developed using the keyword extraction approach. The most indicative words in a document are selected as keywords for that document; as a result, keyword extraction is limited to terms that appear in the document. Therefore, keyword extraction cannot generate implicit keywords that are not included in a document. According to the experimental results of Turney, about 64% to 90% of keywords assigned by the authors can be found in the full text of an article. Conversely, this means that 10% to 36% of the keywords assigned by the authors do not appear in the article and therefore cannot be generated by keyword extraction algorithms. Our preliminary experiment also shows that 37% of keywords assigned by the authors are not included in the full text. This is the reason why we have decided to adopt the keyword assignment approach. In this paper, we propose a new approach for automatic keyword assignment, namely IVSM (Inverse Vector Space Model). The model is based on the vector space model, which is a conventional information retrieval model that represents documents and queries by vectors in a multidimensional space. IVSM generates an appropriate keyword set for a specific document by measuring the distance between the document and the keyword sets. 
The keyword assignment process of IVSM is as follows: (1) calculating the vector length of each keyword set based on each keyword weight; (2) preprocessing and parsing a target document that does not have keywords; (3) calculating the vector length of the target document based on the term frequency; (4) measuring the cosine similarity between each keyword set and the target document; and (5) generating keywords that have high similarity scores. Two keyword generation systems were implemented applying IVSM: an IVSM system for a Web-based community service and a stand-alone IVSM system. The first IVSM system is implemented in a community service for sharing knowledge and opinions on current trends such as fashion, movies, social problems, and health information. The stand-alone IVSM system is dedicated to generating keywords for academic papers and has been tested on a number of academic papers, including those published by the Korean Association of Shipping and Logistics, the Korea Research Academy of Distribution Information, the Korea Logistics Society, the Korea Logistics Research Association, and the Korea Port Economic Association. We measured the performance of IVSM by the number of matches between the IVSM-generated keywords and the author-assigned keywords. According to our experiment, the precisions of IVSM applied to the Web-based community service and to academic journals were 0.75 and 0.71, respectively. The performance of both systems is much better than that of baseline systems that generate keywords based on simple probability. Also, IVSM shows performance comparable to that of Extractor, a representative keyword extraction system developed by Turney. As the number of electronic documents increases, we expect that the IVSM proposed in this paper can be applied to many electronic documents in Web-based communities and digital libraries.
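
The five steps listed in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how keyword weights are learned, so the sketch assumes each candidate keyword is described by a hypothetical dictionary of term weights, uses whitespace tokenization in place of real preprocessing, and uses raw term frequency for the document vector. The names `assign_keywords`, `precision`, and the toy data are illustrative assumptions.

```python
import math
from collections import Counter


def vector_length(weights):
    """Euclidean length of a sparse vector stored as {term: weight} (steps 1 and 3)."""
    return math.sqrt(sum(w * w for w in weights.values()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors (step 4)."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    len_a, len_b = vector_length(a), vector_length(b)
    if len_a == 0 or len_b == 0:
        return 0.0
    return dot / (len_a * len_b)


def assign_keywords(document, keyword_vectors, top_n=2):
    """Rank candidate keywords by similarity to the document (steps 2 to 5).

    keyword_vectors is an assumed structure: {keyword: {term: weight}}.
    """
    # Step 2 (simplified): lowercase and split on whitespace.
    doc_vector = Counter(document.lower().split())
    scored = [
        (cosine_similarity(vec, doc_vector), keyword)
        for keyword, vec in keyword_vectors.items()
    ]
    # Highest similarity first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [keyword for score, keyword in scored[:top_n] if score > 0]


def precision(generated, author_assigned):
    """Share of generated keywords that match author-assigned keywords."""
    if not generated:
        return 0.0
    matches = len(set(generated) & set(author_assigned))
    return matches / len(generated)


# Toy data, invented for illustration only.
keyword_vectors = {
    "logistics": {"port": 2.0, "shipping": 3.0, "cargo": 1.0},
    "information retrieval": {"query": 2.0, "document": 3.0, "search": 2.0},
    "fashion": {"style": 2.0, "trend": 1.0},
}
doc = "shipping cargo moves through the port and shipping firms track cargo"
print(assign_keywords(doc, keyword_vectors))
```

Because the candidate keywords come from a fixed vocabulary rather than from the document text, a keyword such as "logistics" can be assigned even though the word itself never appears in the document, which is the property the abstract contrasts with keyword extraction.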

NFC-based Smartwork Service Model Design (NFC 기반의 스마트워크 서비스 모델 설계)

  • Park, Arum;Kang, Min Su;Jun, Jungho;Lee, Kyoung Jun
    • Journal of Intelligence and Information Systems / v.19 no.2 / pp.157-175 / 2013
  • Since the Korean government announced its 'Smartwork promotion strategy' in 2010, Korean firms and government organizations have started to adopt smartwork. However, smartwork has been implemented only in a few large enterprises and government organizations rather than in SMEs (small and medium enterprises). In the USA, both Yahoo! and Best Buy have stopped their flexible work programs because of reported low productivity and job loafing problems. In addition, from the literature on smartwork, we identify obstacles to smartwork adoption and categorize them into three types: institutional, organizational, and technological. The first category, institutional obstacles, includes the difficulty of defining smartwork performance evaluation metrics, the lack of readiness of organizational processes, the limitation of smartwork types and models, the lack of employee participation in the smartwork adoption procedure, the high cost of building a smartwork system, and the insufficiency of government support. The second category, organizational obstacles, includes the limitation of the organizational hierarchy, wrong perceptions of employees and employers, difficulty in close collaboration, low productivity with remote coworkers, insufficient understanding of remote working, and lack of training about smartwork. The third category, technological obstacles, includes security concerns about mobile work, lack of specialized solutions, and lack of adoption and operation know-how. To overcome the current problems of smartwork in practice and the obstacles reported in the literature, we suggest a novel smartwork service model based on NFC (Near Field Communication). This paper suggests an NFC-based Smartwork Service Model composed of an NFC-based Smartworker networking service and an NFC-based Smartwork space management service. The NFC-based smartworker networking service comprises an NFC-based communication/SNS service and an NFC-based recruiting/job-seeking service. 
The NFC-based communication/SNS service model addresses the key shortcomings of the existing smartwork service model. By connecting to a company's existing legacy system through NFC tags and systems, it can overcome low productivity and the difficulty of collaboration and attendance management, since managers can obtain employees' work processing, work time, and work space information, and employees can communicate with coworkers in real time and obtain their location information. In short, this service model has features such as affordable system cost, provision of location-based information, and the possibility of knowledge accumulation. The NFC-based recruiting/job-seeking service provides new value by linking the NFC tag service with sharing economy sites. This service model has features such as ease of service attachment and removal, efficient space-based work provision, easy search of location-based recruiting/job-seeking information, and system flexibility. This service model combines the advantages of sharing economy sites with the advantages of NFC. Through cooperation with sharing economy sites, the model can provide recruiters with human resources seeking not only long-term but also short-term work. Additionally, SMEs (small and medium-sized enterprises) can easily find job seekers by attaching NFC tags to any space where qualified human resources may be located. In short, this service model supports efficient human resource distribution by providing the locations of job hunters and job applicants. The NFC-based smartwork space management service can promote smartwork by linking NFC tags attached to the work space with the existing smartwork system. This service has features such as low cost, provision of indoor and outdoor location information, and customized service. In particular, this model can help small companies adopt a smartwork system because it is lightweight and cost-effective compared to existing smartwork systems. 
This paper proposes the scenarios of the service models, the roles and incentives of the participants, and a comparative analysis. The superiority of the NFC-based smartwork service model is shown by comparing and analyzing the new service models and the existing service models. The service model can expand the scope of enterprises and organizations that adopt smartwork and the scope of employees that take advantage of smartwork.

Burqanism from the Origin of the Pastoral Nomadic Koryo Region and the Vision of Korean Livestock Farming (고려의 원시영역 유목초지, 그 부르칸(불함)이즘과 한국축산의 비전)

  • Chu Chae Hyok
    • Journal of The Korean Society of Grassland and Forage Science / v.25 no.1 / pp.71-82 / 2005
  • Khori(高麗) refers to the Chaabog(reindeer) that live on lichens(蘚) on Mt. Soyon(鮮), whose pastures are the cold and dry plateau of North Eurasia. Thus, the region of origin of the Khori or Koguryo, the ancestors of the reindeer-herding pastoral nomads(馴鹿 遊牧民), can be said to be the Steppe-Taiga-Tundra pastoral areas of North Eurasia and North America. When the pastoral nomads moved on to the great mountain(大山) zone from the Jangbaek(長白) to the Baekdu(白頭) Mountains, they could have come into contact with pastoral farmers or agricultural farmers living there, and they became farmers remaining on agricultural farms. They were the Koryo people, the ancestors of Korea. Staying in one place, they gradually forgot the origin of their reindeer-herding pastoral nomadic history in the area northwest of Mt. Soyon, the small mountain(小山) zone of the Steppe-Taiga-Tundra pastoral areas. In other words, they lost their identity as reindeer-herding pastoral nomads when they entered the agricultural area after leaving the pastoral area. However, since their basic genes had already formed when they lived on the cold and dry plateau of North Eurasia, it is possible to study their pastoral nomadic history focusing on 'the minority living in the broad area(廣域少數)' by utilizing highly advanced biotechnological science, focusing on genes and information technology innovation, and removing various past hindrances to research. Therefore, it is not so difficult to restore the reindeer-herding pastoral nomadic history of the Koguryo(高句麗) people and secure their pastoral nomadic identity, toward which the first steps have already been taken. The Eurasian continent and the Korean peninsula, especially the cold and dry plateau of North Eurasia and the Korean peninsula, have been closely related to each other ecologically and historically. They can never be treated as separate spaces. 
The Eurasian continent lies horizontally east to west and thus forms an isothermal zone. Also, since the time people began producing their own food, it was relatively easy for them to move to other places with their technology, owing to the pastoral nomadic characteristic of mobility. Unlike the Chungyen(中原) region, western Asia and the regions covering the Siberia-Manchu-Korean peninsula, where the food production revolution first took place, were connected to the Mongolian lichens route(蘚苔之路: Ni, ukinii jam) and the steppe roads. Although the ecological conditions of nature have changed somewhat over a long history, it was natural for the many tribes in North Asia living on the largest Steppe-Taiga-Tundra area in the world to have believed 'the legends related to animals in relation to their founders and ancestors(獸祖傳說)'. Assuming that Siberian tigers and the tigers living on Mt. Baekdu were connected ecologically and genetically because of the ecological characteristics of the animals and their migration from plateau to plateau, we would suspect that the Chosun(朝鮮) tribe living on Mt. Baekdu were ethnically and culturally more closely connected to the more distant Ural-Altai tribes that lived on the cold and dry plateau region than to the Han(漢) tribe who lived in Chungyen(中原), which was close to Mt. Baekdu. Further evidence is the structure of the Korean language, which has the form 'Subject + Object + Verb' and is assumed to have originated from the speedy lifestyle of the reindeer-herding pastoral nomads. The structure is quite different from that of the Han(漢) language, which is based on agricultural life. Also, it is natural for reindeer-riding reindeer-herding pastoral nomads or horse-riding sheep-herding pastoral nomads(騎馬, 羊遊牧民) to have held military and political power over the region and eventually to have established an ancient pastoral nomadic empire in the process of their conquest of agricultural regions. 
The stages for founding global empires in the history of mankind may be largely divided into two, in terms of ecological conditions and occupations: the steppes and the oceans. The steppe-based empires were established on the skills to handle horses and the ability to shoot arrows while riding, along with the use of iron ware from the 8th century BC. The steppe-based empires became the foundation for the oceanic empires, which could be established through the use of warships and warship guns from the 15th century. Based on those facts, we know that Chosun, Puyo(夫餘), and Koguryo are the products of a developmental process of pastoral nomadic empires on the steppes. We may find the pastoral nomadic identity of the Koguryo more easily than we expected when we trace the origins and history of the Korean tribe living in the pastures located northwest of Mt. Jangbaek by focusing on pastoral nomadic mobility and organization, just as we have investigated the historic origins of Anglo-Saxons in America by focusing on the times before the 15th century. In the process, we should keep in mind that English culture originated from the Industrial Revolution and was directly delivered to the American continent, although America was far from England and was not an intermediate point on long sojourns either. Further, American culture later came back to England in a more advanced form. The most important task at present is to encourage Koreans to look back on their own history with a freer way of thinking and with diverse, profound, and sharp insight, setting aside the conventional views that are entangled with the complicated interests of the Korean people and other countries. The meanings of Chosun, Khori, and Solongos have been interpreted arbitrarily, without any historical evidence, by scholars who followed the conventional tradition of fixed-minded aristocrats in an agricultural society. 
If the Siberian cultural properties of the stone age, the earthenware age, the bronze age, and the iron age are analyzed in such a way, archaeological discovery will never be able to contribute to the restoration of the Koguryo's pastoral nomadic identity. One should move beyond the error of treating the cultural properties discovered in pastoral nomadic regions as undifferentiated from those of agricultural regions and interpreting them all from the agricultural point of view. More careful attention is required in the interpretation of the cultural properties of ancient Korean empires, which seem to have been formed through the mutual interaction of pastoral nomadic and agricultural cultures. Also, the conventional recognition chain of 'reverse-genes', which has placed more weight on agricultural properties than on pastoral nomadic ones, must be severed, since their settlement on agricultural farms came after the establishment of their ancient pastoral nomadic empires. There is no reason at all to place priority on stoneware, earthenware, bronze ware, and iron ware over wooden ware(木器) and other ware made of animal skins(皮器), bones, and horns(骨角器) in analyzing the history of the regions of reindeer or sheep pastures. Reading ancient Korean history from the perspective of pastoral nomadic history, one strongly feels the instinctive urge to return to the natural 'mother place'. The reindeer-herding pastoral nomadic identity of the Koguryo people, which has accumulated in their genes, lain hidden deep inside, and interacted organically, could be reborn with Burqanism(Burqan refers to 不咸 in Chinese), which was their religion by birth and was symbolized by the red willow(紅柳=不咸). 
The mother place of the Koguryo people is the endless vast green pastures of North Eurasia and North America, where we anticipate the development of Korean livestock farming following the inherent properties in the genes of the reindeer-herding pastoral nomads who were Korean ancestors. We anticipate that the place will be the core resource that could contribute to the development of the life of living creatures following the inherent properties of their genes and biotechnological factors. In other words, biotechnology used in the search for clues to the well-being of humans could be the fruit of the Burqanism of the Koguryo people and of the globalization of Korean livestock farming. It was the Chosun farmers in China, who came from the vast nomadic reindeer pastures of North Eurasia, that resolved the food problem of a billion Chinese people with lowland paddy rice seeds(水稻) by transforming Heilongjiang Province(黑龍江省) into an oceanic lowland paddy rice field(水田). Even Mao Tse-tung(毛澤東) could not resolve the food problem through decades of revolutionary campaigns. Today is the very time that requires the development of special livestock farming following the inherent properties of the ancient Korean reindeer-herding pastoral nomads, who respected the dignity of life on the cold and dry plateau of North Eurasia and the American continent. I suggest that research should start from the pastures of the Dariganga Steppe in East Mongolia, which was the homeland of Hanwoo(韓牛) and the central horse-herding steppe place(牧馬場) of Chingis Khan's Mongolia. The Dariganga Steppe offers an affluent natural environment for pastoral nomadic living; however, the quality of life of the pastoral nomads there is still low. 
I suggest that we Koreans, the descendants of the Koguryo, should take the first steps in our livestock farming business project and develop the Northern nomadic pastures here at the pastures of the Dariganga Steppe, which is the Mongolian core place of state-of-the-art technology for military weapons.