• Title/Summary/Keyword: subject indexing

Methods for Integration of Documents using Hierarchical Structure based on the Formal Concept Analysis (FCA 기반 계층적 구조를 이용한 문서 통합 기법)

  • Kim, Tae-Hwan;Jeon, Ho-Cheol;Choi, Joong-Min
    • Journal of Intelligence and Information Systems / v.17 no.3 / pp.63-77 / 2011
  • The World Wide Web is a very large distributed digital information space. From its origins in 1991, the web has grown to encompass diverse information resources such as personal home pages, online digital libraries and virtual museums. Some estimates suggest that the web currently includes over 500 billion pages in the deep web. The ability to search and retrieve information from the web efficiently and effectively is an enabling technology for realizing its full potential. With powerful workstations and parallel processing technology, efficiency is not a bottleneck. In fact, some existing search tools sift through gigabyte-size precompiled web indexes in a fraction of a second. But retrieval effectiveness is a different matter. Current search tools retrieve too many documents, of which only a small fraction are relevant to the user query. Furthermore, the most relevant documents do not necessarily appear at the top of the query output order. Also, current search tools cannot retrieve documents related to a retrieved document from a gigantic collection of documents. The most important problem for many current search systems is improving the quality of search, which means providing related documents and reducing the number of unrelated documents in the search results as far as possible. To address this problem, CiteSeer proposed ACI (Autonomous Citation Indexing) of articles on the World Wide Web. A "citation index" indexes the links between articles that researchers make when they cite other articles. Citation indexes are very useful for a number of purposes, including literature search and analysis of the academic literature. In more detail, references contained in academic articles are used to give credit to previous work in the literature and provide a link between the "citing" and "cited" articles. A citation index indexes the citations that an article makes, linking the articles with the cited works.
Citation indexes were originally designed mainly for information retrieval. The citation links allow navigating the literature in unique ways. Papers can be located independent of language and of the words in the title, keywords or document. A citation index allows navigation backward in time (the list of cited articles) and forward in time (which subsequent articles cite the current article?). But CiteSeer cannot index links between articles that researchers do not make, because it indexes only the links that researchers create when they cite other articles. For the same reason, CiteSeer does not scale easily. These problems motivate us to design a more effective search system. This paper presents a method that extracts the subject and predicate of each sentence in a document. A document is converted into a tabular form in which each extracted predicate is checked against its possible subjects and objects. We build a hierarchical graph of a document from the table and then integrate the graphs of multiple documents. From the graph of the entire document collection, we calculate the area of each document relative to the integrated documents, and we mark relations among the documents by comparing their areas. The paper also proposes a method for structural integration of documents that retrieves documents from the graph, which makes it easier for the user to find information. We compared the performance of the proposed approaches with the Lucene search engine using the formulas for ranking. As a result, the F-measure is about 60%, which is about 15% better.
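The table-to-hierarchy step described in this abstract can be illustrated with a minimal Formal Concept Analysis sketch. It is not the authors' implementation: the toy table, the term and predicate names, and the function `formal_concepts` are all hypothetical, and the brute-force enumeration is only suitable for very small tables.

```python
from itertools import combinations


def formal_concepts(context):
    """Enumerate all formal concepts (extent, intent) of a binary table.

    `context` maps each object (here: a term) to the set of attributes
    (here: predicates) checked for it in the table.
    """
    objects = list(context)
    all_attrs = set().union(*context.values()) if context else set()

    def common_attrs(objs):
        # Attributes shared by every object in objs (all attributes if empty).
        shared = set(all_attrs)
        for obj in objs:
            shared &= context[obj]
        return shared

    def objects_with(attrs):
        # Objects that have every attribute in attrs.
        return {obj for obj in objects if attrs <= context[obj]}

    concepts = set()
    # Closing every subset of objects yields every concept (exponential cost).
    for size in range(len(objects) + 1):
        for subset in combinations(objects, size):
            intent = common_attrs(subset)
            extent = objects_with(intent)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


# Hypothetical table: rows are terms, columns are predicates.
toy_context = {
    "citation index": {"indexes", "links"},
    "search tool": {"indexes", "retrieves"},
    "CiteSeer": {"indexes", "links", "retrieves"},
}

concepts = formal_concepts(toy_context)
# Ordering concepts by extent inclusion gives the hierarchical graph.
for extent, intent in sorted(concepts, key=lambda c: -len(c[0])):
    print(sorted(extent), sorted(intent))
```

Ordering the resulting concepts by inclusion of their extents yields the lattice that an FCA-based approach would use as the hierarchical graph; integrating several documents would then amount to building the lattice over their merged tables, which is an interpretation rather than a detail stated in the abstract.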

A Curricular Study on AI & ES in Library and Information Science (문헌정보학에서의 인공지능과 전문가시스템 교육과정 연구)

  • Koo Bon-Young;Park Mi-Young
    • Journal of the Korean Society for Library and Information Science / v.32 no.2 / pp.211-232 / 1998
  • The purpose of this study is to specify the contents of Library and Information Science education needed to train information professionals to meet changes in the technology and systems environment. In particular, recognizing the necessity of Artificial Intelligence and Expert Systems (AI and ES) required by the changing environment of the latest information technology, this work also aims to provide fundamental data and suggest which contents of AI and ES should be introduced to Library and Information Science, and how. The results are summarized as follows. 1. Due to the rapid change of advanced information technology and computer applications, the most essential points are, in order of importance: finding available network sources, indexing online databases, analyzing and designing information systems, and computer application ability. 2. Among the contents of AI and ES, the most important training areas for Library and Information Science are database handling, thesauri, natural language processing, and knowledge representation. 3. Library and information science professors recognize that more Library and Information Science students need to be educated in artificial intelligence and expert systems. 4. In the coming age, it is increasingly recognized that artificial intelligence and expert systems will improve information professionals' work in reference service, cataloging, classification, information retrieval, and document delivery. 5. In line with library and information science professors' increasing recognition of the importance of AI and ES, a curriculum on AI and ES is to be introduced into library and information science curricula in the nation, in order of importance (see 1. above).

A Study of the Curriculum Operating Model and Standard Courses for Library & Information Science in Korea (한국문헌정보학 교과과정 운영모형 및 표준교과목 개발에 관한 연구)

  • Noh, Young-Hee;Ahn, In-Ja;Choi, Sang-Ki
    • Journal of the Korean Society for Library and Information Science / v.46 no.2 / pp.55-82 / 2012
  • This study seeks to develop a curriculum operating model for Korean Library and Information Science, based on investigations into LIS curricula at home and abroad. Standard courses that can be applied to this model are also proposed. The study comprehensively analyzed the contents of domestic and foreign curricula and surveyed current librarians in all types of library fields. As a result, it proposes required courses, core courses, and elective courses. The six required LIS courses are: Introduction to Library and Information Science, Information Organization, Information Services, Library and Information Center Management, Information Retrieval, and Field Work. The six core LIS courses are: Classification & Cataloging Practice, Subject Information Resources, Collection Development, Digital Library, Introduction to Bibliography, and Introduction to Archive Management. The twenty elective LIS courses cover: the General Library and Information Science area (Cultural History of Information, Information Society and Library, Library and Copyright, Research Methods in Library and Information Science), the Information Organization area (Metadata Fundamentals, KORMARC Practice), the Information Services area (Information Literacy Instruction, Reading Guidance, Information User Study), the Library and Information Center Management area (Library Management, including management for different kinds of libraries, Library Information Cooperator, Library Marketing, Non-book Material and Multimedia Management (Contents Management)), the Information Science area (Database Management, including Web DB Management, Indexing and Abstracting, Introduction to Information Science, Understanding Information Science, Automated System of Library, Library Information Network), and the Archival Science area (Preservation Management).

A Study on Creation and Development of Folksonomy Tags on LibraryThing (폭소노미 태그의 생성과 성장에 관한 연구 - LibraryThing을 중심으로 -)

  • Kim, Dong-Suk;Chung, Yeon-Kyoung
    • Journal of the Korean Society for Library and Information Science / v.44 no.4 / pp.203-230 / 2010
  • This study analyzed the development and growth of folksonomy by examining tags associated with 40 bestsellers on LibraryThing.com at 6-month intervals. It was found that tag values do not decrease but grow in terms of quantity and quality. Accordingly, we examined the major significance of the tags and their potential use as an expression of subjects. Our findings were as follows. First, the motivations for tagging can be categorized into personal information for search purposes, self-fulfillment such as sense of achievement, display of emotion and sharing of one's experience with others, or an altruistic objective that emphasizes sociality with a desire that one's actions might provide social benefits. According to our analysis, 74.12% of tags had a social motivation. Second, the total number of tags and the frequency of usage increased with time. Third, the categories that showed a high increase in tag usage were dates of publication and reading, key words, main characters, and book reviews. Tags related to subjects had the highest ratio. Fourth, among Library of Congress Subject Headings (LCSH), multiple genres, key words and main characters were assigned to books, and specific key words and other properties were added as time progressed. There was also a slight increase in the number of tags consistent with LCSH. Fifth, we found that key tags could serve as a compilation of terms that reflects the knowledge base of the corresponding era. Thus, folksonomy should be continuously monitored for the quantitative and qualitative development of its tags in order to improve on its formative disadvantages and identify internal semantic significance, and it should be actively utilized in conjunction with taxonomy as a flexible compilation of terms that incorporates the history of a specific era.

An Efficient Thumbnail Extraction Method in H.264/AVC Bitstreams (H.264/AVC 비트스트림에서 효율적으로 축소 영상을 추출 하는 방법)

  • Yu, Sang-Jun;Yoon, Myung-Keun;Kim, Eun-Seok;Sohn, Chae-Bong;Sim, Dong-Gyu;Oh, Seoung-Jun
    • Journal of Broadcast Engineering / v.13 no.2 / pp.222-235 / 2008
  • Recently, with the growth of high-definition media services such as HDTV and IPTV, fast moving-picture manipulation techniques are needed to meet the requirements of those services. In particular, a fast method for extracting reduced-size images is required in the areas of video indexing and video summarization. Conventional DC image extraction methods, however, cannot be applied to H.264/AVC streams, since a spatial-domain prediction scheme is adopted in H.264/AVC intra mode. In this paper, we propose a theoretical method for extracting a thumbnail image from an H.264/AVC intra frame in the frequency domain. The proposed scheme can extract the thumbnail very quickly, since all operations are applied directly to transform coefficients: after a general equation for thumbnail extraction in the nine H.264/AVC intra prediction modes is introduced, a LUT (look-up table) for each mode is designed. Implementation and performance evaluation show that, while the subjective quality difference between the output of our scheme and a conventional output is negligible, our scheme extracts the thumbnail up to 63% faster.

Simulation of Pension Finance and Its Economic Effects (연금재정(年金財政) 시뮬레이션과 경제적(經濟的) 파급효과(波及效果))

  • Min, Jae-sung;Kim, Yong-ha
    • KDI Journal of Economic Policy / v.13 no.1 / pp.115-134 / 1991
  • The role of pension plans in the macroeconomy has been a subject of much interest for some years. It has come to be recognized that pension plans may alter basic macroeconomic behavior patterns. The net effects on both savings and labor supply are thus matters for speculation. The aim of the present paper is to provide quantitative results which may be helpful in attaching orders of magnitude to some of the possible effects. We are not concerned with providing empirical evidence relating to actual behavior, but rather with deriving the macroeconomic implications of alternative possibilities. The pension plan interacts with the economy and the population in a number of ways. Demographic variables may thus affect both the economic burden of a national pension plan and the ability of the economy to sustain the burden. The tax transfer process associated with the pension plan may have implications for national patterns of saving and consumption. The existence of a pension plan may also have implications for the size of the labor force, inasmuch as labor force participation rates may be affected. Changes in technology and the associated changes in average productivity levels bear directly on the size of the national income, and hence on the pension contribution base. The vehicle for the analysis is a hypothetical but broadly realistic simulation model of an economic-demographic system into which a national pension plan is inserted. All income, expenditure, and related aggregates are in real terms. The economy is basically neoclassical; full employment is assumed, output is generated by a Cobb-Douglas production process, and factors receive their marginal products. The model was designed for use in computer simulation experiments. The simulation results suggest a number of general conclusions.
These may be summarized as follows: - The introduction of a national pension plan (funded system) tends to increase the rate of economic growth until cost exceeds revenue. - A scheme with full wage indexing is more expensive than one in which pensions are merely price indexed. - The rate of technical progress is not a critical element in determining the economic burden of the pension scheme. - Raising the rate of benefits affects the economic burden, and raising the age of eligibility may decrease the burden substantially. - The level of fertility is an element in determining the long-run burden. A sustained low fertility rate increases the proportion of the aged in the total population and increases the burden of the pension plan. High fertility has the opposite effect.
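The production block mentioned in this abstract (Cobb-Douglas output with factors paid their marginal products) can be sketched as follows. This is an illustration only, not the authors' simulation model: constant returns to scale, the capital share `alpha = 0.3`, the productivity level `tfp` and the input values are all assumptions that do not come from the paper.

```python
def cobb_douglas(capital, labor, tfp=1.0, alpha=0.3):
    """Output Y = A * K**alpha * L**(1 - alpha), assuming constant returns."""
    return tfp * capital ** alpha * labor ** (1.0 - alpha)


def factor_prices(capital, labor, tfp=1.0, alpha=0.3):
    """Marginal products of capital and labor under the function above."""
    output = cobb_douglas(capital, labor, tfp, alpha)
    rental_rate = alpha * output / capital   # dY/dK
    wage = (1.0 - alpha) * output / labor    # dY/dL
    return rental_rate, wage


# Assumed inputs, chosen only for illustration.
capital, labor = 100.0, 50.0
output = cobb_douglas(capital, labor)
rental_rate, wage = factor_prices(capital, labor)

# With constant returns, factor payments exactly exhaust output.
print(output, rental_rate * capital + wage * labor)
```

Under these assumptions the wage bill is a fixed share `1 - alpha` of output, so anything that raises output per worker also raises a wage-based pension contribution base, which is consistent with the abstract's remark that productivity bears on the contribution base.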

An Analysis of IT Trends Using Tweet Data (트윗 데이터를 활용한 IT 트렌드 분석)

  • Yi, Jin Baek;Lee, Choong Kwon;Cha, Kyung Jin
    • Journal of Intelligence and Information Systems / v.21 no.1 / pp.143-159 / 2015
  • Predicting IT trends has long been an important subject in information systems research. IT trend prediction makes it possible to recognize emerging eras of innovation and to allocate budgets in preparation for rapidly changing technological trends. Towards the end of each year, various domestic and global organizations predict and announce IT trends for the following year. For example, Gartner predicts the top 10 IT trends for the next year, and these predictions affect the basic assumptions of IT and industry leaders and organizations about technology and the future of IT, but the accuracy of these reports is difficult to verify. Social media data can be a useful tool for verifying that accuracy. As social media services have gained in popularity, they are used in a variety of ways, from posting about personal daily life to keeping up to date with news and trends. In recent years, rates of social media activity in Korea have reached unprecedented levels. Hundreds of millions of users now participate in online social networks and share their opinions and thoughts with colleagues and friends. In particular, Twitter is currently the major microblog service; its key feature, the 'tweet', lets users report their current thoughts and actions, comment on news and engage in discussions. For an analysis of IT trends, we chose tweet data because Twitter not only produces massive unstructured textual data in real time but also serves as an influential channel for opinion leading on technology. Previous studies found that tweet data provide useful information and detect social trends effectively; these studies also found that Twitter can track issues faster than other media such as newspapers. Therefore, this study investigates how frequently the IT trends predicted for the following year by public organizations are mentioned on social network services like Twitter.
IT trend predictions for 2013, announced near the end of 2012 by two domestic organizations, the National IT Industry Promotion Agency (NIPA) and the National Information Society Agency (NIA), were used as the basis for this research. The present study analyzes Twitter data generated in Seoul (Korea) and compares them with the predictions of the two organizations to analyze the differences. Twitter data analysis requires various natural language processing techniques, including stop-word removal and noun extraction, for processing various unrefined forms of unstructured data. To overcome these challenges, we used SAS IRS (Information Retrieval Studio), developed by SAS, to capture trends while processing big streaming datasets from Twitter in real time. The system offers a framework for crawling, normalizing, analyzing, indexing and searching tweet data. As a result, we crawled the entire Twitter sphere in the Seoul area and obtained 21,589 tweets in 2013 to review how frequently the IT trend topics announced by the two organizations were mentioned by people in Seoul. The results show that most IT trends predicted by NIPA and NIA were frequently mentioned on Twitter, except for some topics such as 'new types of security threat', 'green IT' and 'next generation semiconductor'; since these topics are not generalized compound words, they may be mentioned on Twitter in other words. To determine whether the IT trend tweets from Korea are related to the following year's real-world IT trends, we compared Twitter's trending topics with those in Nara Market, Korea's nationwide web-based e-procurement system, which handles the whole procurement process of all public organizations in Korea. The correlation analysis shows that tweet frequencies on IT trend topics predicted by NIPA and NIA are significantly correlated with the frequencies of IT topics mentioned in project announcements on Nara Market in 2012 and 2013.
The main contributions of our research are as follows: i) the IT topic predictions announced by NIPA and NIA can provide an effective guideline for IT professionals and researchers in Korea who are looking for verified IT topic trends for the following year, and ii) researchers can use Twitter to obtain useful ideas for detecting and predicting dynamic trends in technological and social issues.

Analysis of Authority Control System in Collecting Repository -from the case of Archival Management System in Korea Democracy Foundation- (수집형 기록관의 전거제어시스템 분석 - 민주화운동기념사업회 사료관리시스템의 사례를 중심으로 -)

  • Lee, Hyun-Jeong
    • The Korean Journal of Archival Studies / no.13 / pp.91-134 / 2006
  • In general, personally collected archives (manuscripts) are in poor physical condition, and contextual information about the archives and the history of their production is mostly only partially captured in the manuscripts. A collecting repository therefore needs to control effectively the names of the producers of archives collected in various ways, and to accumulate provenance information, which is the key element for understanding the background of production. Authority control and provenance information management must be organized from the beginning of acquisition, which means collecting the necessary information with the acquisition control process in mind as well. This thesis aims to verify the necessity of authority control and the accumulation of provenance information in a collecting repository, and to suggest points to consider when building an archival authority system. To this end, the thesis examines the necessity of authority control in archival management and investigates the standards, work process and accumulation process of archival authority control. Archival provenance information management and authority control in the archival authority control system are organized through all stages of archival management, from the lead file at acquisition to the names of the producers at archival registration and archival description. A large amount of information is registered and described at the proper point in time, and finally all the information, including the authority control that controls the headings in authority management, must be organized so that it can be used for the intellectual management of archives and as finding aids. The features of the archival authority system are as follows. First, the authority file types necessary for archival authority control of the democracy movement consist of names of groups, persons, events and terminology (subject names).
Second, the basic record structure and description elements of the authority collection of the Korea Democracy Foundation Archives follow paragraph 1 of ISAAR(CPF) with some necessary elements added, while details of the description rules, such as word spacing and the use of periods, follow paragraph 4 of KCR, adapted to the features of the archival management system. The input method for authority records is based on EAC (Encoded Archival Context). Third, the system lets users reach the sources they want more easily by connecting authority terms systematically: related terms can be linked in varied and concrete ways as broader and narrower terms and as earlier and later terms, expanding the term relations beyond the traditional authority system, which usually expresses only related terms (see also). Thus the authority control of the archival management system can effectively collect and manage information on the functions and main activities of diverse groups, in addition to its own function of controlling headings, and can express the multiple and intermediary relationships between archives and producers or among producers. It also provides an expanded record information service that satisfies users' various requests through an indexing service. Finally, this case of authority management applying the international standard ISAAR(CPF) can serve as a reference for building archival authority systems in collecting repositories in the future, by reorganizing the description elements into an appropriate form and setting up the authority file types to be managed for each service.