• Title/Summary/Keyword: Document Representation


A Study of Characteristics and Types of Congressional Records (의회기록의 특질과 종류)

  • Lee, Won-young
    • The Korean Journal of Archival Studies
    • /
    • no.9
    • /
    • pp.110-142
    • /
    • 2004
  • This paper examines what congressional records are, as one of the core fields of national records, and what kinds of significant records they comprise. The "substantive records" of an institution, among public records, have the following characteristics: first, their contents depend on the inherent function of the institution; second, their types differ according to the character of the basic unit (member); third, their sources are determined by the character of the institution. From these points of view, the paper presents the contents, characteristics, main sources, and types by source of congressional records, summarized as follows. Chapter 2 analyzes the inherent function and structural uniqueness of congress. The substantive records that document the intrinsic function of congress consist of "legislative records", "oversight records", and "political activity records", which follow from the inherent role of congress as the people's representative body. The typical nature of congressional records stems from the fact that the basic unit of congress is the individual congressman, who is an independent national institution, and that congress is a council of these congressmen. First, the records of congressmen, as the basic members of congress, are national records that take the form of personal records. Second, "council records" produced by the councils (committees and the main conference) have evidential and informational value for decision making through the process of investigating, discussing, and voting on the bills and policies (items) that form the basis of national management; they are a very special kind of record, comprising item records, decision records, journal records, and congress assistant records. Because the congressmen and councils that compose congress stand in an equal relationship within its structure, the main sources of congressional records are the individual congressmen and all the councils. 
Chapter 3 describes the contents and kinds of the main records, centering on congressmen and councils as the main sources of congressional records. In place of a conclusion, Chapter 4 raises the management of congressmen's records as an urgent issue for the management of congressional records.

Evaluating Records and Their Descriptive Elements in the Records Management of Korea on the Basis of the Characteristics of a Record and Recordkeeping Metadata Standards (기록의 속성과 메타데이터 표준을 통해 본 한국의 기록·기록기술)

  • Kim, Ik-han
    • The Korean Journal of Archival Studies
    • /
    • no.10
    • /
    • pp.3-26
    • /
    • 2004
  • ISO 15489:2001 sets out the principles and requirements with which organizations, both public and private, should comply in managing their records, to ensure that adequate records are created, captured and managed. The standard defines the characteristics that a record should have within a records management system: authenticity, reliability, integrity, and usability. Authenticity means that a record can be proven to be what it purports to be, to have been created or sent by the person purported to have created or sent it, and to have been created or sent at the time purported. Reliability means that the contents of a record can be trusted as a full and accurate representation of the transactions, activities or facts to which they attest and can be depended upon in the course of subsequent transactions or activities. Integrity refers to a record being complete and unaltered. Usability means that a record can be located, retrieved, presented and interpreted. To have these characteristics, a record should be persistently linked to the metadata necessary to document a transaction. Metadata is "data describing context, content and structure of records and their management through time." Metadata ensures the creation and maintenance of authentic, reliable and usable records and the protection of the integrity of those records. This can be implemented by creating and capturing records management metadata in the systems that create and manage records. Several projects and standards initiatives have sought to identify a core set of records management metadata, including the Australian Recordkeeping Metadata Standard and the British Metadata Standard, which is part of the Requirements for Electronic Records Management System. More recently, ISO/TS 23081-1 was published to implement metadata requirements within the framework of ISO 15489. 
The public records management system in Korea is governed by the Act on the Management of Archives by Public Agencies and the Administrative Records Management Regulation. This article evaluates the records and descriptive elements captured and maintained by the records management system in Korea against the international metadata standards.

Resume Classification System using Natural Language Processing & Machine Learning Techniques

  • Irfan Ali;Nimra;Ghulam Mujtaba;Zahid Hussain Khand;Zafar Ali;Sajid Khan
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.7
    • /
    • pp.108-117
    • /
    • 2024
  • Selecting and recommending a suitable job applicant from a pool of thousands of applications is often a daunting job for an employer, and the process significantly increases the workload of the department concerned. A Resume Classification System using Natural Language Processing (NLP) and Machine Learning (ML) techniques could automate this tedious process and ease the employer's job. Moreover, automation can significantly expedite the applicant selection process and make it more transparent, with minimal human involvement. Various Machine Learning approaches have been proposed for developing Resume Classification Systems. This study presents an automated NLP and ML-based system that classifies resumes according to job categories with performance guarantees. It employs various ML algorithms and NLP techniques to measure the accuracy of Resume Classification Systems and proposes a solution with better accuracy and reliability in different settings. To demonstrate the significance of NLP and ML techniques for processing and classifying resumes, the extracted features were tested on nine machine learning models: Support Vector Machine (SVM; Linear, SGD, SVC and NuSVC), Naïve Bayes (Bernoulli, Multinomial and Gaussian), K-Nearest Neighbor (KNN) and Logistic Regression (LR). The Term Frequency-Inverse Document Frequency (TF-IDF) feature representation scheme proved suitable for the Resume Classification task. The developed models were evaluated using F-ScoreM, RecallM, PrecisionM, and overall Accuracy. The experimental results indicate that, using the One-vs-Rest classification strategy for this multi-class Resume Classification task, the SVM class of Machine Learning algorithms performed better on the study dataset, with over 96% overall accuracy. These promising results suggest that the NLP and ML techniques employed in this study could be used for the Resume Classification task.
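
The TF-IDF representation and the macro-averaged metrics named in the abstract above can be illustrated with a small standard-library sketch. It is not the authors' pipeline: the whitespace tokenizer, the `tf * log(N / df)` weighting variant, and the function names `tfidf_vectors` and `macro_scores` are assumptions made for illustration, and the paper's exact definition of F-ScoreM may differ from the per-class F1 average used here.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    tokenized = [doc.lower().split() for doc in docs]  # assumed: whitespace tokenizer
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return vectors

def macro_scores(y_true, y_pred):
    """Macro precision, recall and F1: per-class scores averaged with equal weight."""
    labels = sorted(set(y_true) | set(y_pred))
    precisions, recalls, f1s = [], [], []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)
    n = len(labels)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n
```

A term that occurs in every document receives weight zero under this variant, which is why very common words contribute little to the representation. The resulting vectors would then be passed to a classifier such as a linear SVM in a one-vs-rest arrangement, which is outside the scope of this sketch.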

Directions of Implementing Documentation Strategies for Local Regions (지역 기록화를 위한 도큐멘테이션 전략의 적용)

  • Seol, Moon-Won
    • The Korean Journal of Archival Studies
    • /
    • no.26
    • /
    • pp.103-149
    • /
    • 2010
  • Documentation strategy has been tried in various subject areas and local regions since the late 1980s, when archival communities in the United States proposed it as a method of archival appraisal and selection. Although it has been criticized as too idealistic, its potential for documenting local regions in the digital environment deserves a fresh look. The purpose of this study is to analyze the implementation issues of documentation strategy and to suggest directions for documenting the local regions of Korea by applying it. Documentation strategy, developed more than twenty years ago mostly in Western countries, has several implications for documenting local regions even in the current digital environment. First, documentation strategy can enhance the value of archivists as well as archives in local regions, because under the strategy the archivist should be an active shaper of history rather than a passive receiver of archives. It can also help overcome the poor conditions of local archives management in Korea. Second, the strategy can encourage cooperation among collecting institutions in each local region, including museums, libraries, archives, cultural centers, and history institutions. In a networked environment this cooperation can be achieved more effectively than in the traditional environment, where it imposes a heavy workload on the cooperating institutions. Third, the strategy can foster solidarity among the various groups in a local region. According to the analysis of strategy projects, gathering the knowledge, passion, and enthusiasm of related groups is essential to implementing the strategy effectively. It can also provide a methodology for minority groups in society to document their memories. 
Considering the current archival infrastructure of Korea, this study suggests the following directions for documenting local regions. First, highly selective and intensive documentation should be pursued rather than comprehensive documentation. Although deciding which subjects have priority for documentation is a very political problem, the interests of local community members as well as professional groups should be taken seriously in the decision-making process. Second, it is effective to plan an integrated representation of local history under the distributed custody of local archives. It would be desirable to implement an archival gateway for integrated search and representation of local archives regardless of where the archives are held. Third, digital documentation using Web 2.0 technologies should be tried. As a methodology for selecting and acquiring archives, documentation strategy cannot completely avoid the subjectivity and prejudices of the appraiser. To mitigate these problems, an open documentation system should be prepared to reflect the different interests of different groups. Fourth, it is desirable to apply the conspectus model used in cooperative collection management in libraries to documenting local regions digitally. A conspectus can show the existing documentation strength and the future documentation intensity of each participating institution. Using it, the documentation level of each subject area can be set cooperatively and effectively in the local regions.

A study on the Elements of Communication in the Tasks of Function of Mathematics in Context Textbook (MiC 교과서의 함수 과제에 대한 의사소통의 유형별 요소에 관한 탐색)

  • Hwang, Hye Jeang;Choe, Seon A
    • Communications of Mathematical Education
    • /
    • v.30 no.3
    • /
    • pp.353-374
    • /
    • 2016
  • Communication is one of the six core competencies newly proposed in the mathematics curriculum revised in 2015 in Korea. Its importance has also been emphasized by NCTM and CCSSI. Taking the Mathematics in Context (MiC) textbook as its subject, this study explores the elements of communication according to types of communication such as discourse, representation, and operation. Specifically, it examines 316 questions in a total of 34 tasks relevant to function content in the MiC textbook and explores the communication elements in the questions of each task. To this end, the study first reconstructs and establishes an analytic framework based on the 'D.R.O.C type' of communication developed by Kim & Pang in 2010. In addition, based on the achievement standards of the function domain in the 2015 revised Korean mathematics curriculum, it compares the function content of the MiC textbook with that of the Korean mathematics curriculum document. It also explores the distribution of communication elements according to the types of communication.

Hardware-Based High Performance XML Parsing Technique Using an FPGA (FPGA를 이용한 하드웨어 기반 고성능 XML 파싱 기법)

  • Lee, Kyu-hee;Seo, Byeong-seok
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.40 no.12
    • /
    • pp.2469-2475
    • /
    • 2015
  • Structured XML is widely used to present services in various Web services. XML is also used for digital documents and digital signatures and for the representation of multimedia files in email systems. An XML document must first be parsed before its elements can be accessed, and parsing is the most compute-intensive task in the use of XML documents. Most previous work has focused on hardware-based XML parsers to improve parsing performance, while little work has studied parsing techniques. We present a high-performance parsing technique that can be used in any XML parser and design a hardware-based XML parser using an FPGA. The proposed parsing technique uses element analyzers instead of a state machine and performs multibyte-based element matching. As a result, it reduces the number of clock cycles per byte (CPB) and requires no preprocessing, such as loading XML data into memory. Compared to other parsers, our parser achieves a 1.33 to 1.82 times improvement in system performance. The proposed parsing technique can therefore process XML documents in real time and is suitable for application to any XML parser.

A Study on the Roles and Revision of eUCP for Global Electronic Trading (글로벌 전자무역의 실현을 위한 eUCP의 역할과 개정방안)

  • Choi, Seok-Beom;Hong, Sung-Kyu
    • THE INTERNATIONAL COMMERCE & LAW REVIEW
    • /
    • v.18
    • /
    • pp.105-134
    • /
    • 2002
  • In the spring of 2000, the Banking Commission of the ICC decided to appoint a working group to draft a supplement to the UCP 500 to clarify the position regarding electronic presentation under a documentary credit. Provisions were drafted to supplement its existing rules for documentary credits, that is, the UCP 500. These new provisions, known as the Supplement to UCP 500 for Electronic Presentation (eUCP), were approved by the ICC Banking Commission at the beginning of November 2001 and came into force on 1 April 2002. The eUCP covers matters such as the definitions of key terms, including electronic record, electronic signature, format, paper document, and received. An eUCP Credit must specify the formats in which electronic records are to be presented; if it does not, electronic records may be presented in any format. Electronic records may be presented separately and need not be presented at the same time. The purpose of this paper is to understand the main substance of the eUCP and to facilitate the introduction of electronic letters of credit by studying the problems and revision of the eUCP and a new electronic UCP. The main points of the eUCP are: the electronic address as the place for presentation of electronic records; flexibility in the formats of electronic records to be presented; assignment of responsibility for the notice of completeness of presentation to the beneficiary; one electronic record satisfying a requirement for one or more originals or copies of an electronic record; examination of electronic records including the electronic record at a hyperlink to an external system or the referenced system; and the absence of any provision on the time period for the examination of documents. The roles of the eUCP are the promotion of electronic trade, the provision of a basis for uniform rules for electronic letters of credit, and the introduction of an electronic trade model. 
The characteristics of the eUCP are that it is a supplement to the UCP, that it does not address any issues relating to the electronic issuance or advice of credits, and that it is independent of specific technologies and developing electronic commerce systems such as the Bolero service. The problems of the eUCP are the flexibility of the format of electronic records, the heavy burden on banks, and issues regarding the number of presentations, the notice of completeness of presentation, the lack of any provision on the time to examine electronic records, and the representation of electronic records. In revising the eUCP to resolve these problems, the following should be taken into consideration: designation of a format that allows banks to examine records electronically, prohibition of paper documents, development of a system for receiving electronic records, addition of a reception notice on the part of banks, setting of the time to examine electronic records, and construction of a backup system or dual processing system.


Optimal supervised LSA method using selective feature dimension reduction (선택적 자질 차원 축소를 이용한 최적의 지도적 LSA 방법)

  • Kim, Jung-Ho;Kim, Myung-Kyu;Cha, Myung-Hoon;In, Joo-Ho;Chae, Soo-Hoan
    • Science of Emotion and Sensibility
    • /
    • v.13 no.1
    • /
    • pp.47-60
    • /
    • 2010
  • Most research on classification has used kNN (k-Nearest Neighbor) and SVM (Support Vector Machine), which are known as learning-based models, and the Bayesian classifier and NNA (Neural Network Algorithm), which are known as statistics-based methods. However, these face limitations of space and time when classifying the very large number of web pages on today's internet. Moreover, most classification studies use a uni-gram feature representation, which does not represent the real meaning of words well. Korean web page classification poses additional problems because Korean words often have multiple meanings (polysemy). For these reasons, LSA (Latent Semantic Analysis) has been proposed for classification in such environments (large data sets and polysemous words). LSA uses SVD (Singular Value Decomposition), which decomposes the original term-document matrix into three matrices and reduces their dimensions. Through SVD, it is possible to create a new low-dimensional semantic space for representing vectors, which makes classification efficient and allows the latent meaning of words or documents (or web pages) to be analyzed. Although LSA is good at classification, it has some drawbacks. When SVD reduces the dimensions of the matrix and creates a new semantic space, it considers which dimensions represent the vectors well, not which dimensions discriminate between them well. This is why LSA does not improve classification performance as much as expected. In this paper, we propose a new LSA that selects optimal dimensions to both discriminate and represent vectors well, minimizing these drawbacks and improving performance. The proposed method shows better and more stable performance than other LSA methods in low-dimensional spaces. In addition, we obtain further improvement in classification by creating and selecting features, removing stopwords and statistically weighting specific values.
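
For reference, the decomposition that the abstract above describes in words is the standard truncated SVD. The notation is assumed here rather than taken from the paper: $A$ is the $m \times n$ term-document matrix, $k$ is the number of retained dimensions, and $d$ is a document's term vector. The paper's own contribution, the selection of discriminative dimensions, is not reproduced.

```latex
A = U \Sigma V^{\top}, \qquad
A_k = U_k \Sigma_k V_k^{\top} \quad (k \ll \operatorname{rank} A), \qquad
\hat{d} = \Sigma_k^{-1} U_k^{\top} d
```

Plain LSA keeps the $k$ largest singular values, which gives the best rank-$k$ approximation of $A$ in the least-squares sense. That criterion rewards dimensions that represent the data well and is indifferent to class labels, which is the drawback the abstract points to.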


A Study on the Persons Enjoying the Landscape of Daegodea in Hamyang and Space Hegemony through Analysis of Poetry and Letters Carved on the Rocks (시문과 바위글씨로 본 함양 대고대(大孤臺)의 경관 향유자와 장소패권(場所覇權))

  • Rho, Jae-Hyun;Lee, Jung-Han
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.32 no.1
    • /
    • pp.10-21
    • /
    • 2014
  • This study focuses on the landscape of Daegodae(大孤臺), a prominent rock at the side of Namgae Stream in Hamyang, and on the people who enjoyed that landscape. By analyzing ancient poetry and the letters, such as names, carved on the rocks and stone walls, the study examines the characteristics of the landscape and space of Daegodae and the phases of hegemony over enjoying them. The results of this study are as follows. Five Seowon(書院: lecture halls) are identified near Daegodae on the ancient map, and the three-dimensional volume and eccentricity of Daegodae are impressive. According to an analysis of the ancient document Go-dae-il-Loc(孤臺日錄) written by Jung Kyung-Woon(鄭慶雲: 1556~?), Daegodae, named by Noh Jin(1518~1578) in the 16th century, was used in a variety of ways, including viewing, games, recreation, and meetings, by the staff of the lecture halls, including Namgae Seowon(南溪書院). Structurally, Chunggeunchung(淸近亭) stands on the rock face at the top of Daegodae, and Sanangjae(山仰齋) lies to the west, around the memorial stone for Yang Hee(梁喜: 1515~1581). The upper part of the foundation of Daegodae, 11m high and $10m^2$ wide from east to west, was widely used for lecturing and poetry reading. To the north and west of the foundation are the writing of Kim Jeong-Hee(金正喜: 1786~1856), with the words 'Seoksong Chusa(石松 秋史)' carved on the rock, and the remains of a dead tree presumed to have been called 'Seoksong'. These landscape features further enhance the history and authenticity of the place. The two rock inscriptions 'Daegodae Gaeeunseo(大高臺 介隱書)' and 'Mukheon JungGeunSang(鄭近相: 1893~1934)' were carved by Jung Jae-Gi(1811~1879) and his grandson Jung Geun-Sang, respectively; as outcomes of exclusive possession of the space and of space hegemony, they are signatures indicating that these two were the persons who enjoyed this place during the late Joseon and Japanese colonial eras. 
In other words, Daegodae had carried the implied meaning of preoccupancy of the place as Gujolyangseonsengjangguso since the middle of Joseon, and, after passing through the space hegemony of Jung Jae-Gi in the late Joseon era and of Jung Geun-Sang in the Japanese colonial era, it was handed down as a Buddhism lecturing and memorial venue called "Dungbukganghoiso Cheonryungjaeseonhyunjangguso". Nevertheless, the many letters carved on the rock also imply that 'Hadong Jung(河東鄭氏)' and 'Pungcheon Noh(豊川盧氏)' were those who enjoyed the landscape of Daegodae and stood at the center of the space hegemony. The letters carved on the rock of Daegodae are another case of a cultural landscape and traditional garden space that represents the will to enjoy the landscape of this place and the history of its space hegemony.

Semantic Access Path Generation in Web Information Management (웹 정보의 관리에 있어서 의미적 접근경로의 형성에 관한 연구)

  • Lee, Wookey
    • Journal of the Korea Society of Computer and Information
    • /
    • v.8 no.2
    • /
    • pp.51-56
    • /
    • 2003
  • Structuring Web information supports a strongly user-side viewpoint, in which a user pursues his or her own needs while browsing a specific Web site. Going beyond the depth-first and breadth-first algorithms, the Web information is abstracted into a hierarchical structure. A prototype system is proposed to visualize and represent semantic significance. As a motivating example, a Web test site is presented and analyzed with respect to several keywords. As future research, the Web site model should be extended to the whole WWW, and an accurate assessment function needs to be devised by which the several proposed models can be evaluated.
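
As a baseline for what abstracting a Web site into a hierarchical structure can mean, the breadth-first case mentioned in the abstract above can be sketched as follows. This is a minimal illustration, not the paper's method: the link graph, the function names `bfs_hierarchy` and `path_from_root`, and the use of the parent chain as an access path are assumptions, and the semantic weighting the paper proposes is not modeled.

```python
from collections import deque

def bfs_hierarchy(links, root):
    """Return {page: parent} for a breadth-first spanning tree of the link graph."""
    parent = {root: None}
    queue = deque([root])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in parent:  # keep only the first (shallowest) link to a page
                parent[target] = page
                queue.append(target)
    return parent

def path_from_root(parent, page):
    """Follow parent pointers back to the root and return the access path."""
    path = []
    while page is not None:
        path.append(page)
        page = parent[page]
    return path[::-1]
```

Every page reachable from the root ends up with exactly one parent, so a cyclic link graph collapses into a tree. A semantic approach would presumably choose the parent by relevance to the user's keywords rather than by traversal order alone.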
