• 제목/요약/키워드: machine-readable

검색결과 84건 처리시간 0.024초

주제분석기법으로서의 자동색인 (Automatic indexing as a subject analysis technique)

  • 이영자
    • 한국도서관정보학회지
    • /
    • 제12권
    • /
    • pp.61-96
    • /
    • 1985
  • The human subject analysis of a document has some critical problems. The method results in the inconsistency in analysis process and the contradiction of two objects of the subject analysis (one is the identification of the content for the retrieval of specific items and the other is to identify the content for the grouping of related materials). Since the subject analysis by mechanized has been recognized to be the possible way to aggregate the problems of manual analysis, various a n.0, pproaches of automatic indexing have been studied and experimented. This study is to examine the automatic indexing as one of the promising subject analysis techniques by statistical, syntactical and semantic a n.0, pproaches. In conclusion, the reasonable a n.0, pplication time of the automatic indexing should be made a decision based on the through investigation on the cost verse effectiveness, and automatic indexing system should be developed in the close relationship with the on-line search which is a good retrieval system for information explosion society. From now on, since the machine-readable document-text will be envisaged to be more and more available due to the rapid development of computer technology, the more substantial research on the automatic indexing will be also possible, which can bring about the increasing of practical automatic indexing systems.

  • PDF

고전적의 형태기술에 관한 연구 -국제표준서지기술법(ISBD)의 형식을 중심으로- (A Study on Physical Description of the Oriental Traditional Books : According to ISBD)

  • 현영아
    • 한국문헌정보학회지
    • /
    • 제20권
    • /
    • pp.271-295
    • /
    • 1991
  • The external forms and contents of many library materials are very various. The physical description of the specific materials in the forms must be fitted to each forms. The oriental traditional books are very special in the printing forms. The machine readable cataloging of library materials is used internationally in these days. So, the cataloging of the oriental traditional materials must be reconsidered for computerizing of that. The physical descriptions of these materials will accord with ISBD to prepare for comuterzing of that. This study presented the recording forms of physical description that fitted to peculiarity of the oriental traditional materials and it refered to ISBD of non-book materials that are special in the forms. These recording forms of that are as the follows; The first part is the recording forms of description and number of the parts of items. The second part is the recording forms of the other physical details. This part contains the Illustration, Kwankwak, Keseon, Hengjasu, Heucku, Eormee. The Third part is the dimensions of items. The dimensions of the oriental traditional books consist of two kind. One is the dimensions of actual printing. The other is that of a book cover. The fourth part is the recording forms of the accompany materials.

  • PDF

목록에 있어서의 일본인명 표기-<대한민국출판물총목록>의 색인에 나타난 표기를 중심으로-

  • 김영귀
    • 한국도서관정보학회지
    • /
    • 제20권
    • /
    • pp.285-315
    • /
    • 1993
  • Some conclusions can be derived from the study: 1. Person's name should be script by the one's mother tongue because of its uniqueness. 2. Japanese person's name should be script and pronounce their mother tongue for exchange and sharing of an academic information. 3. We can anticipate that Japanese language materials will be increase in near future. 4. The National Central Library which publish Korean National Bibliography must have to responsibility to lead other library. 5. The script of [Korean National bibliography] must contribute to standardization and national and Universal Bibliographic Control. 6. The area of education, newspaper, publishing are scripting Japanese person's name with script conversion schemes for Koreanization, devised by Ministry of Education. 7. The script of [Korean National Bibliography]'s name index can be used as authority file at selection of heading in library cataloging. 8. Most of libraries script Japanese person's name with Chinese character in Korean language pronunciation. 9. Korean Cataloging Rules (KCR) and Korean Machine Readable Cataloguing (KORMARC) description rules should be defined about the mother tongue script of Japanese person's name. 10. It is desirable to increase of credit of Readings in Japanese material course in college curriculum. 11. Because Japanese person's name is complex and variable that it is desirable to add Chinese character with mother tongue script.

  • PDF

표목의 기능에 관한 연구 (A study on the functions of headings)

  • 김태수
    • 정보관리학회지
    • /
    • 제12권2호
    • /
    • pp.9-35
    • /
    • 1995
  • 특정문헌의 검색기능은 기본표목만의 기능이 아니라 오히려 부출표목의 기능이라고 할 수 잇다. 특히 기계가독목록에서는 기본표목 이외에 각종 제어번호나 표준서지번호를 사용하여, 이 기능이 크게 확장되었다. 특정저자의 저작과 특정저작의 제판을 집중하는 목록의 기능도 부출표목과 참조, 주기, 통일서명이 기본표목과 대등한 역할을 수행하고 있음을 확인할 수 있었다. 하이퍼목록에서는 서지적 관계유형을 규정하고 관련저작간을 연결할 수 있어, 기본표목을 배제하더라도 목록의 기능수행에는 아무런 영향을 주지 않는다. 접근점의 확장과 관련저록간의 연결수단을 통하여 이용자의 검색기회를 확대하는 것이 중요하다.

  • PDF

Computer Codes for Korean Sounds: K-SAMPA

  • Kim, Jong-mi
    • The Journal of the Acoustical Society of Korea
    • /
    • 제20권4E호
    • /
    • pp.3-16
    • /
    • 2001
  • An ASCII encoding of Korean has been developed for extended phonetic transcription of the Speech Assessment Methods Phonetic Alphabet (SAMPA). SAMPA is a machine-readable phonetic alphabet used for multilingual computing. It has been developed since 1987 and extended to more than twenty languages. The motivating factor for creating Korean SAMPA (K-SAMPA) is to label Korean speech for a multilingual corpus or to transcribe native language (Ll) interfered pronunciation of a second language learner for bilingual education. Korean SAMPA represents each Korean allophone with a particular SAMPA symbol. Sounds that closely resemble it are represented by the same symbol, regardless of the language they are uttered in. Each of its symbols represents a speech sound that is spectrally and temporally so distinct as to be perceptually different when the components are heard in isolation. Each type of sound has a separate IPA-like designation. Korean SAMPA is superior to other transcription systems with similar objectives. It describes better the cross-linguistic sound quality of Korean than the official Romanization system, proclaimed by the Korean government in July 2000, because it uses an internationally shared phonetic alphabet. It is also phonetically more accurate than the official Romanization in that it dispenses with orthographic adjustments. It is also more convenient for computing than the International Phonetic Alphabet (IPA) because it consists of the symbols on a standard keyboard. This paper demonstrates how the Korean SAMPA can express allophonic details and prosodic features by adopting the transcription conventions of the extended SAMPA (X-SAMPA) and the prosodic SAMPA(SAMPROSA).

  • PDF

결합 신경망을 이용한 여권 MRZ 정보 인식 (Recognition of Passport MRZ Information Using Combined Neural Networks)

  • 김진호
    • 디지털산업정보학회논문지
    • /
    • 제15권4호
    • /
    • pp.149-157
    • /
    • 2019
  • In case of reading passport using a smart phone in contrast with a dedicated passport reading system, MRZ(Machine Readable Zone) character recognition can be hard when the character strokes were broken, touched or blurred according to the lighting condition, and the position and size of MRZ character lines were varied due to the camera distance and angle. In this paper, the effective recognition algorithm of the passport MRZ information using a combined neural network recognizer of CNN(Convolutional Neural Network) and ANN( Artificial Neural Network), is proposed under the various sized and skewed passport images. The MRZ line detection using connected component analysis algorithm and the skew correction using perspective transform algorithm are also designed in order to achieve effective character segmentation results. Each of the MRZ field recognition results is verified by using five check digits for deciding whether retrying the recognition process of passport MRZ information or not. After we implement the proposed recognition algorithm of passport MRZ information, the excellent recognition performance of the passport MRZ information was obtained in the experimental results for PC off-line mode and smart phone on-line mode.

모바일 폰의 영상 촬영에 대한 바코드 워터마킹 (Barcode watermarking for photographs of mobile phone)

  • 황태원;서정희;박흥복
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2017년도 춘계학술대회
    • /
    • pp.763-764
    • /
    • 2017
  • 모바일 폰에서의 바코드 사용은 일반화되었고, 바코드는 모바일 지불과 개인 식별을 포함하여 보안에 민감한 응용 프로그램에 폭넓게 사용되고 있다. 휴대 전화로 촬영된 영상은 일반적으로 촬영각도에 의해 기하학적 왜곡이 심해서 낮은 품질의 영상을 생성한다. 낮은 품질의 영상에 대한 워터마크 내장은 비지각성을 만족시키기 어렵게 한다. 이런 문제를 해결하기 위해 본 논문은 휴대 전화로 촬영된 영상에 바코드 영상을 내장하는 기법에 초점을 맞추고 모바일 기반의 소유권 보호를 위한 바코드 워터마킹을 제안한다. 영상에 내장된 바코드 워터마크는 불법 복제와 같이 머신에서의 판독이 가능함으로써 소유권을 증명하는데 사용할 수 있다.

  • PDF

A Covariance-matching-based Model for Musical Symbol Recognition

  • Do, Luu-Ngoc;Yang, Hyung-Jeong;Kim, Soo-Hyung;Lee, Guee-Sang;Dinh, Cong Minh
    • 스마트미디어저널
    • /
    • 제7권2호
    • /
    • pp.23-33
    • /
    • 2018
  • A musical sheet is read by optical music recognition (OMR) systems that automatically recognize and reconstruct the read data to convert them into a machine-readable format such as XML so that the music can be played. This process, however, is very challenging due to the large variety of musical styles, symbol notation, and other distortions. In this paper, we present a model for the recognition of musical symbols through the use of a mobile application, whereby a camera is used to capture the input image; therefore, additional difficulties arise due to variations of the illumination and distortions. For our proposed model, we first generate a line adjacency graph (LAG) to remove the staff lines and to perform primitive detection. After symbol segmentation using the primitive information, we use a covariance-matching method to estimate the similarity between every symbol and pre-defined templates. This method generates the three hypotheses with the highest scores for likelihood measurement. We also add a global consistency (time measurements) to verify the three hypotheses in accordance with the structure of the musical sheets; one of the three hypotheses is chosen through a final decision. The results of the experiment show that our proposed method leads to promising results.

한국어 명사의 시소러스 구축을 위한 시스템 설계 및 구현 (Design and Implementation of a System for Constructing Thesaurus of Korean Nouns)

  • 이종인;한광록;양승현;김영섬
    • 한국정보처리학회논문지
    • /
    • 제6권2호
    • /
    • pp.347-356
    • /
    • 1999
  • 본 논문에서는 한국어 명사의 의미 개념의 계층을 생성하기 위한 시소러스 구성 방법과 시소러스를 구축하기 위한 개발 시스템을 구현하였다. 기존의 시소러스 구축에 있어서 나타나는 계층 설정의 비객관성 및 작업속도 문제, 비구조성, 비일관성 등의 문제를 해결하기 위하여 상향식과 하향식 방법을 혼합 적용하는 다단계 구축 방법을 사용한다. 온라인 전자 사전의 뜻풀이 문을 이용하여 객관성을 유지하고, 기존 시소러스의 기본 모델을 참조하여 비구조성과 비일관성의 문제를 해결한다. 또한 방대한 양의 표제어를 포함하는 시소러스를 빠른 시간 내에 구축하기 위하여 클라이언트/서버 환경의 개발 도구를 구현하여 여러 사람이 다중 입력 작업을 할 수 있도록 하였다.

  • PDF

국립중앙도서관 자료관리의 전산화연구 -기계가독목록의 개발과 활용- (Computerization of the Central National Library; Development of Korean MARC System)

  • 정영미;현규섭
    • 한국문헌정보학회지
    • /
    • 제8권
    • /
    • pp.3-72
    • /
    • 1981
  • The necessity of computerizing the Central National Library of Korea has been widely recognized. The purpose of this research is to develop Korean Machine Readable Cataloging system as a first step toward an integrated library system and suggests the ways of utilizing the MARC file in Korean libraries as well as in the Central National Library. In this paper the following studies are included: 1. An analysis of the functions and current procedures of the Central National Library 2. Development of a standard record format for KOR MARC 3. Production and utilization of KOR MARC files 4. Identification of problems in developing KOR MARC system In conclusion, the following recommendations are made: 1. Standardization of the internal code and input/output equipments should be proceded for hangul and chinese data processing. 2. The libraries planning or having accomplished computerization process should be cooperative in standardizing and distributing bibliographic data bases including KOR MARC tapes. 3. Training of competent librarians and strong support from the government are required for a successful implementation of the Library's computerization project.

  • PDF