• Title/Summary/Keyword: texts


Digitization of Old Korean Texts with Obsolete Korean Characters and Suggestion for Improvement of Information Sharing (옛한글 문서의 전자문서화와 정보공유 방법 제안)

  • Kim, Ha Young;Yoo, Woo Sik
    • Journal of Conservation Science / v.37 no.3 / pp.255-269 / 2021
  • A vast amount of material written in old Korean, using old grammar and/or obsolete characters (prints, woodblock prints, manuscripts, old novels, and letters), is held by many institutions, including the Jangseogak at the Academy of Korean Studies. Digitizing these texts has required a prolonged manual input process. Individual researchers who majored in old Korean have read the characters and typed them into electronic documents. This process depends on individual skill, effort, and approach, none of which can be significantly increased, which makes it particularly limiting. To date, only a small proportion of the old Korean document collections currently kept in storage has been digitized and made available to the public. Even the electronic versions of the texts prove difficult to display correctly, because the old Korean characters are incompatible with the character sets on today's electronic devices. To improve the techniques and efficiency of digitizing old Korean texts, it is necessary to develop optical character recognition (OCR) that analyzes images of old Korean documents, as well as input, display, and storage methods.
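The display problem mentioned in this abstract can be illustrated with Unicode normalization. The sketch below is not taken from the paper; it assumes the texts are stored as Unicode conjoining jamo and uses Python's standard `unicodedata` module. The helper name `compose` is hypothetical. A modern syllable composes into one precomposed code point, while a syllable containing the obsolete vowel araea (U+119E) has no precomposed form and stays a three-code-point sequence that only renders as one block when the font supports it.

```python
import unicodedata

def compose(jamo: str) -> str:
    """Apply Unicode NFC normalization to a sequence of conjoining jamo."""
    return unicodedata.normalize("NFC", jamo)

# Modern syllable: initial HIEUH + vowel A + final NIEUN.
# NFC composes these three jamo into the single code point U+D55C.
modern = compose("\u1112\u1161\u11ab")

# Old Korean syllable: initial HIEUH + vowel ARAEA (U+119E) + final NIEUN.
# No precomposed syllable exists, so NFC leaves the three jamo as they are.
archaic = compose("\u1112\u119e\u11ab")

print(len(modern), len(archaic))
```

Software that assumes one code point per syllable block will therefore mishandle the second string, which is one plausible reason storage and display need their own methods alongside OCR.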

Techniques for Location Mapping and Querying of Geo-Texts in Web Documents (웹 문서상의 공간 텍스트 위치 맵핑과 질의 기법)

  • Ha, Tae Seok;Nam, Kwang Woo
    • Journal of Korea Society of Industrial Information Systems / v.27 no.3 / pp.1-10 / 2022
  • With the development of web technology, large numbers of web documents are being produced. These documents contain various spatial texts, and converting these texts into spatial information provides the basis for searching text documents with spatial queries. Spatial texts cover a wide range of items, including postal codes and local phone numbers as well as administrative place names and POI names. This paper presents algorithms that map locations based on the spatial text information in web documents. With these algorithms, documents describing a region can be searched for on a map rather than through a general web search. We demonstrate that the presented algorithms are useful by implementing a web geo-text query system.
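As a rough illustration of mapping spatial text to locations, the following sketch looks up place names and telephone area codes in a tiny hand-made gazetteer and then filters hits with a bounding-box query. It is not the paper's algorithm: the gazetteer entries, the approximate coordinates, the phone pattern, and the function names `extract_locations` and `in_bbox` are all assumptions made for the example.

```python
import re

# Hypothetical mini-gazetteer with approximate coordinates (illustrative only).
PLACE_NAMES = {"Seoul": (37.5665, 126.9780), "Busan": (35.1796, 129.0756)}
# Telephone area codes mapped to the place they serve.
AREA_CODES = {"02": "Seoul", "051": "Busan"}

# Assumed phone format: area code, then 3-4 digits, then 4 digits.
PHONE_RE = re.compile(r"\b(0\d{1,2})-\d{3,4}-\d{4}\b")

def extract_locations(text):
    """Return (matched text, place, (lat, lon)) for each spatial cue, in text order."""
    hits = []
    for name, coords in PLACE_NAMES.items():
        for m in re.finditer(r"\b" + re.escape(name) + r"\b", text):
            hits.append((m.start(), m.group(), name, coords))
    for m in PHONE_RE.finditer(text):
        place = AREA_CODES.get(m.group(1))
        if place:
            hits.append((m.start(), m.group(), place, PLACE_NAMES[place]))
    hits.sort()  # order by position in the text
    return [(matched, place, coords) for _, matched, place, coords in hits]

def in_bbox(coords, south, west, north, east):
    """Spatial query: is the point inside the given bounding box?"""
    lat, lon = coords
    return south <= lat <= north and west <= lon <= east

doc = "Visit our Busan office or call 02-123-4567."
locations = extract_locations(doc)
# Keep only cues that fall inside a box drawn around the Seoul area.
near_seoul = [loc for loc in locations if in_bbox(loc[2], 37.0, 126.5, 38.0, 127.5)]
```

A real system would need a full gazetteer and disambiguation of place names that share a spelling; the sketch only shows the text-to-coordinate step and the map-based filter.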

A Study on Smallpox and Measles by BYUN Gwangwon - Based on the Organization of the Yosandangsinjipuibangkeumnangjibo and the Bojeoksinbang - (변광원(卞光源)의 두진(痘疹)과 마진(麻疹)에 대한 연구 - 『요산당신집의방금낭지보(樂山堂新集醫方錦囊至寶)』와 『보적신방(保赤新方)』의 편제를 중심으로 -)

  • SONG, Jichung
    • Journal of Korean Medical Classics / v.35 no.3 / pp.59-69 / 2022
  • Objectives : The existence of specialized medical texts on a certain disease reflects its prevalence at the time. Smallpox and measles were major pediatric diseases; previous studies have examined the outbreak of measles in late Joseon, the relationships among various specialized texts, and how records of the two diseases in the general medical literature have changed chronologically. Research on the two diseases as recorded in different texts written by the same author has not been conducted before. Methods : The organization of the smallpox and measles parts in the Yosandangsinjipuibangkeumnangjibo and the Bojeoksinbang was examined, followed by comparative analysis. Results : While the two texts show great similarity in the general contents on smallpox and measles, they differ in the way they were written. In the Yosandangsinjipuibangkeumnangjibo the author lists the referenced literature, while in the Bojeoksinbang he does not. In addition, compared to the Yosandangsinjipuibangkeumnangjibo, the Bojeoksinbang has detailed titles for the contents of both the introduction and the detailed parts, while the Bojeoksinbang contains contents that cannot be found in the Yosandangsinjipuibangkeumnangjibo, along with more pattern differentiation in the former. Conclusions : The Yosandangsinjipuibangkeumnangjibo, published in May of 1806, is a general medical text in which the part on pediatrics occupies the first two of the 12 volumes, indicating the author's emphasis on pediatric disease. The Bojeoksinbang, published in December of 1806, discusses in-depth theories on smallpox and measles among all pediatric diseases, offering a glimpse of a specialized field of pediatrics in the late Joseon period.

Reading and Teaching "Snow White" from a Critical Literacy Stance: the Original, the Animated Version, and Parodies (크리티컬 리터러시를 활용한 "백설공주" 읽기교육 -원작과 영화, 패러디 작품을 중심으로)

  • Choi, Seokmoo
    • Journal of English Language & Literature / v.55 no.5 / pp.885-906 / 2009
  • In terms of class, race, or gender, critical literacy takes seriously the problem of inequality and injustice embedded in texts. Texts are considered tools used for maintaining the status quo by constructing and communicating our identities, particularly in relation to others. While reading texts and identifying our roles in society, some feel empowered and others marginalized. Thus we need to challenge the characterization and the message included in those texts by asking problem-posing questions. In this paper I demonstrate how to read and teach four versions of "Snow White" from a critical literacy stance. Through problem-posing questions, students are led to discover that one of Grimms' fairy tales, the original version of "Snow White," was written from the perspective of men with power, thus marginalizing women in general, as well as the seven dwarfs. Through a critical analysis of Snow White's personality, the typical theme of fairy tales - good is rewarded while evil is punished - should be challenged. In the animation, Snow White and the Seven Dwarfs, power is given to those marginalized in the original, the seven dwarfs and women in general. In "Snow Night," a feminist short story, women in general are empowered while men, who should be judged by their looks, are powerless. "Snow-Drop" reminds us of the original, but challenges stereotypes, prejudices, and the theme inherent in the story. In those three stories many parts of the original are rewritten from the perspectives of the marginalized, but some people are still described prejudicially. So students should be guided to write another story from a new perspective. When those four works were taught with problem-posing questions in a university, this approach proved quite successful: most students acknowledged the effectiveness of critical literacy in teaching literary works.

A Study of the Algorithm that Standardizes Processing of Information and Taking Indications of East Asian Medicine Formula (비정형 한의약텍스트 조제복용사항 정형화알고리즘연구 - 동의보감 처방정보를 중심으로)

  • CHA Wung-seok;HEO Yo-seob;Kim Namil
    • The Journal of Korean Medical History / v.35 no.2 / pp.45-67 / 2022
  • About 20,000 ancient medical texts from the East Asian medical traditions are currently known. Although the most famous texts are widely known, many still exist only as original manuscripts. We are interested in exploring these texts to uncover the potential benefits of their therapeutic knowledge. This study aims to develop a database program that automatically converts the treatment skills described in text form into a more structured form. In a previous study, our team analyzed patterns in the way treatment skills are described and then tried to design a database program algorithm that identifies every meaningful keyword used to describe treatment skills and puts that word in the right cell of a structured table. This study continues the development of this program. East Asian medical herbal treatment information is broken down into four elements: the first is the name or title of the treatment skill, the second is the symptoms to which the treatment is applied, the third is the ingredients used, and the fourth is how information is processed and the indications taken. This study presents the principles of the algorithm for analyzing and structuring the fourth element, the processing of information and taking of indications, which is described in a form of ancient natural language.

Laboratory Abilities to Carry-out Experimentations of Matter in the Middle School Science Texts (중학교 과학 교과서의 '물질 영역' 실험 활동에 포함된 실험 수행 능력)

  • Park, Hyun-Ju;Min, Byoung-Wook;Jeong, Dae-Hong
    • Journal of The Korean Association For Science Education / v.28 no.8 / pp.870-879 / 2008
  • The purpose of this study is to investigate the laboratory abilities needed to carry out experiments in the field of 'Matter' in middle school science texts. A total of 359 chemistry experiments from 26 textbooks were analyzed. The authors are interested in which science process skills students need to perform the experiments and how often these skills are needed. This article introduces a framework for analyzing the science process skills and their frequency. Patterns of science process skill use are similar across the different grades of middle school texts. The process skills of organizing results, interpreting data, and making generalizations are the most needed, in order of frequency. However, abilities related to alternative activities and/or conditions show relatively low frequency. For seniors, various laboratory abilities to carry out experiments are needed, whereas abilities for operating and setting up experimental apparatus are required for freshmen and juniors. These results suggest avenues for science teachers who make lesson plans involving science experiments.

A Study on the Famine Relief and Fasting Formulas - Focusing on Korean Medical Texts - (구황피곡방(救荒辟穀方)에 대한 고찰(考察) - 한국(韓國) 의서(醫書)를 중심으로 -)

  • Baik Yousang
    • Journal of Korean Medical Classics / v.37 no.2 / pp.101-119 / 2024
  • Objectives : This study examined the characteristics of famine relief and fasting formulas in Korean medical texts from the early Joseon to the early modern period. Methods : In addition to previous studies and texts, basic materials were collected from various academic databases such as the Korean Medical Classics Database, Korean History Database, Chinese Text Project, Weijiwenku, etc., and then analyzed. Results : In Korean Medicine from early Joseon to early modern Korea, there was a strong awareness of using fasting prescriptions, which were applied in Daoism, for the purpose of famine relief, combining medicinals and common food ingredients as complex prescriptions rather than single-ingredient formulas. Famine relief and fasting formulas were continuously listed in many medical texts published after the Donguibogam, in modified or newly improved forms. Moreover, the food ingredients and medicinals used in these formulas consisted of those that could be easily found in the famished nation of the time. Many of these formulas were tried and tested prescriptions, frequently used in clinical settings. Most of the ingredients and medicinals used in the famine relief and fasting formulas were sweet, bland, and neutral in nature, supporting Qi circulation and tonifying the Spleen and Stomach. Therefore, in times of famine, these medicinals could help prevent digestive problems and decline of stamina. Conclusions : Research on and contemporary interpretation of the famine relief and fasting formulas could contribute not only to health management but also to relieving nutritional imbalance and famine, expanding the field of application of Korean Medicine.

Detecting and Segmenting Text from Images for a Mobile Translator System

  • Chalidabhongse, Thanarat H.;Jeeraboon, Poonsak
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2004.08a / pp.875-878 / 2004
  • Research on text detection and segmentation has long been carried out in the OCR area. However, there are other areas in which text detection and segmentation from images can be very useful. In this report, we first propose the design of a mobile translator system that helps non-native speakers understand a foreign language using a ubiquitous mobile network and camera mobile phones. The main focus of the paper is the algorithm for detecting and segmenting text embedded in natural scenes in captured images. The image, captured by a camera mobile phone, is transmitted to a translator server. It first passes through preprocessing steps that smooth the image and suppress noise. A threshold is applied to binarize the image. Afterward, an edge detection algorithm and connected component analysis are performed on the filtered image to find edges and segment the components in the image. Finally, pre-defined layout relation constraints are used to decide which components are likely to be text. In a preliminary experiment, the system yielded a recognition rate of 94.44% on a set of 36 varied natural scene images that contain text.
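The thresholding, component segmentation, and layout filtering steps in this abstract can be sketched in a few lines of dependency-free Python. This is a simplified illustration under stated assumptions, not the authors' implementation: smoothing and edge detection are omitted, the image is a plain list of grayscale rows, and the threshold value, the 4-connectivity choice, and the `looks_like_text` size and aspect-ratio rule are hypothetical stand-ins for the paper's layout relation constraints.

```python
from collections import deque

def binarize(gray, threshold=128):
    """Mark pixels darker than the threshold as foreground (1), others as 0."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]

def connected_components(binary):
    """Find 4-connected foreground regions; return (top, left, bottom, right) boxes."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if binary[y][x] and not seen[y][x]:
                # Breadth-first flood fill from this unvisited foreground pixel.
                queue = deque([(y, x)])
                seen[y][x] = True
                top, left, bottom, right = y, x, y, x
                while queue:
                    cy, cx = queue.popleft()
                    top, bottom = min(top, cy), max(bottom, cy)
                    left, right = min(left, cx), max(right, cx)
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                        if 0 <= ny < h and 0 <= nx < w and binary[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                boxes.append((top, left, bottom, right))
    return boxes

def looks_like_text(box, min_height=2, max_aspect=5.0):
    """Hypothetical layout rule: drop specks and very wide, flat regions."""
    top, left, bottom, right = box
    height, width = bottom - top + 1, right - left + 1
    return height >= min_height and width / height <= max_aspect

# Toy 4x7 image: a vertical stroke, a 2x2 blob, and a one-pixel speck of noise.
gray = [
    [255,   0, 255, 255, 255, 255, 255],
    [255,   0, 255, 255,   0,   0, 255],
    [255,   0, 255, 255,   0,   0, 255],
    [255, 255, 255, 255, 255, 255,   0],
]
boxes = connected_components(binarize(gray))
candidates = [b for b in boxes if looks_like_text(b)]
```

On the reported result: 34 correct images out of 36 gives 94.44%, which is consistent with the stated recognition rate, although the abstract does not give the count itself.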


A Study on the Way of Organizing Contents of State Sponsored Medical Text in Ancient China (중국 주요 국가간행의학서의 편제구성과 질병분류인식에 대한 소고)

  • Cha, Wung-Seok;Kim, Namil;Ahn, Sang-Woo;Kim, Dong-Ryul
    • The Journal of Korean Medical History / v.30 no.2 / pp.1-12 / 2017
  • This paper is focused on the 'contents' of database level medical texts sponsored by the Chinese government. The premise of the study is that the contents of state-sponsored medical texts would show how medical policy makers and practitioners approached the body and diseases of the time, and by association the medical text would reveal the policy associated with state medical education and distribution of medical resources associated with the practitioners' approaches. This paper analyzes the contents of four representative state-sponsored medical texts: Cao's Treatise on the Origins and Symptoms of Various Diseases (巢氏諸病源候論, 610, Sui China); Great Peace and Sagely Benevolence Formulas (太平聖惠方, 996, Song China); Complete Record of Sagely Benevolence (聖濟總錄, 1117, Song China); Formulas for Universal Relief (普濟方, 1406, Ming China).

Resolving the Ambiguities in Word Sense by using Automatic Keyword Network in Information Retrieval (정보검색에서의 어의 중의성 해소를 위한 자동 키워드망의 이용)

  • Kim, Jung-Sae;Jang, Duk-Sung
    • The Transactions of the Korea Information Processing Society / v.7 no.12 / pp.3855-3865 / 2000
  • Automatic indexing is an essential part of a text retrieval system. However, automatic indexing alone cannot rank the appropriate texts at the top, and it is even harder to keep inappropriate texts containing homonyms from being ranked at the top. In this paper, we propose a two-level retrieval system to enhance retrieval efficiency, in which an Automatic Keyword Network (AKN) is used in the second-level process. The first-level search is carried out with an inverted index file generated by automatic indexing. The second-level search exploits the AKN, which is based on the degree of association between terms. We have developed several formulas for rearranging the rank of texts in the second-level search and evaluated their effect on resolving word sense ambiguities.
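The two-level idea can be sketched as follows. The abstract does not give the paper's AKN formulas, so this example substitutes a simple assumption: the degree of association between two terms is the Jaccard overlap of the document sets they appear in, and a candidate's second-level score is the summed association between query terms and the document's terms. The toy corpus and all function names are hypothetical.

```python
from collections import defaultdict

def build_index(docs):
    """First-level structure: inverted index mapping each term to its document ids."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.split():
            index[term].add(doc_id)
    return index

def association(index, a, b):
    """Assumed association measure: Jaccard overlap of the terms' document sets."""
    docs_a, docs_b = index.get(a, set()), index.get(b, set())
    union = docs_a | docs_b
    return len(docs_a & docs_b) / len(union) if union else 0.0

def first_level(index, query):
    """Return every document that contains at least one query term."""
    hits = set()
    for term in query:
        hits |= index.get(term, set())
    return hits

def second_level(index, docs, query, candidates):
    """Re-rank candidates by summed association between query and document terms."""
    def score(doc_id):
        terms = set(docs[doc_id].split())
        return sum(association(index, q, t) for q in query for t in terms)
    return sorted(candidates, key=score, reverse=True)

# Toy corpus: "bank" is a homonym (river bank vs. financial bank).
docs = {
    1: "river bank water fishing",
    2: "bank interest money account",
    3: "loan interest money credit",
}
index = build_index(docs)
query = ["bank", "loan"]
candidates = first_level(index, query)
ranking = second_level(index, docs, query, candidates)
```

With this toy data the first level returns all three documents, and the second level places the river-bank document last because its terms have no association with "loan". A real keyword network would be built from a much larger corpus and a more careful association measure.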
