• Title/Summary/Keyword: 용어사전 (terminology dictionary)

Method Customizing From Web-based English-Korean MT System To English-Korean MT System for Patent Documents (웹 영한 번역기로부터 특허 영한 번역기로의 특화 방법)

  • Choi, Sung-Kwon;Kwon, Oh-Woog;Lee, Ki-Young;Roh, Yoon-Hyung;Park, Sang-Kyu
    • Annual Conference on Human and Language Technology / 2006.10e / pp.57-64 / 2006
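  • This paper describes a method for customizing an English-Korean machine translation (MT) system built for a general domain such as the web into an English-Korean MT system for patent documents. The customization proceeds through the following steps: 1) linguistic analysis of a large patent corpus, 2) extraction of technical terms from the large patent corpus and construction of their target-language equivalents, 3) customization of the target words in the existing translation dictionary, 4) extraction and construction of translation patterns specific to patent documents, 5) customization and improvement of the translation engine modules according to the linguistic analysis, and 6) evaluation of the translation accuracy obtained with the customized translation knowledge and engine modules. The resulting patent English-Korean MT system achieved an average translation accuracy of 81.03% across all fields, as evaluated by professional patent translators. By field, it scored 80.54% in machinery, 81.58% in electricity and electronics, 79.92% in general chemistry, 80.79% in medicine and hygiene, and 82.29% in computers, and it is still being improved. The system described in this paper is currently used at the patent support center of the Ministry of Commerce, Industry and Energy (http://www.ipac.or.kr) to provide Korean translations when patent attorneys and patent examiners search English patent documents in the electrical and electronics field, and an English-Korean translation service covering patent documents in all fields is planned for 2007.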

Progress of Management Policy and Research of Place Names in North Korea (북한의 지명관리 정책과 연구 동향 분석)

  • Kim, Kihyuk
    • Journal of the Korean association of regional geographers / v.19 no.1 / pp.14-30 / 2013
  • Place names in North Korea have been regarded as an effective instrument of revolution since the division of the territory (1945) and are a typical case in which political ideology has affected place names. In North Korea in particular, the self-reliance ideology (Juche Idea) and the idolization of Kim Il Sung influenced place names. With the local administrative district reform of 1952, the names of districts and villages were changed on a national scale. A national survey of place names was carried out in 1964~1966 with the direct support of Kim Il Sung. After this survey, North Korea altered place names to serve the idolization of the Kim Il Sung family as well as socialist revolution. Encyclopedias of place names were widely published. Almost all linguists were forced to produce writings and papers praising the legitimacy of the new place names. It should be noted, however, that research trends have been changing slowly since the 2000s. Research serving the idolization of Kim Il Sung has become less important.

Feature-selection algorithm based on genetic algorithms using unstructured data for attack mail identification (공격 메일 식별을 위한 비정형 데이터를 사용한 유전자 알고리즘 기반의 특징선택 알고리즘)

  • Hong, Sung-Sam;Kim, Dong-Wook;Han, Myung-Mook
    • Journal of Internet Computing and Services / v.20 no.1 / pp.1-10 / 2019
  • Because big-data text mining extracts many features and much data, clustering and classification can suffer from high computational complexity and low reliability of the analysis results. In particular, the term-document matrix obtained through text mining represents term-document features but is sparse. We designed an advanced genetic algorithm (GA) to extract features in text mining for a detection model. Term frequency-inverse document frequency (TF-IDF) is used to reflect the document-term relationships in feature extraction. Through an iterative process, a predetermined number of features is selected. We also used a sparsity score to improve the performance of the detection model. If a spam mail data set has high sparsity, the detection model performs poorly and it is difficult to find an optimal detection model. In addition, we find a low-sparsity model that also has a high TF-IDF score by using s(F) as the numerator of the fitness function. We also verified its performance by applying the proposed algorithm to text classification. As a result, we found that our algorithm shows higher performance (speed and accuracy) in attack mail classification.
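
The abstract does not give the fitness function itself, so the following is only a minimal sketch of how a candidate feature subset could be scored on TF-IDF weight and sparsity together. The names `tf_idf`, `sparsity`, and `fitness`, and the formula "mean TF-IDF times (1 - sparsity)", are assumptions made for illustration and are not the authors' definitions; the GA search loop is omitted.

```python
import math

def tf_idf(docs):
    """Return (vocab, matrix); matrix[d][t] is the TF-IDF weight of term t in document d."""
    vocab = sorted({w for doc in docs for w in doc})
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = {w: sum(1 for doc in docs if w in doc) for w in vocab}
    matrix = []
    for doc in docs:
        row = []
        for w in vocab:
            tf = doc.count(w) / len(doc)
            idf = math.log(n / df[w])
            row.append(tf * idf)
        matrix.append(row)
    return vocab, matrix

def sparsity(matrix, columns):
    """Fraction of zero entries in the sub-matrix restricted to the selected columns."""
    cells = [row[c] for row in matrix for c in columns]
    return sum(1 for v in cells if v == 0) / len(cells)

def fitness(matrix, columns):
    """Hypothetical fitness (an assumption): mean TF-IDF of the subset times (1 - sparsity)."""
    cells = [row[c] for row in matrix for c in columns]
    mean_weight = sum(cells) / len(cells)
    return mean_weight * (1.0 - sparsity(matrix, columns))

# Toy corpus, invented for the example.
docs = [
    "win free prize now".split(),
    "free prize claim now".split(),
    "meeting agenda for monday".split(),
]
vocab, matrix = tf_idf(docs)
dense = [vocab.index(w) for w in ("free", "prize", "now")]
sparse = [vocab.index(w) for w in ("win", "claim", "agenda")]
print(sparsity(matrix, dense), sparsity(matrix, sparse))
print(fitness(matrix, dense) > fitness(matrix, sparse))
```

In a GA, each chromosome would encode one such column subset, and a score of this kind would decide which subsets survive to the next generation.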

Using similarity based image caption to aid visual question answering (유사도 기반 이미지 캡션을 이용한 시각질의응답 연구)

  • Kang, Joonseo;Lim, Changwon
    • The Korean Journal of Applied Statistics / v.34 no.2 / pp.191-204 / 2021
  • Visual Question Answering (VQA) and image captioning are tasks that require understanding both the features of images and the linguistic features of text. Co-attention, which can connect image and text, may therefore be the key to both tasks. In this paper, we propose a model that achieves high VQA performance by using image captions generated with a standard transformer model pretrained on the MSCOCO dataset. Captions unrelated to the question can interfere with answering, so only captions similar to the question were selected, based on their similarity to the question. In addition, stopwords in the captions either do not contribute to answering or may interfere with it, so the experiments were conducted after removing stopwords. Experiments were conducted on the VQA-v2 data to compare the proposed model with the deep modular co-attention network (MCAN) model, which showed good performance by using co-attention between images and text. As a result, the proposed model outperformed the MCAN model.
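
The caption-selection step can be sketched as follows. The abstract does not say which similarity measure was used, so this sketch assumes a bag-of-words cosine similarity after stopword removal. The stopword list, the function names, and the sample captions are all hypothetical.

```python
import math
from collections import Counter

# Tiny illustrative stopword list (an assumption, not the list used in the paper).
STOPWORDS = {"a", "an", "the", "is", "are", "of", "on", "in", "what", "to", "and"}

def tokens(text):
    """Lowercase, strip simple punctuation, and drop stopwords."""
    cleaned = text.lower().replace("?", " ").replace(".", " ")
    return [w for w in cleaned.split() if w not in STOPWORDS]

def cosine(a, b):
    """Cosine similarity between two token lists treated as bags of words."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[w] * cb[w] for w in ca)
    na = math.sqrt(sum(v * v for v in ca.values()))
    nb = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (na * nb) if na and nb else 0.0

def select_captions(question, captions, k=1):
    """Keep the k captions most similar to the question."""
    q = tokens(question)
    ranked = sorted(captions, key=lambda c: cosine(q, tokens(c)), reverse=True)
    return ranked[:k]

captions = [
    "A man is riding a horse on the beach.",
    "The sky is cloudy and grey.",
    "A dog sits in the sand.",
]
best = select_captions("What is the man riding?", captions, k=1)
print(best)
```

Only the first caption shares content words with the question, so it is the one passed on to the answering model in this toy setup.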

A Study on Implementation of Humane Resource Pool Recruitment system Using Blockchain

  • Lee, Ji-Woon;Seo, Hee-Suk
    • Journal of the Korea Society of Computer and Information / v.26 no.2 / pp.69-78 / 2021
  • In this paper, we propose an implementation plan for a human resource pool recruitment system using a private (permissioned) blockchain. The term "human resource" has come into common use, and people have come to be recognized as resources. Despite these changes, the use of human resource pools has been sluggish. Once entered, information is often not updated regularly, and existing pools do not provide sharing, searching, career management, or anti-counterfeiting. In this research, to provide a human resource pool recruitment system that uses a private (permissioned) blockchain, we first used the blockchain network to enable sharing and searching of human resource pools, with keywords used to retrieve results that meet certain conditions. Second, we added an institutional verification process to ensure the integrity of the input data and prepared preventive measures in the non-technical part by using the structural characteristics of the blockchain to prevent counterfeiting and alteration. Third, we designed and implemented a DApp (decentralized application) that includes a web UI so that each of the three groups can control the blockchain and the predefined processes and business logic.
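
To illustrate why a blockchain's structure makes alteration detectable, here is a minimal hash-chain sketch with a keyword search over the stored records. It is not the authors' DApp and does not model a permissioned network or institutional verification. The record fields (`name`, `skills`) and all function names are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder previous-hash for the first block

def block_hash(index, prev_hash, record):
    """SHA-256 over a canonical JSON encoding of the block contents."""
    payload = json.dumps({"index": index, "prev": prev_hash, "record": record}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()

def append(chain, record):
    """Append a record, linking it to the hash of the previous block."""
    prev = chain[-1]["hash"] if chain else GENESIS
    index = len(chain)
    chain.append({"index": index, "prev": prev, "record": record,
                  "hash": block_hash(index, prev, record)})

def is_valid(chain):
    """Recompute every hash and link; any edited record breaks the check."""
    prev = GENESIS
    for i, block in enumerate(chain):
        if block["index"] != i or block["prev"] != prev:
            return False
        if block["hash"] != block_hash(i, prev, block["record"]):
            return False
        prev = block["hash"]
    return True

def search(chain, keyword):
    """Return the names of candidates whose skills include the keyword."""
    return [b["record"]["name"] for b in chain if keyword in b["record"]["skills"]]

chain = []
append(chain, {"name": "A", "skills": ["python", "sql"]})
append(chain, {"name": "B", "skills": ["java"]})
print(is_valid(chain), search(chain, "python"))

chain[0]["record"]["skills"].append("java")  # tamper with a stored record
print(is_valid(chain))
```

After the tampering step the stored hash of the first block no longer matches its contents, so validation fails.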

Improving the Functions of Digital Textbooks to Prepare for the post COVID-19 (포스트 코로나를 대비한 디지털교과서의 기능 개선)

  • Kim, Hong-sun;Jeong, Young-sik
    • Proceedings of the Korean Association of Information Education Conference (한국정보교육학회:학술대회논문집) / 2021.08a / pp.283-288 / 2021
  • During the COVID-19 pandemic, digital textbooks have been used in many schools. For digital textbooks to remain in active use in the post-COVID-19 era, their functions must be improved. Digital textbooks are traditional book-type textbooks with glossaries, video materials, and evaluation questions added. Recently, they have also been put to good use in practical education by providing realistic content such as augmented reality, virtual reality, and 360-degree images. In this study, to prepare for the post-COVID-19 era, we identified functional problems of digital textbooks and suggested ways to improve them. First, the layout of digital textbooks should be developed as a responsive layout rather than mirroring the form of a book-type textbook. Second, digital textbooks and learning management systems must be integrated. Third, a digital textbook for teachers should be developed so that teachers can directly reorganize the contents or add external materials. Fourth, learning analytics should be possible using the data recorded in digital textbooks. Fifth, under the 2022 revised curriculum, digital textbooks should be developed for a variety of subjects.

Semantic analysis of unstructured information considering the step in progress of water quality accidents in the water supply systems (상수도시스템 수질사고의 전개양상을 고려한 비정형정보 의미분석)

  • Hong, Sungjin;Moon, Gihoon;Yang, Seong Hun;Yoo, Do Guen
    • Proceedings of the Korea Water Resources Association Conference / 2022.05a / pp.378-378 / 2022
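  • When a water quality problem occurs across an entire area at the service stage, the final stage of a water supply system, resolving the direct and indirect damage can take a long time. In this study, we carried out a basic analysis, based on big-data analysis of real-time unstructured information, for tracking changes in the linkages between the ripple effects of water quality accidents in water supply systems and secondary damage. Taking the larvae outbreak in Incheon Metropolitan City, a past large-scale water quality accident, as the case, we established a web-crawling procedure for news articles and analyzed the results. News articles surfaced by Naver integrated search were crawled for one year starting from July 13, 2020, when 'Incheon larvae' (인천 유충) was first reported, and a total of 920 articles were analyzed after programmatic stopword removal and a relevance review. The articles were divided arbitrarily into four stages following the progression of the accident: occurrence, spread, recovery, and compensation. Latent Dirichlet Allocation (LDA) was applied as the topic modeling technique for semantic analysis, and positive/negative sentiment analysis was performed using the KNU Korean sentiment lexicon (KNU sentiment lexicon). Topic modeling yielded articles with combinations of keywords appropriate to each stage, from occurrence through spread, recovery, and compensation, and the ratio of positive to negative articles at each stage was also found to be consistent with the stage of the accident. The proposed methodology and results for analyzing unstructured information on water quality accidents can be used to fix search keywords and positive/negative keywords through the analysis of past accidents, and to set criteria for situation assessment based on changes in keyword occurrence rates (before and after an accident).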
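
The per-stage positive/negative ratio can be sketched with a lexicon-based scorer. The word list below is a hypothetical English stand-in and is not the KNU sentiment lexicon, the sample articles are invented, and the LDA topic-modeling step is not shown.

```python
from collections import defaultdict

# Hypothetical mini-lexicon: +1 for positive words, -1 for negative words.
LEXICON = {"restored": 1, "safe": 1, "contaminated": -1, "larvae": -1, "complaints": -1}

def polarity(text):
    """Label a text by the sign of its summed lexicon scores."""
    score = sum(LEXICON.get(word, 0) for word in text.lower().split())
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

def ratio_by_stage(articles):
    """articles: iterable of (stage, text). Returns {stage: {label: share of articles}}."""
    counts = defaultdict(lambda: defaultdict(int))
    for stage, text in articles:
        counts[stage][polarity(text)] += 1
    return {
        stage: {label: n / sum(labels.values()) for label, n in labels.items()}
        for stage, labels in counts.items()
    }

articles = [
    ("occurrence", "larvae found in contaminated tap water"),
    ("occurrence", "residents file complaints"),
    ("recovery", "supply restored and declared safe"),
    ("recovery", "larvae complaints continue"),
]
ratios = ratio_by_stage(articles)
print(ratios)
```

Comparing these shares across the occurrence, spread, recovery, and compensation stages is one simple way to check whether sentiment follows the progression of an accident.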

Development and Application of Issue-Centered Teaching.Learning Process Plan for Environment-Friendly Housing Education (환경친화적 주생활 교육을 위한 쟁점중심 교수.학습 과정안 개발 및 적용)

  • Park, Hee-Jeong;Cho, Jae-Soon
    • Journal of Korean Home Economics Education Association / v.21 no.3 / pp.45-64 / 2009
  • The purpose of this study was to develop an issue-centered teaching-learning process plan for environment-friendly housing education and to apply it to the housing section of Technology Home Economics in a middle school. A pro-con cooperative group model was used for the teaching-learning process plans of 2-session lessons, following the ADDIE model. In the development stage, 7 activity materials and 20 teaching-learning materials (4 reading texts, 12 pictures and photos, & 5 moving pictures) were developed for the 2-session lessons. The plans were applied to 7 classes (222 students) in the third grade of G middle school in Gyeonggi-do during July 10th-17th, 2008. The results showed that a student's final pro-con position was influenced by the rationales behind each team's jury decisions and by the representatives' discussion, in addition to the student's own environmental perspective. The intention to carry out environment-friendly housing activities increased significantly after the lessons. The contents, methods, goals, and process of the 2-session lessons were rated at medium to somewhat higher levels. These evaluations, except for methods and general satisfaction with the lessons, differed by sex and by the students' and their families' interest in the environment, but not by type of housing. These results suggest that the pro-con cooperative group model applied to a controversial issue about parking lots is appropriate for environment-friendly housing lessons and could be applied to broader housing issues and to schools in other areas.

Subject-Balanced Intelligent Text Summarization Scheme (주제 균형 지능형 텍스트 요약 기법)

  • Yun, Yeoil;Ko, Eunjung;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.25 no.2 / pp.141-166 / 2019
  • Recently, channels such as social media and SNS have been creating enormous amounts of data. Among all kinds of data, the share of unstructured data represented as text has grown geometrically. Because it is difficult to read all of this text, it is important to access it rapidly and grasp its key points. Driven by this need for efficient understanding, many studies on text summarization for handling and using tremendous amounts of text data have been proposed. In particular, many summarization methods using machine learning and artificial intelligence algorithms, known as "automatic summarization", have recently been proposed to generate summaries objectively and effectively. However, almost all text summarization methods proposed to date construct summaries based on the frequency of contents in the original documents. Such summaries tend to omit low-weight subjects that are mentioned less often in the original text. If a summary includes only the contents of major subjects, bias occurs and information is lost, so it is hard to ascertain every subject the documents contain. To avoid this bias, one can summarize with a balance between the topics a document has so that all subjects can be ascertained, but an imbalance in the distribution of those subjects still remains. To retain a balance of subjects in the summary, it is necessary to consider the proportion of every subject the documents originally have and also to allocate portions to the subjects equally, so that even sentences on minor subjects are sufficiently included in the summary. In this study, we propose a "subject-balanced" text summarization method that secures balance among all subjects and minimizes the omission of low-frequency subjects. For subject-balanced summaries, we use two summary evaluation metrics, "completeness" and "succinctness". Completeness means that a summary should fully include the contents of the original documents, and succinctness means that a summary has minimal duplication within itself. The proposed method has three phases. The first phase constructs subject term dictionaries. Topic modeling is used to calculate topic-term weights, which indicate the degree to which each term is related to each topic. From the derived weights, highly related terms for every topic can be identified, and the subjects of the documents can be found from topics composed of terms with similar meanings. Then a few terms that represent each subject well are selected; in this method they are called "seed terms". However, these terms are too few to explain each subject sufficiently, so enough terms similar to the seed terms are needed for a well-constructed subject dictionary. Word2Vec is used for word expansion to find terms similar to the seed terms. Word vectors are created by Word2Vec modeling, and from those vectors the similarity between all terms can be derived using cosine similarity. The higher the cosine similarity between two terms, the stronger the relationship between them is taken to be. Terms that have high similarity to the seed terms of each subject are therefore selected, and after filtering these expanded terms the subject dictionary is finally constructed. The next phase allocates subjects to every sentence in the original documents. To grasp the contents of all sentences, frequency analysis is first conducted with the terms that make up the subject dictionaries. The TF-IDF weight of each subject is calculated after the frequency analysis, which shows how much each sentence explains each subject. However, TF-IDF weights can grow without bound, so the weights of every subject for each sentence are normalized to values between 0 and 1. Each sentence is then allocated to the subject with the maximum TF-IDF weight, and a sentence group is finally constructed for each subject. The last phase is summary generation. Sen2Vec is used to compute the similarity between subject sentences, and a similarity matrix is formed. By selecting sentences iteratively, it is possible to generate a summary that fully includes the contents of the original documents while minimizing duplication within the summary itself. To evaluate the proposed method, 50,000 TripAdvisor reviews were used to construct the subject dictionaries and 23,087 reviews were used to generate summaries. A comparison between summaries from the proposed method and frequency-based summaries verified that summaries from the proposed method better retain the balance of all the subjects the documents originally have.
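
Two of the steps described above, seed-term expansion by cosine similarity and subject allocation from normalized weights, can be sketched as follows. The two-dimensional vectors stand in for real Word2Vec embeddings, the raw weights stand in for per-subject TF-IDF scores, and the threshold is arbitrary. The sketch also assumes min-max normalization applied per subject across sentences, which the abstract does not specify. All names and values are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def expand(seed, vectors, threshold):
    """Terms whose similarity to the seed term reaches the threshold."""
    return sorted(
        term for term, vec in vectors.items()
        if term != seed and cosine(vectors[seed], vec) >= threshold
    )

def min_max(values):
    """Rescale values to the range [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(x - lo) / (hi - lo) for x in values]

def allocate(weights):
    """weights: {subject: [raw weight per sentence]}. Returns one subject per sentence."""
    normed = {subject: min_max(w) for subject, w in weights.items()}
    n_sentences = len(next(iter(weights.values())))
    return [max(normed, key=lambda s: normed[s][i]) for i in range(n_sentences)]

# Toy embeddings (assumed values, not trained vectors).
vectors = {
    "room": [1.0, 0.0],
    "bed": [0.9, 0.1],
    "suite": [0.8, 0.3],
    "pasta": [0.0, 1.0],
}
expanded = expand("room", vectors, threshold=0.9)

# Toy per-subject weights for three sentences.
weights = {"room": [3.0, 0.5, 1.0], "food": [0.2, 4.0, 2.0]}
assigned = allocate(weights)
print(expanded, assigned)
```

With real data, the vectors would come from a trained embedding model and the sentence groups produced by `allocate` would feed the summary-generation phase.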

Recognition and Narrative Aspects of the History of Korean Classic Literature from Two Korean Literature History Works Written in China (중국 한국문학사 2종의 한국고전문학사 인식과 서술 양상: 남북한문학사와 자국문학사의 수용과 변용을 중심으로)

  • Lee, Deung-yearn
    • Cross-Cultural Studies / v.48 / pp.67-106 / 2017
  • This study focuses on two histories of Korean literature written in Chinese: the outline of The History of Joseon Literature (2010) by Li Yan and The History of Joseon Literature (1988, 2008) by Wei Xu-sheng. It compares their narrative viewpoints with those of South and North Korean literary histories in order to identify their distinguishing characteristics. The following conclusions were reached. First, The History of Korean Literature by Cho Dong-il and The History of Korean Literature in North Korea (15 volumes) include thorough discussions of the division of historical eras, the concept of genres, and individual literary works, and apply those discussions to the writing of literary history. However, the histories by Wei Xu-sheng and Li Yan do not reflect the theoretical discussions of South and North Korea. Li Yan's outline of The History of Joseon Literature was published in 2010, and the first edition of Wei Xu-sheng's The History of Joseon Literature was published in 1986, with revised editions in 2000 and 2008. Given these publication dates, one would expect them to reference Cho Dong-il's The History of Korean Literature, published in the 1980s, or The History of Korean Literature in North Korea (15 volumes), published in the 1990s; nevertheless, neither Wei Xu-sheng nor Li Yan used those texts. Their works were heavily influenced by the narrative tradition of their own national literary history and therefore contain unsophisticated discussion of the division of historical eras and the concept of genres. Second, the two texts also emphasize external factors such as politics, society, economy, and culture, and mention these factors explicitly in the historical overview of each chapter. Such an approach is common in narratives of literary history under socialist regimes, including The History of Korean Literature in North Korea (15 volumes). Accordingly, evaluations based on 'political standards', stressing the people, nationality, practicality, and so forth, are particularly accentuated in the main texts, as in narratives of literary history under socialist regimes. Finally, since the two works were written by Chinese scholars, they focus on the correlation between Chinese and Korean literary history. However, several genre-related terms such as Xiaopin (a kind of essay), Yuefu (a kind of popular song/poem), Yuyan (fable), Shuochang (telling of popular stories with interspersed songs), and Shizhuan (biography and/or memoirs in history) were adopted directly from Chinese literature. In analyzing Korean literature with terms introduced from Chinese literature, the differences between the original and the adapted definitions were not examined in detail. While some terms and concepts were adopted directly without further consideration of the circumstances of the two nations, it is also interesting to note that a dichotomy mainly used in Korean literary history, rather than the traditions of Chinese literary history, was used to discuss the genre of Cheonki (romance tale), which is relevant to Suyichon and Keumosinhua.