Search | Korea Science

A Study of automatic indexing based on the linguistic analysis for newspaper articles (언어학적 분석기법에 의한 신문기사 자동색인시스팀 설계에 관한 연구)

Seo, Gyeong-Ju;SaGong, Cheol
- Journal of the Korean Society for information Management
- /
- v.8 no.1
- /
- pp.78-99
- /
- 1991
So far, most of Korea's newspapers indexing have been done manually using tesaurus. In recent years, however, the need for automatic indexing system has grown stronger so as for indexers to save time, efforts and money. And some newspapers have started establishing their databases along with introducing electronic newspapers and CTS. This thesis is on establishing and automatic indexing system for the full-text of the Korea Economic Daily's articles, which have been accumulated in its database, KETEL. In my thesis, I suggest methods to create a keyword file, a stopword list, an auxiliary word list and an infected word list by applying linguistic analysis methods to Hangul, taking advantage of the language's morphological peculiarity. Through these studies, I was able to reach four conclusions as follows. First, we can obtain satisfactory keywords by automatic indexing methods that were made through morphological analysis. Second, an indexer can improve the efficiency of indexing work by controlling extracted vocabulary, as syntax analysis and semantic analysis is not complete in Hangul. Third, The keyword file in this system which is made of about 20,000 most-frequently-used newspaper terms can be used in the future in compiling a thesaurus. Finally, the suggested methods to prepare an auxiliary word list and an infected word list can be applicable to designing other automatic systems.
PDF

Determination of Fire Risk Assessment Indicators for Building using Big Data (빅데이터를 활용한 건축물 화재위험도 평가 지표 결정)

Joo, Hong-Jun;Choi, Yun-Jeong;Ok, Chi-Yeol;An, Jae-Hong
- Journal of the Korea Institute of Building Construction
- /
- v.22 no.3
- /
- pp.281-291
- /
- 2022
This study attempts to use big data to determine the indicators necessary for a fire risk assessment of buildings. Because most of the causes affecting the fire risk of buildings are fixed as indicators considering only the building itself, previously only limited and subjective assessment has been performed. Therefore, if various internal and external indicators can be considered using big data, effective measures can be taken to reduce the fire risk of buildings. To collect the data necessary to determine indicators, a query language was first selected, and professional literature was collected in the form of unstructured data using a web crawling technique. To collect the words in the literature, pre-processing was performed such as user dictionary registration, duplicate literature, and stopwords. Then, through a review of previous research, words were classified into four components, and representative keywords related to risk were selected from each component. Risk-related indicators were collected through analysis of related words of representative keywords. By examining the indicators according to their selection criteria, 20 indicators could be determined. This research methodology indicates the applicability of big data analysis for establishing measures to reduce fire risk in buildings, and the determined risk indicators can be used as reference materials for assessment.
https://doi.org/10.5345/JKIBC.2022.22.3.281 인용 PDF KSCI

Subject-Balanced Intelligent Text Summarization Scheme (주제 균형 지능형 텍스트 요약 기법)

Yun, Yeoil;Ko, Eunjung;Kim, Namgyu
- Journal of Intelligence and Information Systems
- /
- v.25 no.2
- /
- pp.141-166
- /
- 2019
Recently, channels like social media and SNS create enormous amount of data. In all kinds of data, portions of unstructured data which represented as text data has increased geometrically. But there are some difficulties to check all text data, so it is important to access those data rapidly and grasp key points of text. Due to needs of efficient understanding, many studies about text summarization for handling and using tremendous amounts of text data have been proposed. Especially, a lot of summarization methods using machine learning and artificial intelligence algorithms have been proposed lately to generate summary objectively and effectively which called "automatic summarization". However almost text summarization methods proposed up to date construct summary focused on frequency of contents in original documents. Those summaries have a limitation for contain small-weight subjects that mentioned less in original text. If summaries include contents with only major subject, bias occurs and it causes loss of information so that it is hard to ascertain every subject documents have. To avoid those bias, it is possible to summarize in point of balance between topics document have so all subject in document can be ascertained, but still unbalance of distribution between those subjects remains. To retain balance of subjects in summary, it is necessary to consider proportion of every subject documents originally have and also allocate the portion of subjects equally so that even sentences of minor subjects can be included in summary sufficiently. In this study, we propose "subject-balanced" text summarization method that procure balance between all subjects and minimize omission of low-frequency subjects. For subject-balanced summary, we use two concept of summary evaluation metrics "completeness" and "succinctness". Completeness is the feature that summary should include contents of original documents fully and succinctness means summary has minimum duplication with contents in itself. Proposed method has 3-phases for summarization. First phase is constructing subject term dictionaries. Topic modeling is used for calculating topic-term weight which indicates degrees that each terms are related to each topic. From derived weight, it is possible to figure out highly related terms for every topic and subjects of documents can be found from various topic composed similar meaning terms. And then, few terms are selected which represent subject well. In this method, it is called "seed terms". However, those terms are too small to explain each subject enough, so sufficient similar terms with seed terms are needed for well-constructed subject dictionary. Word2Vec is used for word expansion, finds similar terms with seed terms. Word vectors are created after Word2Vec modeling, and from those vectors, similarity between all terms can be derived by using cosine-similarity. Higher cosine similarity between two terms calculated, higher relationship between two terms defined. So terms that have high similarity values with seed terms for each subjects are selected and filtering those expanded terms subject dictionary is finally constructed. Next phase is allocating subjects to every sentences which original documents have. To grasp contents of all sentences first, frequency analysis is conducted with specific terms that subject dictionaries compose. TF-IDF weight of each subjects are calculated after frequency analysis, and it is possible to figure out how much sentences are explaining about each subjects. However, TF-IDF weight has limitation that the weight can be increased infinitely, so by normalizing TF-IDF weights for every subject sentences have, all values are changed to 0 to 1 values. Then allocating subject for every sentences with maximum TF-IDF weight between all subjects, sentence group are constructed for each subjects finally. Last phase is summary generation parts. Sen2Vec is used to figure out similarity between subject-sentences, and similarity matrix can be formed. By repetitive sentences selecting, it is possible to generate summary that include contents of original documents fully and minimize duplication in summary itself. For evaluation of proposed method, 50,000 reviews of TripAdvisor are used for constructing subject dictionaries and 23,087 reviews are used for generating summary. Also comparison between proposed method summary and frequency-based summary is performed and as a result, it is verified that summary from proposed method can retain balance of all subject more which documents originally have.
https://doi.org/10.13088/jiis.2019.25.2.141 인용 PDF KSCI HTML

A Study on Recordkeeping System in Australia (호주의 레코드키핑 시스템에 대한 연구)

Lee, Young-Sook
- Journal of Korean Society of Archives and Records Management
- /
- v.4 no.2
- /
- pp.76-90
- /
- 2004
There had been substantial demand for record management system with which to efficiently control the information circulation processes, involving accumulation of recorded materials, classification of information resources, and users access to them. It converged to a collaboration of Australian federation, and Sydney Records Centre and finally induced Australian Standard Records Management, commonly known as AS 4390. AS 4390 served later as a model for International Standard of Record Management. This paper introduces the current undertaking of Recordkeeping system development in Australia, which stems from the line of AS 4390 by analysing exhibited research approaches. The analysis includes the definition, regime of Recordkeeping system, design and implementing of guidelines of Recordkeeping System and information on metadata projects. It also highlights the necessity for standardization, as is the prime factor in promoting inter-linking of Tabularium on New Southwales State, CRS(Commonwealth Record Series), database system of Canberra National Archives and Australian Government Locator Service. From year 2005, as dictates, any record management system, serving public agency will be required to adapt Professional Archives Management System, which, by far, will enhance the inter-compatibility. In its application, the government need Thesaurus to eliminate possible redundancy in use of terminology and to promote correct usage of words.
https://doi.org/10.14404/JKSARM.2004.4.2.076 인용 PDF

An Investigation on the Problem in the Local Names of Myrtus communis (도금양나무(Myrtus communis)의 명칭문제 고찰)

Kim, Young-Sook;Ahn, Gye-Bog
- Journal of the Korean Institute of Traditional Landscape Architecture
- /
- v.35 no.2
- /
- pp.69-76
- /
- 2017
The following summarizes the findings from an analysis of literature and 21 versions of the Bible published in Korea, China, and Japan to discuss the name of Myrtus communis. Myrtus communis was an important tree symbolizing love and resurrection since the Ancient Mesopotamia, Egypt, Judas, Greece, Ancient Rome, and Medieval Spain. In the Bible, Myrtus ($h{\acute{a}}das$) was used to make the booths at the Feast of Tabernacles or for various ceremonies. Myrtus symbolized the people of Israel and also symbolized peace, appreciation, indestructibility, and resurrection. In the Bible of Korea, China, and Japan, Myrtus was translated into various names by time, such as '崗拈樹', '千里香', '鳥拈', '番石榴', 桃金孃, Gamtangnamu, Seoglyunamu, Hwaseoglyu, Sogwinamu. 'Myrtle' was translated into '桃金孃' based on Japan's "熟語本位英和中?典(1915)" and it seems that the mistake was directly excerpted by the English-Korean Dictionary(1949) after the Liberation. According to the theory of 'Dynamic Equivalence' in translation, it would be best to use 'Myrtus' was the official name of Myrtus communis.
https://doi.org/10.14700/KITLA.2017.35.2.69 인용 PDF KSCI

Automatic Text Summarization based on Selective Copy mechanism against for Addressing OOV (미등록 어휘에 대한 선택적 복사를 적용한 문서 자동요약)

Lee, Tae-Seok;Seon, Choong-Nyoung;Jung, Youngim;Kang, Seung-Shik
- Smart Media Journal
- /
- v.8 no.2
- /
- pp.58-65
- /
- 2019
Automatic text summarization is a process of shortening a text document by either extraction or abstraction. The abstraction approach inspired by deep learning methods scaling to a large amount of document is applied in recent work. Abstractive text summarization involves utilizing pre-generated word embedding information. Low-frequent but salient words such as terminologies are seldom included to dictionaries, that are so called, out-of-vocabulary(OOV) problems. OOV deteriorates the performance of Encoder-Decoder model in neural network. In order to address OOV words in abstractive text summarization, we propose a copy mechanism to facilitate copying new words in the target document and generating summary sentences. Different from the previous studies, the proposed approach combines accurate pointing information and selective copy mechanism based on bidirectional RNN and bidirectional LSTM. In addition, neural network gate model to estimate the generation probability and the loss function to optimize the entire abstraction model has been applied. The dataset has been constructed from the collection of abstractions and titles of journal articles. Experimental results demonstrate that both ROUGE-1 (based on word recall) and ROUGE-L (employed longest common subsequence) of the proposed Encoding-Decoding model have been improved to 47.01 and 29.55, respectively.
https://doi.org/10.30693/SMJ.2019.8.2.58 인용 PDF KSCI

Mapping Heterogenous Ontologies for the HLP Applications - Sejong Semantic Classes and KorLexNoun 1.5 - (인간언어공학에의 활용을 위한 이종 개념체계 간 사상 - 세종의미부류와 KorLexNoun 1.5 -)

Bae, Sun-Mee;Im, Kyoung-Up;Yoon, Ae-Sun
- Korean Journal of Cognitive Science
- /
- v.21 no.1
- /
- pp.95-126
- /
- 2010
This study proposes a bottom-up and inductive manual mapping methodology for integrating two heterogenous fine-grained ontologies which were built by a top-down and deductive methodology, namely the Sejong semantic classes (SJSC) and the upper nodes in KorLexNoun 1.5 (KLN), for HLP applications. It also discusses various problematics in the mapping processes of two language resources caused by their heterogeneity and proposes the solutions. The mapping methodology of heterogeneous fine-grained ontologies uses terminal nodes of SJSC and Least Upper Bounds (LUB) of KLN as basic mapping units. Mapping procedures are as follows: first, the mapping candidate groups are decided by the lexfollocorrelation between the synsets of KLN and the noun senses of Sejong Noun Dfotionaeci(SJND) which are classified according to SJSC. Secondly, the meanings of the candidate groups are precisely disambiguated by linguistic information provided by the two ontologies, i.e. the hierarchicllostructures, the definitions, and the exae les. Thirdly, the level of LUB is determined by applying the appropriate predicates and definitions of SJSC to the upper-lower and sister nodes of the candidate LUB. Fourthly, the mapping possibility ic inthe terminal node of SJSC is judged by che aring hierarchicllorelations of the two ontologies. Finally, the ituorrect synsets of KLN and terminologiollocandidate groups are excluded in the mapping. This study positively uses various language information described in each ontology for establishing the mapping criteria, and it is indeed the advantage of the fine-grained manual mapping. The result using the proposed methodology shows that 6,487 LUBs are mapped with 474 terminal and non-terminal nodes of SJSC, excluding the multiple mapped nodes, and that 88,255 nodes of KLN are mapped including all lower-level nodes of the mapped LUBs. The total mapping coverage is 97.91% of KLN synsets. This result can be applied in many elaborate syntactic and semantic analyses for Korean language processing.
PDF

Development of case-based learning and co-teaching clinical practice education model for pre-service nurses (예비간호사를 위한 사례기반학습 및 코티칭 임상실습 교육모형 개발)

Hyunjeong Kim;Heekyoung Hyoung;Hyunwoo Kim;Seryeong Kim
- Journal of Christian Education in Korea
- /
- v.72
- /
- pp.245-271
- /
- 2022
The purpose of this study is to develop a nursing clinical practice education model that applies case-based learning and co-teaching to nursing students, and to secure the validity of the developed model. To verify the validity of the nursing clinical practice education model, it was applied to the subject of 'Health Response and Nursing VI (Perception/ Cognition) Practice' in the 2nd semester of 2021 at J University in Jeonju, and the instructor's response to the model was evaluated. Surveys and focus group interviews were conducted on confidence in clinical practice and teaching and learning models. After deriving the case-based learning stage and co-teaching elements through a review of precedent literature and case studies, an initial model was devised after expert review, and the devised model was reviewed for internal validity by nursing education experts, and then modified and supplemented. As a result of the learner response evaluation conducted after applying the model to the clinical practice subject for external validation verification, the confidence in clinical performance was 4.22 points and the satisfaction with the teaching-learning model was 4.68 points. Summarizing the results of the focus group interview, the importance of prior learning and the learning of selected cases based on actual cases, learning terminology and professional knowledge, eliminated fear of the practice field, felt familiar, and learned various cases. He said that he was able to think critically through the time to organize the knowledge learned in the practice field. In addition, through co-teaching, it was found that field leaders and advisors taught the theoretical and practical aspects at the same time through examples, thereby experiencing practical education closer to practice. It is expected that the nursing clinical practice education model developed through this study, applying case-based learning and co-teaching, will be an effective teaching and learning model that can reduce the gap between theory and practice and improve the clinical performance of nursing students.
https://doi.org/10.17968/jcek.2022..72.012 인용 PDF

A Study on the Response Plan through the Analysis of North Korea's Drones Terrorism at Critical National Facilities - Focusing on Improvement of Laws and Systems - (국가중요시설에 대한 북한의 드론테러 위협 분석을 통한 대응방안 연구 - 법적·제도적 개선을 중심으로 -)

Choong soo Ha
- Journal of the Society of Disaster Information
- /
- v.19 no.2
- /
- pp.395-410
- /
- 2023
Purpose: The purpose of this study was to analyze the current state of drone terrorism response at such critical national facilities and derive improvements, especially to identify problems in laws and systems to effectively utilize the anti-drone system and present directions for improvement. Method: A qualitative research method was used for this study by analyzing a variety of issues not discussed in existing research papers and policy documents through in-depth interviews with subject matter experts. In-depth interviews were conducted based on 12 semi-structured interviews by selecting 16 experts in the field of anti-drone and terrorism in Korea. The interview contents were recorded with the prior consent of the study participants, transcribed back to the Korean file, and problems and improvement measures were derived through coding. For this, the threats and types were analyzed based on the cases of drone terrorism occurring abroad and measures to establish anti-drone system were researched from the perspective of laws and systems by evaluating the possibility of drone terrorism in the Republic of Korea. Result: As a result of the study, improvements to some of the problems that need to be preceded in order to effectively respond to drone terrorism at critical national facilities in the Republic of Korea, have been identified. First, terminologies related to critical national facilities and drone terrorism should be clearly defined and reflected in the Integrated Defense Act and the Terrorism Prevention Act. Second, the current concept of protection of critical national facilities should evolve from the current ground-oriented protection to a three-dimensional protection concept that considers air threats and the Integrated Defense Act should reflect a plan to effectively install the anti-drone system that can materialize the concept. Third, a special law against flying over critical national facilities should be enacted. To this end, legislation should be enacted to expand designated facilities subject to flight restrictions while minimizing the range of no fly zone, but the law should be revised so that the two wings of "drone industry development" and "protection of critical national facilities" can develop in a balanced manner. Fourth, illegal flight response system and related systems should be improved and reestablished. For example, it is necessary to prepare a unified manual for general matters, but thorough preparation should be made by customizing it according to the characteristics of each facility, expanding professional manpower, and enhancing response training. Conclusion: The focus of this study is to present directions for policy and technology development to establish an anti-drone system that can effectively respond to drone terrorism and illegal drones at critical national facilities going forward.
https://doi.org/10.15683/kosdi.2023.6.30.395 인용 PDF HTML

Search Result 89, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)