Search | Korea Science

Automated Approaches for Extracting Specialized Terminology in Building Semantic Networks for Classical Languages (고전언어에서의 어휘 의미망 구축을 위한 전문용어 추출 자동화 방안)

Young Yun Baek;Young Bom Park
- Journal of Platform Technology / v.12 no.1 / pp.85-90 / 2024
People seeking knowledge or information increasingly turn to digital resources on the web rather than to analog printed media such as books or other publications. This shift is driven by the perception that digital resources, particularly digital dictionaries, are more effective and time-saving than traditional paper dictionaries. Consequently, the construction of a semantic network for vocabulary has emerged as a significant issue for linguists, computational linguists, and natural language processing specialists. To address this, linguists have conducted numerous studies to find methods for structuring and classifying the meanings and concepts of vocabulary. In these studies, specialized terminology is as crucial to constructing vocabulary semantic networks as common language. However, finding and accumulating specialized terminology still involves a manual step in which individuals directly verify and extract specialized terms from paper documents or vast digital datasets. In this paper, we propose an automated program that extracts the specialized terms users want from digital materials, aiming to compensate for errors in human-operated tasks and to streamline the process.

Development of Finite Element Model for Dynamic Characteristics of MEMS Piezo Actuator in Consideration of Semiconductor Process (반도체 공정을 고려한 유한요소해석에 의한 MEMS 압전 작동기의 동특성 해석)

Kim, Dong Woohn;Song, Jonghyeong;An, Seungdo;Woo, Kisuk
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2013.04a / pp.454-459 / 2013
To support rapid development and assure design quality, a sophisticated finite element model of the SOM (Spatial Optical Modulator) piezo actuator of a MOEMS device has been developed and evaluated for the accuracy of its dynamics and residual stress analysis. The parametric finite element model is constructed in the ANSYS APDL language to improve design and analysis performance. The geometric dimensions and mechanical material properties of each thin film layer are input parameters of the FE model, and residual stresses in all thin film layers are simulated by the thermal expansion method with a pseudo process temperature. Samples of the $6^{th}$ mask design are manufactured, and the $1^{st}$ natural frequency and the 10 V PZT driving displacement are measured with an LDV. The experimental results are compared with the simulation and show good agreement, within 5% error, in the $1^{st}$ natural frequency. However, an error of over 30% occurred in the 10 V PZT driving displacement because of insufficient measurement technology for the PZT constant $d_{31}$.

European Integration Processes for the Development of Future Foreign Language Specialists in the Information Society

Lazarenko, Natalia;Zadorozhna, Olga;Prybora, Tetiana;Shevchuk, Andrii;Sulym, Volodymyr;Rudnytska, Nataliya
- International Journal of Computer Science & Network Security / v.21 no.12spc / pp.427-436 / 2021
The article reveals and theoretically substantiates the trends of foreign language teachers' professional training in universities of Ukraine in terms of European integration, which are systematized at three levels: macro-level (system of education), meso-level (universities) and micro-level (subjects of the educational process). The article aims to substantiate the trends of foreign language teacher training in the context of European integration and the main directions for the creative use of constructive ideas from European experience in the innovative development of education. The article highlights a system for improving foreign language teacher training in universities. The system is based on updated goals, content and approaches to implementing the basic concepts, principles and features of teacher training in European experience, and it makes it possible to improve the quality of teacher training and its competitiveness in the European labor market. The article develops a conceptual model of the strategic development of the university under the conditions of European integration. It is emphasized that information technologies provide great opportunities for the development of the professional skills and intellectual potential of future professionals. At present, the computerization of the educational process in higher education institutions is considered one of the first and most promising areas for improving the quality of education. The article proposes directions for the internationalization of the university's educational activity under the conditions of European integration. Diagnostic tools for the development of the university in terms of integration into the European educational space, and individual rating and ranking of structural units of the university, have been developed, as have the main directions of activity of the laboratory of higher school teacher skill and methodical recommendations on creating and organizing the work of scientific laboratories.
https://doi.org/10.22937/IJCSNS.2021.21.12.58

Intelligent Information Retrieval Using Interactive Query Processing Agent (대화형 질의 처리 에이전트를 이용한 지능형 정보검색)

이현영;이기오;한용기
- Journal of the Korea Computer Industry Society / v.4 no.12 / pp.901-910 / 2003
Most commercial retrieval engines accept Boolean queries from users. Although Boolean queries suit retrieval engines that need fast retrieval, it is not easy for users to express their needs with Boolean operators. For this reason, information retrieval systems that accept natural language queries, which are more convenient for users, have been studied for decades. To retrieve documents that match their needs, users have to express those needs correctly, so this thesis proposes an interactive query processing agent that uses natural language. The agent makes the user's needs concrete through gradual interaction with the user. When a user enters a natural language query, the agent analyzes it, generates a Boolean query by selecting appropriate keywords, and reports the state of the selected keywords back to the user. If a keyword has synonyms or is polysemous, the agent expands or restricts the keyword through interaction with the user. This helps the user express needs more concretely and improves system performance. As a result, the agent can improve the precision of information retrieval.
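The keyword selection and synonym expansion described in this abstract can be illustrated with a small sketch. It is not the authors' agent: the stop-word list, the synonym table, and the function names `select_keywords` and `build_boolean_query` are all hypothetical, and the user's interactive answers are modeled as a plain set of keywords to expand.

```python
# Hypothetical stop words and synonym table; a real agent would rely on
# a morphological analyzer and a thesaurus instead.
STOP_WORDS = {"a", "an", "the", "about", "on", "of", "for", "in",
              "find", "show", "me", "papers"}
SYNONYMS = {"car": ["automobile"], "retrieval": ["search"]}


def select_keywords(query):
    """Lowercase the query, strip punctuation, and drop stop words."""
    words = [w.strip(".,?!").lower() for w in query.split()]
    return [w for w in words if w and w not in STOP_WORDS]


def build_boolean_query(keywords, expand):
    """AND the keywords together; OR in synonyms for keywords in `expand`."""
    clauses = []
    for kw in keywords:
        if kw in expand and kw in SYNONYMS:
            clauses.append("(" + " OR ".join([kw] + SYNONYMS[kw]) + ")")
        else:
            clauses.append(kw)
    return " AND ".join(clauses)


keywords = select_keywords("Find papers about car retrieval")
# In an interactive agent, `expand` would come from the user's answers.
print(build_boolean_query(keywords, expand={"car"}))
```

Passing an empty `expand` set leaves every keyword as a plain AND term, which corresponds to the user declining all suggested expansions.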

A Study on Efficient Natural Language Processing Method based on Transformer (트랜스포머 기반 효율적인 자연어 처리 방안 연구)

Seung-Cheol Lim;Sung-Gu Youn
- The Journal of the Institute of Internet, Broadcasting and Communication / v.23 no.4 / pp.115-119 / 2023
The natural language processing models used in current artificial intelligence are huge, which causes various difficulties in processing and analyzing data in real time. To address these difficulties, we propose a method that improves processing efficiency by using less memory, and we check the performance of the proposed model. The technique applied in this paper is to reduce the number of attention heads and the embedding size of the BERT[1] model, split the large corpus into parts, and compute the result by averaging the output values of each forward pass. In this process, a random offset was assigned to the sentences at every epoch to provide diversity in the input data. The model was then fine-tuned for classification. We found that the split processing model was about 12% less accurate than the unsplit model, but the number of parameters in the model was reduced by 56%.
https://doi.org/10.7236/JIIBC.2023.23.4.115
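The split-and-average idea in this abstract can be sketched in a few lines. This is only one possible reading: the abstract does not say how the offset is applied, so shifting the chunk boundaries by `offset` tokens is an assumption, and `toy_forward` is a hypothetical stand-in for a real model's forward pass.

```python
import random


def split_with_offset(tokens, chunk_size, offset):
    """Cut `tokens` into chunks whose boundaries are shifted by `offset`."""
    chunks = [tokens[:offset]] if offset else []
    chunks += [tokens[i:i + chunk_size]
               for i in range(offset, len(tokens), chunk_size)]
    return chunks


def average_outputs(outputs):
    """Element-wise mean of equally sized score vectors."""
    n = len(outputs)
    return [sum(column) / n for column in zip(*outputs)]


def toy_forward(chunk):
    """Hypothetical stand-in for a forward pass: returns two class scores."""
    positive = sum(1 for token in chunk if token == "good")
    return [float(len(chunk) - positive), float(positive)]


tokens = "good film good cast weak plot good score".split()
for epoch in range(2):
    # A fresh offset each epoch changes where the chunk boundaries fall.
    offset = random.randrange(3)
    chunks = split_with_offset(tokens, chunk_size=3, offset=offset)
    print(epoch, average_outputs([toy_forward(c) for c in chunks]))
```

Because each chunk is processed independently, peak memory depends on `chunk_size` rather than on the full input length, which is the trade-off the abstract describes.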

Exploratory Research on Automating the Analysis of Scientific Argumentation Using Machine Learning (머신 러닝을 활용한 과학 논변 구성 요소 코딩 자동화 가능성 탐색 연구)

Lee, Gyeong-Geon;Ha, Heesoo;Hong, Hun-Gi;Kim, Heui-Baik
- Journal of The Korean Association For Science Education / v.38 no.2 / pp.219-234 / 2018
In this study, we explored the possibility of automating the process of analyzing elements of scientific argument in the context of a Korean classroom. To gather training data, we collected 990 sentences from science education journals that illustrate the results of coding elements of argumentation according to Toulmin's argumentation structure framework. We extracted 483 sentences as a test data set from the transcription of students' discourse in scientific argumentation activities. The words and morphemes of each argument were analyzed using the Python 'KoNLPy' package and the 'Kkma' module for Korean Natural Language Processing. After constructing the 'argument-morpheme:class' matrix for 1,473 sentences, five machine learning techniques were applied to generate predictive models relating each sentence to the element of argument with which it corresponded. The accuracy of the predictive models was investigated by comparing them with the results of pre-coding by researchers and confirming the degree of agreement. The predictive model generated by the k-nearest neighbor algorithm (KNN) demonstrated the highest degree of agreement [54.04% (${\kappa}=0.22$)] when machine learning was performed with consideration of the morphemes of each sentence. The predictive model generated by the KNN exhibited higher agreement [55.07% (${\kappa}=0.24$)] when the coding results of the previous sentence were added to the prediction process. In addition, the results indicated the importance of considering the context of discourse by reflecting the codes of previous sentences in the analysis. The results are significant in that they show the possibility of automating the analysis of students' argumentation activities in the Korean language by applying machine learning.
https://doi.org/10.14697/jkase.2018.38.2.219
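The abstract reports both a percent agreement and a ${\kappa}$ value. Assuming ${\kappa}$ here is Cohen's kappa (the abstract does not name the statistic), the two figures relate as in the sketch below. The label sequences are invented for illustration and are not data from the study.

```python
from collections import Counter


def percent_agreement(a, b):
    """Fraction of positions where the two codings assign the same label."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohens_kappa(a, b):
    """Agreement corrected for chance: (p_o - p_e) / (1 - p_e).

    Undefined (division by zero) when both coders use a single label.
    """
    n = len(a)
    p_o = percent_agreement(a, b)
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement: probability both coders pick the same label
    # if each labeled independently with their own label frequencies.
    p_e = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    return (p_o - p_e) / (1 - p_e)


# Invented example codings using Toulmin-style element names.
researcher = ["claim", "data", "claim", "warrant"]
model = ["claim", "claim", "claim", "warrant"]
print(percent_agreement(researcher, model), cohens_kappa(researcher, model))
```

A raw agreement just above 50% can correspond to a low kappa when a few labels dominate, which is why both numbers are worth reporting together.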

Application Development for Text Mining: KoALA (텍스트 마이닝 통합 애플리케이션 개발: KoALA)

Byeong-Jin Jeon;Yoon-Jin Choi;Hee-Woong Kim
- Information Systems Review / v.21 no.2 / pp.117-137 / 2019
In the Big Data era, data science has gained popularity as vast amounts of data are produced in various domains, and the power of data has become a source of competitive advantage. There is a growing interest in unstructured data, which accounts for more than 80% of the world's data. With the everyday use of social media, most unstructured data takes the form of text and plays an important role in areas such as marketing, finance, and distribution. However, text mining of social media is harder to access and to use than data mining of numerical data. Thus, this study aims to develop the Korean Natural Language Application (KoALA), an integrated application for easy and handy social media text mining that does not rely on programming languages or on high-end hardware or solutions. KoALA is a specialized application for social media text mining. It is an integrated application that can analyze both Korean and English. KoALA handles the entire process from data collection to preprocessing, analysis, and visualization. This paper describes the process of designing, implementing, and applying KoALA using the design science methodology. Lastly, we discuss the practical use of KoALA through a blockchain business case. Through this paper, we hope to popularize social media text mining and see it put to practical and academic use in various domains.
https://doi.org/10.14329/isr.2019.21.2.117

A method for metadata extraction from a collection of records using Named Entity Recognition in Natural Language Processing (자연어 처리의 개체명 인식을 통한 기록집합체의 메타데이터 추출 방안)

Chiho Song
- Journal of Korean Society of Archives and Records Management / v.24 no.2 / pp.65-88 / 2024
This pilot study explores a method of extracting metadata values and descriptions from records using named entity recognition (NER), a technique in natural language processing (NLP), a subfield of artificial intelligence. The study focuses on handwritten records from the Guro Industrial Complex, produced during the 1960s and 1970s, comprising approximately 1,200 pages and 80,000 words. After preprocessing the records, which included digitization, the study employed a publicly available language API based on Google's Bidirectional Encoder Representations from Transformers (BERT) language model to recognize entity names within the text. As a result, 173 names of people and 314 names of organizations and institutions were extracted from the Guro Industrial Complex's past records. These extracted entities are expected to serve as direct search terms for accessing the contents of the records. Furthermore, the study identified challenges that arose when applying the theoretical methodology of NLP to real-world records consisting of semistructured text. It also presents potential solutions and implications to consider when addressing these issues.
https://doi.org/10.14404/JKSARM.2024.24.2.065

Performance Comparison and Error Analysis of Korean Bio-medical Named Entity Recognition (한국어 생의학 개체명 인식 성능 비교와 오류 분석)

Jae-Hong Lee
- The Journal of the Korea institute of electronic communication sciences / v.19 no.4 / pp.701-708 / 2024
The advent of transformer architectures in deep learning has been a major breakthrough in natural language processing research. Named entity recognition is a branch of natural language processing and an important research area for tasks such as information retrieval. It is also important in the biomedical field, but the lack of Korean biomedical corpora for training has limited the development of Korean clinical research using AI. In this study, we built a new biomedical corpus for Korean biomedical named entity recognition and selected language models pre-trained on a large Korean corpus for transfer learning. We compared the recognition performance of the selected language models by F1-score and by recognition rate per tag, and analyzed the errors. In terms of recognition performance, KlueRoBERTa performed relatively well. The error analysis of the tagging process shows that recognition performance for Disease is excellent, but that for Body and Treatment is relatively low. This is due to over-segmentation and under-segmentation, which fail to properly categorize entity names based on context; a more precise morphological analyzer and a richer lexicon will be needed to compensate for the incorrect tagging.
https://doi.org/10.13067/JKIECS.2024.19.4.701
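The two measures named in this abstract, F1-score and recognition rate by tag, can be sketched at the entity level as follows. The abstract does not define "recognition rate", so treating it as per-tag recall is an assumption, and the spans below are invented examples using the tag names Disease, Body, and Treatment from the abstract.

```python
def f1_score(gold, predicted):
    """Entity-level F1: a prediction counts only if span and tag both match."""
    true_pos = len(gold & predicted)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(predicted)
    recall = true_pos / len(gold)
    return 2 * precision * recall / (precision + recall)


def recall_by_tag(gold, predicted):
    """Share of gold entities of each tag that were predicted exactly."""
    hits = gold & predicted
    rates = {}
    for tag in {entity[2] for entity in gold}:
        gold_count = sum(1 for entity in gold if entity[2] == tag)
        hit_count = sum(1 for entity in hits if entity[2] == tag)
        rates[tag] = hit_count / gold_count
    return rates


# Entities are (start, end, tag) tuples; values are invented for illustration.
gold = {(0, 2, "Disease"), (5, 6, "Body"), (8, 10, "Treatment")}
# The Body span is one token too long, an over-segmentation style error.
predicted = {(0, 2, "Disease"), (5, 7, "Body"), (8, 10, "Treatment")}
print(f1_score(gold, predicted), recall_by_tag(gold, predicted))
```

Under exact-match scoring, a boundary error such as the Body span above counts as both a missed entity and a spurious one, which is how segmentation errors lower per-tag scores.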

A Study on the Development of Structural Analysis Program using MATLAB Language (MATLAB 언어를 이용한 구조해석 프로그램 개발에 관한 연구)

배동명;강상중
- Journal of the Korean Society of Fisheries and Ocean Technology / v.36 no.4 / pp.347-353 / 2000
The structure and capabilities of a CAE program are presented. The merits and capabilities of MATLAB, which has recently come into wide use in engineering and the natural sciences, are also introduced. In addition, an analysis program for frame structures is presented, written in the MATLAB language, which is classified as a fourth-generation language. Following the composition of a general CAE program, the proposed MATLAB program consists of pre-processing, solver, and post-processing procedures. It can carry out static and eigenvalue analysis of truss structures and two-dimensional frame structures. For simple pre-processing and post-processing, it uses input windows and plot windows built from various GUI functions. Each finite element required for the analysis is formulated by Galerkin's method, a kind of weighted residual method. To check the program's results, the results calculated with the program developed by the authors were compared with those of the ANSYS code for general structural analysis on two-dimensional truss and frame structures.

Search Result 246, Processing Time 0.027 seconds
