• Title/Summary/Keyword: 텍스트 인식

Search Result 779, Processing Time 0.035 seconds

Analyzing Different Contexts for Energy Terms through Text Mining of Online Science News Articles (온라인 과학 기사 텍스트 마이닝을 통해 분석한 에너지 용어 사용의 맥락)

  • Oh, Chi Yeong;Kang, Nam-Hwa
    • Journal of Science Education
    • /
    • v.45 no.3
    • /
    • pp.292-303
    • /
    • 2021
  • This study identifies the terms frequently used together with energy in online science news articles and topics of the news reports to find out how the term energy is used in everyday life and to draw implications for science curriculum and instruction about energy. A total of 2,171 online news articles in science category published by 11 major newspaper companies in Korea for one year from March 1, 2018 were selected by using energy as a search term. As a result of natural language processing, a total of 51,224 sentences consisting of 507,901 words were compiled for analysis. Using the R program, term frequency analysis, semantic network analysis, and structural topic modeling were performed. The results show that the terms with exceptionally high frequencies were technology, research, and development, which reflected the characteristics of news articles that report new findings. On the other hand, terms used more than once per two articles were industry-related terms (industry, product, system, production, market) and terms that were sufficiently expected as energy-related terms such as 'electricity' and 'environment.' Meanwhile, 'sun', 'heat', 'temperature', and 'power generation', which are frequently used in energy-related science classes, also appeared as terms belonging to the highest frequency. From a network analysis, two clusters were found including terms related to industry and technology and terms related to basic science and research. From the analysis of terms paired with energy, it was also found that terms related to the use of energy such as 'energy efficiency,' 'energy saving,' and 'energy consumption' were the most frequently used. Out of 16 topics found, four contexts of energy were drawn including 'high-tech industry,' 'industry,' 'basic science,' and 'environment and health.' The results suggest that the introduction of the concept of energy degradation as a starting point for energy classes can be effective. It also shows the need to introduce high-tech industries or the context of environment and health into energy learning.

Cultural Education Methods for Overseas Koreans Using Classical Narratives: Focusing on Princess Bari and The Tale of Shim Cheong (고전 서사무가를 활용한 재외동포의 문화 교육 방안 연구 - <바리공주>와 <심청전>을 중심으로 -)

  • Kang Myung-ju
    • Journal of the Daesoon Academy of Sciences
    • /
    • v.47
    • /
    • pp.173-202
    • /
    • 2023
  • In this study, we delve into the potential for innovative cultural education techniques that utilize the timeless tales of Princess Bari and The Tale of Shim Cheong as tailored for the upcoming generations of overseas Korean learners. With a rising number of young overseas Koreans born and raised in their host countries, there emerges a pressing need to craft an educational framework that resonates with the evolving dynamics of their generation. Our endeavor revolves around proposing educational strategies that help solidify identity while carefully considering the intrinsic motivation prevalent among most overseas Koreans. Naturally, the choice of employing the classic epics Princess Bari and The Tale of Shim Cheong as educational resources was deliberate. These narratives are rich in rites of passage and offer profound insights into the transformative journey of their protagonists. Both characters are affluent women in patriarchal societies, and both embark on quests to redefine themselves through new relationships, liberating themselves from the confines of parental ties. This narrative framework provides a unique opportunity for overseas Koreans who are often adrift in the social fabric of their adopted countries. These stories inspire them to introspect and contemplate their own identities. By intertwining their personal narratives with the empowering stories of characters, students are provided a chance to reaffirm their authentic selves. Therein, a paradigm shift can occur that allows individuals to embrace the core elements that define them. Our ultimate objective was to enable students to explore their own stories and immerse themselves in the intricate narratives of classical works. This immersive experience fosters a profound sense of unity with the characters and paves the way for a comprehensive educational plan. This plan not only celebrates the hybrid nature of identity but also cultivates a deep sense of positivity within amalgamated 'subjects.' Such an approach not only fosters a stronger connection with one's heritage but also sparks a genuine curiosity about and affinity for the rich cultural tapestry of one's home country. It's not just education; it's a transformative journey that enriches the lives of overseas Koreans and nurtures a profound bond with their cultural roots.

Korean Sentence Generation Using Phoneme-Level LSTM Language Model (한국어 음소 단위 LSTM 언어모델을 이용한 문장 생성)

  • Ahn, SungMahn;Chung, Yeojin;Lee, Jaejoon;Yang, Jiheon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.71-88
    • /
    • 2017
  • Language models were originally developed for speech recognition and language processing. Using a set of example sentences, a language model predicts the next word or character based on sequential input data. N-gram models have been widely used but this model cannot model the correlation between the input units efficiently since it is a probabilistic model which are based on the frequency of each unit in the training set. Recently, as the deep learning algorithm has been developed, a recurrent neural network (RNN) model and a long short-term memory (LSTM) model have been widely used for the neural language model (Ahn, 2016; Kim et al., 2016; Lee et al., 2016). These models can reflect dependency between the objects that are entered sequentially into the model (Gers and Schmidhuber, 2001; Mikolov et al., 2010; Sundermeyer et al., 2012). In order to learning the neural language model, texts need to be decomposed into words or morphemes. Since, however, a training set of sentences includes a huge number of words or morphemes in general, the size of dictionary is very large and so it increases model complexity. In addition, word-level or morpheme-level models are able to generate vocabularies only which are contained in the training set. Furthermore, with highly morphological languages such as Turkish, Hungarian, Russian, Finnish or Korean, morpheme analyzers have more chance to cause errors in decomposition process (Lankinen et al., 2016). Therefore, this paper proposes a phoneme-level language model for Korean language based on LSTM models. A phoneme such as a vowel or a consonant is the smallest unit that comprises Korean texts. We construct the language model using three or four LSTM layers. Each model was trained using Stochastic Gradient Algorithm and more advanced optimization algorithms such as Adagrad, RMSprop, Adadelta, Adam, Adamax, and Nadam. Simulation study was done with Old Testament texts using a deep learning package Keras based the Theano. After pre-processing the texts, the dataset included 74 of unique characters including vowels, consonants, and punctuation marks. Then we constructed an input vector with 20 consecutive characters and an output with a following 21st character. Finally, total 1,023,411 sets of input-output vectors were included in the dataset and we divided them into training, validation, testsets with proportion 70:15:15. All the simulation were conducted on a system equipped with an Intel Xeon CPU (16 cores) and a NVIDIA GeForce GTX 1080 GPU. We compared the loss function evaluated for the validation set, the perplexity evaluated for the test set, and the time to be taken for training each model. As a result, all the optimization algorithms but the stochastic gradient algorithm showed similar validation loss and perplexity, which are clearly superior to those of the stochastic gradient algorithm. The stochastic gradient algorithm took the longest time to be trained for both 3- and 4-LSTM models. On average, the 4-LSTM layer model took 69% longer training time than the 3-LSTM layer model. However, the validation loss and perplexity were not improved significantly or became even worse for specific conditions. On the other hand, when comparing the automatically generated sentences, the 4-LSTM layer model tended to generate the sentences which are closer to the natural language than the 3-LSTM model. Although there were slight differences in the completeness of the generated sentences between the models, the sentence generation performance was quite satisfactory in any simulation conditions: they generated only legitimate Korean letters and the use of postposition and the conjugation of verbs were almost perfect in the sense of grammar. The results of this study are expected to be widely used for the processing of Korean language in the field of language processing and speech recognition, which are the basis of artificial intelligence systems.

The Aspects of Change of Sijo (시조의 변이 양상)

  • Kang Myeoung-Hye
    • Sijohaknonchong
    • /
    • v.24
    • /
    • pp.5-46
    • /
    • 2006
  • Korean verse has flexibly changed its form and contents according to the historical background of the times. This fact arouses reader sympathy because it has reflected ideas, historical aspects and realities of the times. However, korean verse has kept its own characteristics in some ways, allowing it to exist today. It holds its form as 3 verses of three by three or four meter and three letters of the last of three verses. It makes every different version which has specific aspects of each times in the same 'sijo' area. 'Sijo' in Korean poems, is the first form that has been changed from formal to private functionally. As a result of that common verses in the Goryeo to Joseon eras were going with the stream of the times. Verse was the plate for justice so that there was no double meaning, symbols, or technical sentences. It had to show the idea of Myungchundo Jwonginryun. The theme was commonly fitted within certain areas. such as blessings, fidelity, devotion, etc. Around the end of the Joseon era, there was activation of private verses - a form of sijo with no restrictions on the length of the first two verses. Some ideas had been changed because Sarimpa gained power, domestic conflict, and the introduction of practical science. These things had an effect on the form of Sijo. After all, it shows the ideas of collapsing feudalism, resistance of confucian ideas, equality of the sexes, and opposition to the group who rule the government. Thus Sasul Sijo seems to have the tendency of resistance to reality. It was a specialty of realism poetry It explained our life in detail and reflected real life by being an intermediary of realism. This met and represented the demand of a reader's expectations. After 1905, there was new form of sijo that is very different, in form and content, from the previous versions. It was even different in areas of what people accepted. They started to think sijo was not the form of lyrical verse that is once was. It became a 'record of reading'. The form changed to 'hung or huhung' that satirized the times and the ending of a word in the last verse. Although this form could deliver the tension in statement, it was too iu from the original form. Therefore, it didn't last long, and its position got smaller because of the free verse that had western influence and was emerging in the times. In the middle of 1920, there was a movement of Sijo revival. It was lead by Choinamsun. He wrote poems and Sijo which were effected by western ideas in his early works. Although he worked with that, he took the lead in the movement of Sijo revival. He published the collection of Sijo $\ulcorner$Baekpalbunnwoi$\lrcorner$ that has one major theme-patriotic sentiment. He thought an ancient poem was a part of racial characteristics so that he expressed the main theme which represented the times and situations of his era. Modern Sijo is difficult. Sijo has to have modern and Korean verse characteristics at the same time. If it considers a modern aspect too much, it could not be distinguished from sijo and free verse. If it overly leans toward Sijo. it would seem to be too conservative which it then could be said to have no real charm of a poem. In spite of these problems, it is written constantly, because it has its own specialty. It has been focused on some works because they reflect awareness of modern times, the democratic idea, and realism. Overall, the authors of Modern Sijo express various themes by using different forms. The more what we can guess in this work, Sijo will exist permanently because of its flexibility. Furthermore, one special characteristic-flexibility of the korean verse will make it last forever and it will be a genre in Korean poetry.

  • PDF

A Study on the characteristics of realities and fantasy, portrayed in the Russian animation works from 1960's to the beginning of 1980's (1960-1980년대 초반 사회, 문화적 상황과 관련해 본 러시아 애니메이션의 변화 연구)

  • Lee, Hye-Seung
    • Cartoon and Animation Studies
    • /
    • s.15
    • /
    • pp.29-47
    • /
    • 2009
  • The changes in the field of high tech media promote the development of animation films, which was considered once as a decaying industry. A large success of Disney animation films in 1980's and the possibilities of animation as an economically profitable mass products allowed this art form to play a leading role in mass culture. But, the cultural and philosophical aspects of animation works are not studied enough up to this time, despite its importance. This article is focused on the study of animation as a serious cultural and philosophical text. The object of research is the Russian animation in the period of 1960-1980 years. In this time, new trends are noticed in the history of Russian animation : aesthetical experiments in style and subjects became possible since the society freed from totalitarian atmosphere after the political destalinization by Khrushchev. In addition to, it was the time when the system of state subsidies still functioned, that animation was not the object of cultural industry yet, as it happened in the period of Perestroika. In this condition, lots of short animation films, which were remarkable not only in the context of Soviet art culture, but also in the history of world animation films, were produced. This article proposes to analyze the characteristics of realities and fantasy, portrayed in the films of this period, and examine the role and status of animation films in the social-cultural context.

  • PDF

A Study on Current State of Web Content Accessibility on General Hospital Websites in Korea (국내 종합병원의 웹 접근성 실태에 관한 연구)

  • Kim, Yong-Seob;Oh, Kun-Seok
    • Journal of Internet Computing and Services
    • /
    • v.11 no.3
    • /
    • pp.87-103
    • /
    • 2010
  • In the study, we introduce the trend in domestic and foreign web accessibility, as well as the legal system that ensures web accessibility. Based on Korean Web Content Accessibility Guidelines (KWCAG)1.0, we investigated the web content accessibility of 80 tertiary health-care hospitals and general hospitals in Korea. We evaluated accessibility by combining accessibility-based criteria (ABC) with usability-based criteria (UBC). ABC was limited to an alternative text for Guideline 1, using a small number of frames and keyboard accessibility for Guideline 2. UBC checked the voice service (TTS), resizing text, providing multi-lingual websites, and disclosing web accessibility policy. KADO-WAH2.0 was used for representing the compliance rate. The evaluation result was a considerable improvement from previous results, even though the rate of compliance with web accessibility was generally insufficient. There was a significant difference between those medical centers which did and did not comply with web accessibility. Incidentally, many hospitals were found to have attempted to confront and come to terms with web accessibility. In future, the following factors are advisable for medical centers with publicity or public interest: they must employ active and aggressive promotion of establishment of independent accessibility guidelines to secure web accessibility, they should effect an improvement of the realization of web accessibility, there can be constant education and promotion, and there can be an institutional supplementation, as well as others.

The Press Coverage of the Cyber Defamation Laws: Framing Effects of Core Values and Attributional Patterns (사이버모욕죄 보도의 프레이밍 효과: 핵심 가치와 귀인 양식을 중심으로)

  • Hur, Suk-Jae;Min, Young
    • Korean journal of communication and information
    • /
    • v.52
    • /
    • pp.48-68
    • /
    • 2010
  • In covering the controversies surrounding the so-called cyber defamation laws, the Korean press offered competitive frames in terms of values (security vs. freedom of speech) and attributional patterns (episodic vs. thematic attribution). By attending to core values and attributional patterns as two essential components of news frames, this study explored the cognitive and affective processes of value and attributional framing and their effects on issue opinion. According to a 3-group online experiment, first, it was found that core values increased the perceived importance of relevant beliefs, which further affected individuals' attitudes toward the laws. The affective effects of core values were also found marginally significant. The value of security increased the intensity of anger toward deviant netizens (so-called defamatory repliers), and it further increased individuals' support for the laws. It was not substantiated, however, that individualistic attribution, than social attribution, would provoke stronger anger toward defamatory repliers. Instead, episodic frames appeared to be more effective in driving issue opinion as indicated by the value frame.

  • PDF

Analysis of Consumer Awareness of Cycling Wear Using Web Mining (웹마이닝을 활용한 사이클웨어 소비자 인식 분석)

  • Kim, Chungjeong;Yi, Eunjou
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.5
    • /
    • pp.640-649
    • /
    • 2018
  • This study analyzed the consumer awareness of cycling wear using web mining, one of the big data analysis methods. For this, the texts of postings and comments related to cycling wear from 2006 to 2017 at Naver cafe, 'people who commute by bicycle' were collected and analyzed using R packages. A total of 15,321 documents were used for data analysis. The keywords of cycling wear were extracted using a Korean morphological analyzer (KoNLP) and converted to TDM (Term Document Matrix) and co-occurrence matrix to calculate the frequency of the keywords. The most frequent keyword in cycling wear was 'tights', including the opinion that they feel embarrassed because they are too tight. When they purchase cycling wear, they appeared to consider 'price', 'size', and 'brand'. Recently 'low price' and 'cost effectiveness' have become more frequent since 2016 than before, which indicates that consumers tend to prefer practical products. Moreover, the findings showed that it is necessary to improve not only the design and wearability, but also the material functionality, such as sweat-absorbance and quick drying, and the function of pad. These showed similar results to previous studies using a questionnaire. Therefore, it is expected to be used as an objective indicator that can be reflected in product development by real-time analysis of the opinions and requirements of consumers using web mining.

Analysis method of patent document to Forecast Patent Registration (특허 등록 예측을 위한 특허 문서 분석 방법)

  • Koo, Jung-Min;Park, Sang-Sung;Shin, Young-Geun;Jung, Won-Kyo;Jang, Dong-Sik
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.11 no.4
    • /
    • pp.1458-1467
    • /
    • 2010
  • Recently, imitation and infringement rights of an intellectual property are being recognized as impediments to nation's industrial growth. To prevent the huge loss which comes from theses impediments, many researchers are studying protection and efficient management of an intellectual property in various ways. Especially, the prediction of patent registration is very important part to protect and assert intellectual property rights. In this study, we propose the patent document analysis method by using text mining to predict whether the patent is registered or rejected. In the first instance, the proposed method builds the database by using the word frequencies of the rejected patent documents. And comparing the builded database with another patent documents draws the similarity value between each patent document and the database. In this study, we used k-means which is partitioning clustering algorithm to select criteria value of patent rejection. In result, we found conclusion that some patent which similar to rejected patent have strong possibility of rejection. We used U.S.A patent documents about bluetooth technology, solar battery technology and display technology for experiment data.

Awareness of Reality and Tradition in Oh Yun's Theory of Arts during His Final Period(1984~86) - Review on the Text of "Expansion of Artistic Imagination and World" (오윤의 말기(1984~86) 예술론에서의 현실과 전통 인식 - "미술적 상상력과 세계의 확대"에 대한 텍스트 검토)

  • Park, Ca-Rey
    • The Journal of Art Theory & Practice
    • /
    • no.6
    • /
    • pp.101-121
    • /
    • 2008
  • An artist, Oh Yun(1946~86)'s theory of people's art during his final period is summed up in his essay 'Expansion of Artistic Imagination and World' (1985). Emphasizing the mystic and traditional characteristics of Oh Yun's artistic oeuvre during his final period, some critics focus on Oh Yun's experience of medical treatment and shamanistic custom at Jin Do island, and his belief in Jeung San Do, the dao of Jeung-san, the Ruler of the Universe. However, they forget the practical intention and implication of his theory of art during his final period, which aimed to overcome the contradiction of revelation itself. Oh Yun's essay criticized the loss of artistic imagination and the ignorance of traditional culture that resulted from the elevation of science to a religion, and insisted that the stereotyped idealism, scientism and elitism in art should be overcome in order to recover the full reality in realism and to continue traditional cultures. The essay is comprised of 18 paragraphs. Oh Yun criticized monochromatic art, conceptual art, hyper-realistic art, objet d'art, and neo-dadaist art, saying that they were simply mechanical forms of modern art derived from scientism and a fetishistic lens culture. In addition, he criticized naturalism in art, which had continued as a tendency in the development of western art, for the same reason. He pointed out that even the world of realism had been diminished by elite stereotypes and diagrams. He declared the need to overcome the imitation of shells or stereotyped propaganda, and recover full realism, which seems to have started with a reflective examination of current problems in 'Reality and Utterance', in which he participated. Especially, he thought that universality and the extension of full realism could be achieved by building on the views of traditional cultures, which is meaningful. This logic is same as the theory of epic theatre that Bertolt Brecht(1898~1956) has developed under the ancient Greek masque and Pieter Bruegel the Elder(1525~69)'s story-like picture style. The universality of realism and the extension of acquisition to include incantation art, rather than move toward incantation art, is what Oh Yun intended to propose in 'Artistic Imagination'. This attitude is same as Bertoh Brecht's aesthetic viewpoint in the 1930s. But regrettably, Oh Yun's style wording, which seems covert and far-sighted, is often misunderstood as 'mysticism'. In the flow of people's art in the 1980s, Oh Yun was a traditionalist in a narrow sense, and an realist in a broad sense. However, his critical mind, which comprehends tradition and reality, was attempting to expand universality and extend full realism, and this attempt found many sympathizers and had an influence on the next generation of people's artists, such as "Levee" which is field-centered, to which we should pay attention. This means that while their works thought about 'tradition', we should be careful not to connect them with 'aesthetic conservatism' or 'classical art'. This is the why the meaning of Oh Yun's theory of art during his final period should be closely examined again.

  • PDF