• Title/Summary/Keyword: 자체 학습

Search Result 580, Processing Time 0.027 seconds

Semantic Visualization of Dynamic Topic Modeling (다이내믹 토픽 모델링의 의미적 시각화 방법론)

  • Yeon, Jinwook;Boo, Hyunkyung;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.131-154
    • /
    • 2022
  • Recently, researches on unstructured data analysis have been actively conducted with the development of information and communication technology. In particular, topic modeling is a representative technique for discovering core topics from massive text data. In the early stages of topic modeling, most studies focused only on topic discovery. As the topic modeling field matured, studies on the change of the topic according to the change of time began to be carried out. Accordingly, interest in dynamic topic modeling that handle changes in keywords constituting the topic is also increasing. Dynamic topic modeling identifies major topics from the data of the initial period and manages the change and flow of topics in a way that utilizes topic information of the previous period to derive further topics in subsequent periods. However, it is very difficult to understand and interpret the results of dynamic topic modeling. The results of traditional dynamic topic modeling simply reveal changes in keywords and their rankings. However, this information is insufficient to represent how the meaning of the topic has changed. Therefore, in this study, we propose a method to visualize topics by period by reflecting the meaning of keywords in each topic. In addition, we propose a method that can intuitively interpret changes in topics and relationships between or among topics. The detailed method of visualizing topics by period is as follows. In the first step, dynamic topic modeling is implemented to derive the top keywords of each period and their weight from text data. In the second step, we derive vectors of top keywords of each topic from the pre-trained word embedding model. Then, we perform dimension reduction for the extracted vectors. Then, we formulate a semantic vector of each topic by calculating weight sum of keywords in each vector using topic weight of each keyword. In the third step, we visualize the semantic vector of each topic using matplotlib, and analyze the relationship between or among the topics based on the visualized result. The change of topic can be interpreted in the following manners. From the result of dynamic topic modeling, we identify rising top 5 keywords and descending top 5 keywords for each period to show the change of the topic. Existing many topic visualization studies usually visualize keywords of each topic, but our approach proposed in this study differs from previous studies in that it attempts to visualize each topic itself. To evaluate the practical applicability of the proposed methodology, we performed an experiment on 1,847 abstracts of artificial intelligence-related papers. The experiment was performed by dividing abstracts of artificial intelligence-related papers into three periods (2016-2017, 2018-2019, 2020-2021). We selected seven topics based on the consistency score, and utilized the pre-trained word embedding model of Word2vec trained with 'Wikipedia', an Internet encyclopedia. Based on the proposed methodology, we generated a semantic vector for each topic. Through this, by reflecting the meaning of keywords, we visualized and interpreted the themes by period. Through these experiments, we confirmed that the rising and descending of the topic weight of a keyword can be usefully used to interpret the semantic change of the corresponding topic and to grasp the relationship among topics. In this study, to overcome the limitations of dynamic topic modeling results, we used word embedding and dimension reduction techniques to visualize topics by era. The results of this study are meaningful in that they broadened the scope of topic understanding through the visualization of dynamic topic modeling results. In addition, the academic contribution can be acknowledged in that it laid the foundation for follow-up studies using various word embeddings and dimensionality reduction techniques to improve the performance of the proposed methodology.

Contactless Data Society and Reterritorialization of the Archive (비접촉 데이터 사회와 아카이브 재영토화)

  • Jo, Min-ji
    • The Korean Journal of Archival Studies
    • /
    • no.79
    • /
    • pp.5-32
    • /
    • 2024
  • The Korean government ranked 3rd among 193 UN member countries in the UN's 2022 e-Government Development Index. Korea, which has consistently been evaluated as a top country, can clearly be said to be a leading country in the world of e-government. The lubricant of e-government is data. Data itself is neither information nor a record, but it is a source of information and records and a resource of knowledge. Since administrative actions through electronic systems have become widespread, the production and technology of data-based records have naturally expanded and evolved. Technology may seem value-neutral, but in fact, technology itself reflects a specific worldview. The digital order of new technologies, armed with hyper-connectivity and super-intelligence, not only has a profound influence on traditional power structures, but also has an a similar influence on existing information and knowledge transmission media. Moreover, new technologies and media, including data-based generative artificial intelligence, are by far the hot topic. It can be seen that the all-round growth and spread of digital technology has led to the augmentation of human capabilities and the outsourcing of thinking. This also involves a variety of problems, ranging from deep fakes and other fake images, auto profiling, AI lies hallucination that creates them as if they were real, and copyright infringement of machine learning data. Moreover, radical connectivity capabilities enable the instantaneous sharing of vast amounts of data and rely on the technological unconscious to generate actions without awareness. Another irony of the digital world and online network, which is based on immaterial distribution and logical existence, is that access and contact can only be made through physical tools. Digital information is a logical object, but digital resources cannot be read or utilized without some type of device to relay it. In that respect, machines in today's technological society have gone beyond the level of simple assistance, and there are points at which it is difficult to say that the entry of machines into human society is a natural change pattern due to advanced technological development. This is because perspectives on machines will change over time. Important is the social and cultural implications of changes in the way records are produced as a result of communication and actions through machines. Even in the archive field, what problems will a data-based archive society face due to technological changes toward a hyper-intelligence and hyper-connected society, and who will prove the continuous activity of records and data and what will be the main drivers of media change? It is time to research whether this will happen. This study began with the need to recognize that archives are not only records that are the result of actions, but also data as strategic assets. Through this, author considered how to expand traditional boundaries and achieves reterritorialization in a data-driven society.

Development of the Efficiency-Evaluation Model for the Mechanism of CO2 Sequestration in a Deep Saline Aquifer (심부 대염수층 CO2 격리 메커니즘에 관한 효율성 평가 모델 개발)

  • Kim, Jung-Gyun;Lee, Young-Soo;Lee, Jeong-Hwan
    • Journal of the Korean Institute of Gas
    • /
    • v.16 no.6
    • /
    • pp.55-66
    • /
    • 2012
  • The practical way to minimize the greenhouse gas is to reduce the emission of carbon dioxide. For this reason, CCS(Carbon Capture and Storage) technology, which could reduce carbon dioxide emission, has risen as a realistic alternative in recent years. In addition, the researcher is recently working into ways of applying CCS technologies with deep saline aquifer. In this study, the evaluation model on the feasibility of $CO_2$ sequestration in the deep saline aquifer using ANN(Artificial Neural Network) was developed. In order to develop the efficiency-evaluation model, basic model was created in the deep saline aquifer and sensitivity analysis was performed for the aquifer characteristics by utilizing the commercial simulator of GEM. Based on the sensitivity analysis, the factors and ranges affecting $CO_2$ sequestration in the deep saline aquifer were chosen. The result from ANN training scenario were confirmed $CO_2$ sequestration by solubility trapping and residual trapping mechanism. The result from ANN model evaluation indicated there is the increase of correlation coefficient up to 0.99. It has been confirmed that the developed model can be utilized in feasibility of $CO_2$ sequestration at deep saline aquifer.

A Benchmark of Open Source Data Mining Package for Thermal Environment Modeling in Smart Farm(R, OpenCV, OpenNN and Orange) (스마트팜 열환경 모델링을 위한 Open source 기반 Data mining 기법 분석)

  • Lee, Jun-Yeob;Oh, Jong-wo;Lee, DongHoon
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • 2017.04a
    • /
    • pp.168-168
    • /
    • 2017
  • ICT 융합 스마트팜 내의 환경계측 센서, 영상 및 사양관리 시스템의 증가에도 불구하고 이들 장비에서 확보되는 데이터를 적절히 유효하게 활용하는 기술이 미흡한 실정이다. 돈사의 경우 가축의 복지수준, 성장 변화를 실시간으로 모니터링 및 예측할 수 있는 데이터 분석 및 모델링 기술 확보가 필요하다. 이를 위해선 가축의 생리적 변화 및 행동적 변화를 조기에 감지하고 가축의 복지수준을 실시간으로 감시하고 분석 및 예측 기술이 필요한데 이를 위한 대표적인 정보 통신 공학적 접근법 중에 하나가 Data mining 이다. Data mining에 대한 연구 수행에 필요한 다양한 소프트웨어 중에서 Open source로 제공이 되는 4가지 도구를 비교 분석하였다. 스마트 돈사 내에서 열환경 모델링을 목표로 한 데이터 분석에서 고려해야할 요인으로 데이터 분석 알고리즘 도출 시간, 시각화 기능, 타 라이브러리와 연계 기능 등을 중점 적으로 분석하였다. 선정된 4가지 분석 도구는 1) R(https://cran.r-project.org), 2) OpenCV(http://opencv.org), 3) OpenNN (http://www.opennn.net), 4) Orange(http://orange.biolab.si) 이다. 비교 분석을 수행한 운영체제는 Linux-Ubuntu 16.04.4 LTS(X64)이며, CPU의 클럭속도는 3.6 Ghz, 메모리는 64 Gb를 설치하였다. 개발언어 측면에서 살펴보면 1) R 스크립트, 2) C/C++, Python, Java, 3) C++, 4) C/C++, Python, Cython을 지원하여 C/C++ 언어와 Python 개발 언어가 상대적으로 유리하였다. 데이터 분석 알고리즘의 경우 소스코드 범위에서 라이브러리를 제공하는 경우 Cross-Platform 개발이 가능하여 여러 운영체제에서 개발한 결과를 별도의 Porting 과정을 거치지 않고 사용할 수 있었다. 빌트인 라이브러리 경우 순서대로 R 의 경우 가장 많은 수의 Data mining 알고리즘을 제공하고 있다. 이는 R 운영 환경 자체가 개방형으로 되어 있어 온라인에서 추가되는 새로운 라이브러리를 클라우드를 통하여 공유하기 때문인 것으로 판단되었다. OpenCV의 경우 영상 처리에 강점이 있었으며, OpenNN은 신경망학습과 관련된 라이브러리를 소스코드 레벨에서 공개한 것이 강점이라 할 수 있다. Orage의 경우 라이브러리 집합을 제공하는 것에 중점을 둔 다른 패키지와 달리 시각화 기능 및 망 구성 등 사용자 인터페이스를 통합하여 운영한 것이 강점이라 할 수 있다. 열환경 모델링에 요구되는 시간 복잡도에 대응하기 위한 부가 정보 처리 기술에 대한 연구를 수행하여 스마트팜 열환경 모델링을 실시간으로 구현할 수 있는 방안 연구를 수행할 것이다.

  • PDF

A Study on Lifestyle as a Basic Curriculum for Design Education (디자인교육을 위한 기초교과목으로서 라이프스타일 연구)

  • Mun, Keum-Hi
    • Archives of design research
    • /
    • v.19 no.1 s.63
    • /
    • pp.171-180
    • /
    • 2006
  • Modern design has to be satisfied to user in various social class. Therefore a study of lifestyle is vary important as a basic of design. The logical starting point to grasp the user's needs is begun by consumer analysis. There are many methods of consumer analysis, In them, the study of lifestyle is extremely much used for understanding consumer who is classified with groups. Basic education of design have to teach knowledge and ability which is to be a groundwork for designer. The study of lifestyle is not a purpose itself but one of theoretical knowledge which is requested for design. By induding practical techniques and theory, theory can be achieved through practical techniques and the meaning of theory itself could be recalled. The value of this study is in the suggestion which is the way of approach to research and to study for oneself, by introduce the study through conceptual approach for lifestyle into basic design educational program. For the study of lifestyle, let students have the theoretical knowledge which is concept of lifestyle, modern social form, consumer culture, the way of using lifestyle and theorizing lifestyle, etc, And after, let students make teams for the study of lifestyle for oneself. Every team creates the subject and studies lifestyle of each social dass by the practical way such as Inquiry, observation and interview etc. As a result they have to make a report after the form of paper and let them have a presentation in each team. Students can understand lifestyle through this process not only theoretical but also practical and can be earned an ability for apply to business.

  • PDF

A Study about the Practices of Teachers Who Changed the Subject to Mathematics Based on Their Belief (과목변경수학교사의 신념에 따른 교수 실제에 관한 연구)

  • Kim, Soo Sun;Choi-Koh, Sang Sook
    • Communications of Mathematical Education
    • /
    • v.29 no.3
    • /
    • pp.373-389
    • /
    • 2015
  • This study was to investigate the practice of the teachers who changed their teaching subject to Mathematics from other subjects. Teacher, A who had traditional belief and Teacher, B, non-traditional belief were chosen for the study through the questionnaire in Sep. 2014. The result indicated that Teacher, A in traditional belief showed teacher-centered teaching but Teacher, B in nontraditional belief showed inconsistent way of teaching in comparison to the original perspective. The later said she could not teach students as she wanted to teach because of the lack of knowledge of teaching as a math teacher. The difficulties Teacher, A encountered were: to handle too many works beyond teaching and to teach too many contents to cover without having enough time to prepare. Teacher, B didn't know how to teach students math in a constructivism way. They asked to offer them more in-service training program to develop their expertise for teaching mathematics.

A Design of Multiplication Unit of Elementary Mathematics Textbook by Making the Best Use of Diversity of Algorithm (알고리즘의 다양성을 활용한 두 자리 수 곱셈의 지도 방안과 그에 따른 초등학교 3학년 학생의 곱셈 알고리즘 이해 과정 분석)

  • Kang, Heung-Kyu;Sim, Sun-Young
    • Journal of Elementary Mathematics Education in Korea
    • /
    • v.14 no.2
    • /
    • pp.287-314
    • /
    • 2010
  • The algorithm is a chain of mechanical procedures, capable of solving a problem. In modern mathematics educations, the teaching algorithm is performing an important role, even though contracted than in the past. The conspicuous characteristic of current elementary mathematics textbook's manner of manipulating multiplication algorithm is exceeding converge to 'standard algorithm.' But there are many algorithm other than standard algorithm in calculating multiplication, and this diversity is important with respect to didactical dimension. In this thesis, we have reconstructed the experimental learning and teaching plan of multiplication algorithm unit by making the best use of diversity of multiplication algorithm. It's core contents are as follows. Firstly, It handled various modified algorithms in addition to standard algorithm. Secondly, It did not order children to use standard algorithm exclusively, but encouraged children to select algorithm according to his interest. As stated above, we have performed teaching experiment which is ruled by new lesson design and analysed the effects of teaching experiment. Through this study, we obtained the following results and suggestions. Firstly, the experimental learning and teaching plan was effective on understanding of the place-value principle and the distributive law. The experimental group which was learned through various modified algorithm in addition to standard algorithm displayed higher degree of understanding than the control group. Secondly, as for computational ability, the experimental group did not show better achievement than the control group. It's cause is, in my guess, that we taught the children the various modified algorithm and allowed the children to select a algorithm by preference. The experimental group was more interested in diversity of algorithm and it's application itself than correct computation. Thirdly, the lattice method was not adopted in the majority of present mathematics school textbooks, but ranked high in the children's preference. I suggest that the mathematics school textbooks which will be developed henceforth should accept the lattice method.

  • PDF

A Study on the Recognition and Satisfaction of Food Styling for Cooks in a Deluxe Hotel (특급호텔 조리사의 푸드 스타일링 인지 및 만족도에 관한 연구)

  • Kim, Byung-Hee;Shin, Mee-Hye;Kang, Kun-Og
    • Culinary science and hospitality research
    • /
    • v.17 no.3
    • /
    • pp.76-88
    • /
    • 2011
  • This study surveyed the recognition and satisfaction of food styling for cooks in a deluxe hotel. The most important item for cooks while cooking was 'taste' (57.7%) followed by 'sanitation' (25.9%). For food styling education in hotels, only 35.5% of the hotels surveyed had their own education program, and the media was the main source of food styling information (54.9%). Also, chefs studied food styling(49.6% of respondents) by themselves, for 'personal development'(73.8%). Average scale of recognition of food trends was $3.35{\pm}.83$, and that of the principles of food styling was $3.41{\pm}.84$. There were the significant differences in terms of the recognition of food styling in the age groups, cooking experience, and the positions. Moreover, cooks' overall satisfaction with food styling was found low with the average of $2.90{\pm}.89$. For the satisfaction with food styling based on the general characteristics, there was a significant difference in 'positions' (p<0.1) and 'education' (p<0.1), which suggested that the higher positions and education, the greater satisfaction with food styling.

  • PDF

Comparative Analysis on Selecting Process of Gifted students in Science Education Institute for the Gifted (우리나라 대학부설 과학영재교육원의 영재학생 선발과정에 대한 비교 분석)

  • Jang, Nak-Han;Ryoo, Jin-Wo;Ryu, Hai-Il
    • Journal of Gifted/Talented Education
    • /
    • v.16 no.2
    • /
    • pp.101-122
    • /
    • 2006
  • In this study, selecting process for scientifically gifted students was investigated and analyzed to science education institute for the gifted in university and was compared with foreign education institute for the gifted. The number of science education institute for the gifted was unequality located to the regions, comparing with the ratio of students. Therefore it is necessary for balancing location of science education institute for the gifted according to the ratio of students to let various students have the gifted education. Comparing with the ratio of gifted students between elementary school students and middle school students, the shape of ratios seems to be a inversed-pyramid. It is necessary to select students with the shape of the pyramid, supporting them to be educated, systematically and officially. The gifted students were selected through the procedures of creative test, intelligent test and individual interview in science education institute

A Top-Down Approach to the Hardware Design Education Focusing on the Logic Design Courses (하드웨어 설계 교육에서의 TOP-DOWN 접근방법 : 논리설계 과목을 중심으로)

  • Yi Kang;Jung Kyeong-Hoon;Han Youn-Sik
    • Journal of Engineering Education Research
    • /
    • v.6 no.2
    • /
    • pp.22-29
    • /
    • 2003
  • The ultimate goal of a hardware design course is to equip the students with the system design ability. However, the majority of the current structures of the design courses are focused on the understanding of the operational principles of each device which is used later as a building block for the design of a system. The shortcomings of this approach are, first, that it is very hard to keep the students motivated to the end of the course where system design concepts are dealt, and, second, the students do not have enough experience of the system design which is usually required in the field. As an alternative to solve these problems, it is necessary to reverse the order of contents of the course. Namely we introduce the high level of the abstract concept of the system design in the very beginning of the course and later by lowering the level of abstraction to the operational principle of the internal devices. In this paper, we propose a new top-down methodology for the introductory hardware design course of logic design, where the design expression and verification in the system-level are introduced first and then detail knowledge on each device is introduced later. Also, we report a case result from a student's working group as part of an extracurricular education in order to verify the validity of our proposed approach