• Title/Summary/Keyword: Information processing knowledge

Search Result 1,093, Processing Time 0.027 seconds

A Document Collection Method for More Accurate Search Engine (정확도 높은 검색 엔진을 위한 문서 수집 방법)

  • Ha, Eun-Yong;Gwon, Hui-Yong;Hwang, Ho-Yeong
    • The KIPS Transactions:PartA
    • /
    • v.10A no.5
    • /
    • pp.469-478
    • /
    • 2003
  • Internet information search engines using web robots visit servers conneted to the Internet periodically or non-periodically. They extract and classify data collected according to their own method and construct their database, which are the basis of web information search engines. There procedure are repeated very frequently on the Web. Many search engine sites operate this processing strategically to become popular interneet portal sites which provede users ways how to information on the web. Web search engine contacts to thousands of thousands web servers and maintains its existed databases and navigates to get data about newly connected web servers. But these jobs are decided and conducted by search engines. They run web robots to collect data from web servers without knowledge on the states of web servers. Each search engine issues lots of requests and receives responses from web servers. This is one cause to increase internet traffic on the web. If each web server notify web robots about summary on its public documents and then each web robot runs collecting operations using this summary to the corresponding documents on the web servers, the unnecessary internet traffic is eliminated and also the accuracy of data on search engines will become higher. And the processing overhead concerned with web related jobs on web servers and search engines will become lower. In this paper, a monitoring system on the web server is designed and implemented, which monitors states of documents on the web server and summarizes changes of modified documents and sends the summary information to web robots which want to get documents from the web server. And an efficient web robot on the web search engine is also designed and implemented, which uses the notified summary and gets corresponding documents from the web servers and extracts index and updates its databases.

The Development of Software Teaching-Learning Model based on Machine Learning Platform (머신러닝 플랫폼을 활용한 소프트웨어 교수-학습 모형 개발)

  • Park, Daeryoon;Ahn, Joongmin;Jang, Junhyeok;Yu, Wonjin;Kim, Wooyeol;Bae, Youngkwon;Yoo, Inhwan
    • Journal of The Korean Association of Information Education
    • /
    • v.24 no.1
    • /
    • pp.49-57
    • /
    • 2020
  • The society we are living in has being changed to the age of the intelligent information society after passing through the knowledge-based information society in the early 21st century. In this study, we have developed the instructional model for software education based on the machine learning which is a field of artificial intelligence(AI) to enhance the core competencies of learners required in the intelligent information society. This model is focusing on enhancing the core competencies through the process of problem-solving as well as reducing the burden of learning about AI itself. The specific stages of the developed model are consisted of seven levels which are 'Problem Recognition and Analysis', 'Data Collection', 'Data Processing and Feature Extraction', 'ML Model Training and Evaluation', 'ML Programming', 'Application and Problem Solving', and 'Share and Feedback'. As a result of applying the developed model in this study, we were able to observe the positive response about learning from the students and parents. We hope that this research could suggest the future direction of not only the instructional design but also operation of software education program based on machine learning.

An Investigation on Digital Humanities Research Trend by Analyzing the Papers of Digital Humanities Conferences (디지털 인문학 연구 동향 분석 - Digital Humanities 학술대회 논문을 중심으로 -)

  • Chung, EunKyung
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.55 no.1
    • /
    • pp.393-413
    • /
    • 2021
  • Digital humanities, which creates new and innovative knowledge through the combination of digital information technology and humanities research problems, can be seen as a representative multidisciplinary field of study. To investigate the intellectual structure of the digital humanities field, a network analysis of authors and keywords co-word was performed on a total of 441 papers in the last two years (2019, 2020) at the Digital Humanities Conference. As the results of the author and keyword analysis show, we can find out the active activities of Europe, North America, and Japanese and Chinese authors in East Asia. Through the co-author network, 11 dis-connected sub-networks are identified, which can be seen as a result of closed co-authoring activities. Through keyword analysis, 16 sub-subject areas are identified, which are machine learning, pedagogy, metadata, topic modeling, stylometry, cultural heritage, network, digital archive, natural language processing, digital library, twitter, drama, big data, neural network, virtual reality, and ethics. This results imply that a diver variety of digital information technologies are playing a major role in the digital humanities. In addition, keywords with high frequency can be classified into humanities-based keywords, digital information technology-based keywords, and convergence keywords. The dynamics of the growth and development of digital humanities can represented in these combinations of keywords.

Development of a Prototype System for Aquaculture Facility Auto Detection Using KOMPSAT-3 Satellite Imagery (KOMPSAT-3 위성영상 기반 양식시설물 자동 검출 프로토타입 시스템 개발)

  • KIM, Do-Ryeong;KIM, Hyeong-Hun;KIM, Woo-Hyeon;RYU, Dong-Ha;GANG, Su-Myung;CHOUNG, Yun-Jae
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.19 no.4
    • /
    • pp.63-75
    • /
    • 2016
  • Aquaculture has historically delivered marine products because the country is surrounded by ocean on three sides. Surveys on production have been conducted recently to systematically manage aquaculture facilities. Based on survey results, pricing controls on marine products has been implemented to stabilize local fishery resources and to ensure minimum income for fishermen. Such surveys on aquaculture facilities depend on manual digitization of aerial photographs each year. These surveys that incorporate manual digitization using high-resolution aerial photographs can accurately evaluate aquaculture with the knowledge of experts, who are aware of each aquaculture facility's characteristics and deployment of those facilities. However, using aerial photographs has monetary and time limitations for monitoring aquaculture resources with different life cycles, and also requires a number of experts. Therefore, in this study, we investigated an automatic prototype system for detecting boundary information and monitoring aquaculture facilities based on satellite images. KOMPSAT-3 (13 Scene), a local high-resolution satellite provided the satellite imagery collected between October and April, a time period in which many aquaculture facilities were operating. The ANN classification method was used for automatic detecting such as cage, longline and buoy type. Furthermore, shape files were generated using a digitizing image processing method that incorporates polygon generation techniques. In this study, our newly developed prototype method detected aquaculture facilities at a rate of 93%. The suggested method overcomes the limits of existing monitoring method using aerial photographs, but also assists experts in detecting aquaculture facilities. Aquaculture facility detection systems must be developed in the future through application of image processing techniques and classification of aquaculture facilities. Such systems will assist in related decision-making through aquaculture facility monitoring.

The Analysis of Elementary Pre-service Teachers' Reflective Thinking and Experiment Performance Ability on Photosynthesis Experiment (광합성 실험에서 나타난 초등 예비교사들의 반성적 사고와 실험 수행 능력 분석)

  • Kim, Dong-Ryeul
    • Journal of Korean Elementary Science Education
    • /
    • v.34 no.4
    • /
    • pp.502-518
    • /
    • 2015
  • In order to find out Elementary pre-service teachers' reflective thinking and experiment performance ability related with Photosynthesis Experiment in the Korea Elementary School Science Textbook, the research is conducted targeting Elementary pre-service teachers. They are asked to carry out the experiment and write their own report about the difficulties and solutions of exploration process. This study aims to analyze Elementary pre-service teachers' reflection and experiment performance ability on Photosynthesis experiment based on 10 groups' reports and presentation materials. Reflective thinking extracts 108 statements which is associated with the four types of the sentence 'Knowledge, Procedure, Orientation, Attitude' in 10 reports. There are many sentences about reflective thinking acquired through analysis of the photosynthesis experiment. reflective thinking about the newly discovered type or changed concepts through experimentation in Knowledge is at the highest frequency. 56 sentences in relation to the ability to perform experiments are extracted by adding 4 different types of reflective thinking in 10 groups shown the highest frequency group and the lowest frequency group's report through analyzing 4 steps 'Experimental preparation and safety accident prevention', 'Experiments performance', 'Experimental results and generalization', and 'Experimental results and feedback.' Results of the analysis showed that there are the biggest difference between the two groups in 'experiment results supplement and feedback step.' In the lowest group's report, there's no contents related with 'Computer-assisted information processing' in the 'Experimental results summary and generalization stage', 'Alternative reagents and materials research', and 'Devising alternative experiment methods'.

The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification (청음 음성학적 지식에 기반한 음가분류에 의한 핵심어 검출 시스템 구현)

  • Kim, Hack-Jin;Kim, Soon-Hyub
    • The KIPS Transactions:PartB
    • /
    • v.10B no.2
    • /
    • pp.169-178
    • /
    • 2003
  • This study outlines two viewpoints the classification of phone likely unit (PLU) which is the foundation of korean large vocabulary speech recognition, and the effectiveness of Chiljongseong (7 Final Consonants) and Paljogseong (8 Final Consonants) of the korean language. The phone likely classifies the phoneme phonetically according to the location of and method of articulation, and about 50 phone-likely units are utilized in korean speech recognition. In this study auditory phonetical knowledge was applied to the classification of phone likely unit to present 45 phone likely unit. The vowels 'ㅔ, ㅐ'were classified as phone-likely of (ee) ; 'ㅒ, ㅖ' as [ye] ; and 'ㅚ, ㅙ, ㅞ' as [we]. Secondly, the Chiljongseong System of the draft for unified spelling system which is currently in use and the Paljongseonggajokyong of Korean script haerye were illustrated. The question on whether the phonetic value on 'ㄷ' and 'ㅅ' among the phonemes used in the final consonant of the korean fan guage is the same has been argued in the academic world for a long time. In this study, the transition stages of Korean consonants were investigated, and Ciljonseeng and Paljongseonggajokyong were utilized in speech recognition, and its effectiveness was verified. The experiment was divided into isolated word recognition and speech recognition, and in order to conduct the experiment PBW452 was used to test the isolated word recognition. The experiment was conducted on about 50 men and women - divided into 5 groups - and they vocalized 50 words each. As for the continuous speech recognition experiment to be utilized in the materialized stock exchange system, the sentence corpus of 71 stock exchange sentences and speech corpus vocalizing the sentences were collected and used 5 men and women each vocalized a sentence twice. As the result of the experiment, when the Paljongseonggajokyong was used as the consonant, the recognition performance elevated by an average of about 1.45% : and when phone likely unit with Paljongseonggajokyong and auditory phonetic applied simultaneously, was applied, the rate of recognition increased by an average of 1.5% to 2.02%. In the continuous speech recognition experiment, the recognition performance elevated by an average of about 1% to 2% than when the existing 49 or 56 phone likely units were utilized.

A Classification Method of Delirium Patients Using Local Covering-Based Rule Acquisition Approach with Rough Lower Approximation (러프 하한 근사를 갖는 로컬 커버링 기반 규칙 획득 기법을 이용한 섬망 환자의 분류 방법)

  • Son, Chang Sik;Kang, Won Seok;Lee, Jong Ha;Moon, Kyoung Ja
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.4
    • /
    • pp.137-144
    • /
    • 2020
  • Delirium is among the most common mental disorders encountered in patients with a temporary cognitive impairment such as consciousness disorder, attention disorder, and poor speech, particularly among those who are older. Delirium is distressing for patients and families, can interfere with the management of symptoms such as pain, and is associated with increased elderly mortality. The purpose of this paper is to generate useful clinical knowledge that can be used to distinguish the outcomes of patients with delirium in long-term care facilities. For this purpose, we extracted the clinical classification knowledge associated with delirium using a local covering rule acquisition approach with the rough lower approximation region. The clinical applicability of the proposed method was verified using data collected from a prospective cohort study. From the results of this study, we found six useful clinical pieces of evidence that the duration of delirium could more than 12 days. Also, we confirmed eight factors such as BMI, Charlson Comorbidity Index, hospitalization path, nutrition deficiency, infection, sleep disturbance, bed scores, and diaper use are important in distinguishing the outcomes of delirium patients. The classification performance of the proposed method was verified by comparison with three benchmarking models, ANN, SVM with RBF kernel, and Random Forest, using a statistical five-fold cross-validation method. The proposed method showed an improved average performance of 0.6% and 2.7% in both accuracy and AUC criteria when compared with the SVM model with the highest classification performance of the three models respectively.

Finding Weighted Sequential Patterns over Data Streams via a Gap-based Weighting Approach (발생 간격 기반 가중치 부여 기법을 활용한 데이터 스트림에서 가중치 순차패턴 탐색)

  • Chang, Joong-Hyuk
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.3
    • /
    • pp.55-75
    • /
    • 2010
  • Sequential pattern mining aims to discover interesting sequential patterns in a sequence database, and it is one of the essential data mining tasks widely used in various application fields such as Web access pattern analysis, customer purchase pattern analysis, and DNA sequence analysis. In general sequential pattern mining, only the generation order of data element in a sequence is considered, so that it can easily find simple sequential patterns, but has a limit to find more interesting sequential patterns being widely used in real world applications. One of the essential research topics to compensate the limit is a topic of weighted sequential pattern mining. In weighted sequential pattern mining, not only the generation order of data element but also its weight is considered to get more interesting sequential patterns. In recent, data has been increasingly taking the form of continuous data streams rather than finite stored data sets in various application fields, the database research community has begun focusing its attention on processing over data streams. The data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. In data stream processing, each data element should be examined at most once to analyze the data stream, and the memory usage for data stream analysis should be restricted finitely although new data elements are continuously generated in a data stream. Moreover, newly generated data elements should be processed as fast as possible to produce the up-to-date analysis result of a data stream, so that it can be instantly utilized upon request. To satisfy these requirements, data stream processing sacrifices the correctness of its analysis result by allowing some error. Considering the changes in the form of data generated in real world application fields, many researches have been actively performed to find various kinds of knowledge embedded in data streams. They mainly focus on efficient mining of frequent itemsets and sequential patterns over data streams, which have been proven to be useful in conventional data mining for a finite data set. In addition, mining algorithms have also been proposed to efficiently reflect the changes of data streams over time into their mining results. However, they have been targeting on finding naively interesting patterns such as frequent patterns and simple sequential patterns, which are found intuitively, taking no interest in mining novel interesting patterns that express the characteristics of target data streams better. Therefore, it can be a valuable research topic in the field of mining data streams to define novel interesting patterns and develop a mining method finding the novel patterns, which will be effectively used to analyze recent data streams. This paper proposes a gap-based weighting approach for a sequential pattern and amining method of weighted sequential patterns over sequence data streams via the weighting approach. A gap-based weight of a sequential pattern can be computed from the gaps of data elements in the sequential pattern without any pre-defined weight information. That is, in the approach, the gaps of data elements in each sequential pattern as well as their generation orders are used to get the weight of the sequential pattern, therefore it can help to get more interesting and useful sequential patterns. Recently most of computer application fields generate data as a form of data streams rather than a finite data set. Considering the change of data, the proposed method is mainly focus on sequence data streams.

A Study on the Curriculum for Record Management Science Education - with focus on the Faculty of Cultural Information Resources, Surugadai University; Evolving Program, New Connections (기록관리학의 발전을 위한 교육과정연구 -준하태(駿河台)(스루가다이)대학(大學)의 경우를 중심(中心)으로-)

  • Kim, Yong-Won
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.1 no.1
    • /
    • pp.69-94
    • /
    • 2001
  • The purpose of this paper is to provide an overview of the current status of the records management science education in Japan, and to examine the implications of the rapid growth of this filed while noting some of its significant issues and problems. The goal of records management science education is to improve the quality of information services and to assure an adequate supply of information professionals. Because records management science programs prepare students for a professional career, their curricula must encompass elements of both education and practical training. This is often expressed as a contrast between theory and practice. The confluence of the social, economic and technological realities of the environment where the learning takes place affects both. This paper reviews the historical background and current trends of records management science education in Japan. It also analyzes the various types of curriculum and the teaching staff of these institutions, with focus on the status of the undergraduate program at Surugadai University, the first comprehensive, university level program in Japan. The Faculty of Cultural Information Resources, Surugadai University, a new school toward an integrated information disciplines, was opened in 1994, to explore the theory and practice of the management diverse cultural information resources. Its purpose was to stimulate and promote research in additional fields of information science by offering professional training in archival science, records management, and museum curatorship, as well as librarianship. In 1999, the school introduced a master program, the first in Japan. The Faculty has two departments and each of them has two courses; Department of Sensory Information Resources Management; -Sound and Audiovisual Information Management, -Landscape and Tourism Information Management, Department of Knowledge Information Resources Management; -Library and Information Management, -Records and Archives Management The structure of the entire curriculum is also organized in stages from the time of entrance through basic instruction and onwards. Orientation subjects which a student takes immediately upon entering university is an introduction to specialized education, in which he learns the basic methods of university education and study, During his first and second years, he arranges Basic and Core courses as essential steps towards specialization at university. For this purpose, the courses offer a wide variety of study topics. The number of courses offered, including these, amounts to approximately 150. While from his third year onwards, he begins specific courses that apply to his major field, and in a gradual accumulation of seminar classes and practical training, puts his knowledge grained to practical use. Courses pertaining to these departments are offered to students beginning their second year. However, there is no impenetrable wall between the two departments, and there are only minor differences with regard requirements for graduation. Students may select third or fourth year seminars regardless of the department to which they belong. To be awarded a B.A. in Cultural Information Resources, the student is required to earn 34 credits in Basic Courses(such as, Social History of Cultural Information, Cultural Anthropology, History of Science, Behavioral Sciences, Communication, etc.), 16 credits in Foreign Languages(including 10 in English), 14 credits on Information Processing(including both theory and practice), and 60 credits in the courses for his or her major. Finally, several of the issues and problems currently facing records management science education in Japan are briefly summarized below; -Integration and Incorporation of related areas and similar programs, -Curriculum Improvement, -Insufficient of Textbooks, -Lack of qualified Teachers, -Problems of the employment of Graduates. As we moved toward more sophisticated, integrated, multimedia information services, information professionals will need to work more closely with colleagues in other specialties. It will become essential to the survival of the information professions for librarians to work with archivists, record managers and museum curators. Managing the changes in our increasingly information-intensive society demands strong coalitions among everyone in cultural Institutions. To provide our future colleagues with these competencies will require building and strengthening partnerships within and across the information professions and across national borders.

A Study on Prediction of Wake Distribution by Neuro-Fuzzy System (뉴로퍼지시스템에 의한 반류분포 추정에 관한 연구)

  • Shin, Sung-Chul
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.2
    • /
    • pp.154-159
    • /
    • 2007
  • Wake distribution data of stem flow fields have been accumulated systematically by model tests. If the correlation between geometrical hull information and wake distribution is grasped through the accumulated data, this correlation can be helpful to designing similar ships. In this paper, Neuro-Fuzzy system that is emerging as a new knowledge over a wide range of fields nowadays is tried to estimate the wake distribution on the propeller plan. Neuro-Fuzzy system is well known as one of prospective and representative analysis method for prediction, classification, diagnosis of real complicated world problem, and it is widely applied even in the engineering fields. For this study three-dimensional stern hull forms and nominal wake values from a model test ate structured as processing elements of input and output layer, respectively. The proposed method is proved as an useful technique in ship design by comparing measured wake distribution with predicted wake distribution.