• Title/Summary/Keyword: reasoning pattern

Search Result 141, Processing Time 0.021 seconds

A New Incremental Instance-Based Learning Using Recursive Partitioning (재귀분할을 이용한 새로운 점진적 인스턴스 기반 학습기법)

  • Han Jin-Chul;Kim Sang-Kwi;Yoon Chung-Hwa
    • The KIPS Transactions:PartB
    • /
    • v.13B no.2 s.105
    • /
    • pp.127-132
    • /
    • 2006
  • K-NN (k-Nearest Neighbors), which is a well-known instance-based learning algorithm, simply stores entire training patterns in memory, and uses a distance function to classify a test pattern. K-NN is proven to show satisfactory performance, but it is notorious formemory usage and lengthy computation. Various studies have been found in the literature in order to minimize memory usage and computation time, and NGE (Nested Generalized Exemplar) theory is one of them. In this paper, we propose RPA (Recursive Partition Averaging) and IRPA (Incremental RPA) which is an incremental version of RPA. RPA partitions the entire pattern space recursively, and generates representatives from each partition. Also, due to the fact that RPA is prone to produce excessive number of partitions as the number of features in a pattern increases, we present IRPA which reduces the number of representative patterns by processing the training set in an incremental manner. Our proposed methods have been successfully shown to exhibit comparable performance to k-NN with a lot less number of patterns and better result than EACH system which implements the NGE theory.

An Analysis on Teaching Methods of Patterns in Elementary Mathematics Textbooks (초등학교 수학 교과서에 제시된 패턴 지도방안에 대한 분석)

  • Pang, JeongSuk;Sunwoo, Jin
    • Education of Primary School Mathematics
    • /
    • v.19 no.1
    • /
    • pp.1-18
    • /
    • 2016
  • Patterns are of great significance to develop algebraic thinking of elementary students. This study analyzed teaching methods of patterns in current elementary mathematics textbook series in terms of three main activities related to pattern generalization (i.e., analyzing the structure of patterns, investigating the relationship between two variables, and reasoning and representing the generalized rules). The results of this study showed that such activities to analyze the structure of patterns are not explicitly considered in the textbooks, whereas those to explore the relationship between two variables in a pattern are emphasized throughout all grade levels using function table. The activities to reason and represent the generalized rules of patterns are dealt in a way both for lower grade students to use informal representations and for upper grade students to employ formal representations with expressions or symbols. The results of this study also illustrated that patterns in the textbooks are treated rather as a separate strand than as something connected to other content strands. This paper closes with several implications to teach patterns in a way to foster early algebraic thinking of elementary school students.

Behavior Pattern Modeling based Game Bot detection (행동 패턴 모델을 이용한 게임 봇 검출 방법)

  • Park, Sang-Hyun;Jung, Hye-Wuk;Yoon, Tae-Bok;Lee, Jee-Hyong
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.20 no.3
    • /
    • pp.422-427
    • /
    • 2010
  • Korean Game industry, especially MMORPG(Massively Multiplayer Online Game) has been rapidly expanding in these days. But As game industry is growing, lots of online game security incidents have also been increasing and getting prevailing. One of the most critical security incidents is 'Game Bots', which are programs to play MMORPG instead of human players. If player let the game bots play for them, they can get a lot of benefic game elements (experience points, items, etc.) without any effort, and it is considered unfair to other players. Plenty of game companies try to prevent bots, but it does not work well. In this paper, we propose a behavior pattern model for detecting bots. We analyzed behaviors of human players as well as bots and identified six game features to build the model to differentiate game bots from human players. Based on these features, we made a Naive Bayesian classifier to reasoning the game bot or not. To evaluated our method, we used 10 game bot data and 6 human Player data. As a result, we classify Game bot and human player with 88% accuracy.

Analyzing Contextual Polarity of Unstructured Data for Measuring Subjective Well-Being (주관적 웰빙 상태 측정을 위한 비정형 데이터의 상황기반 긍부정성 분석 방법)

  • Choi, Sukjae;Song, Yeongeun;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.1
    • /
    • pp.83-105
    • /
    • 2016
  • Measuring an individual's subjective wellbeing in an accurate, unobtrusive, and cost-effective manner is a core success factor of the wellbeing support system, which is a type of medical IT service. However, measurements with a self-report questionnaire and wearable sensors are cost-intensive and obtrusive when the wellbeing support system should be running in real-time, despite being very accurate. Recently, reasoning the state of subjective wellbeing with conventional sentiment analysis and unstructured data has been proposed as an alternative to resolve the drawbacks of the self-report questionnaire and wearable sensors. However, this approach does not consider contextual polarity, which results in lower measurement accuracy. Moreover, there is no sentimental word net or ontology for the subjective wellbeing area. Hence, this paper proposes a method to extract keywords and their contextual polarity representing the subjective wellbeing state from the unstructured text in online websites in order to improve the reasoning accuracy of the sentiment analysis. The proposed method is as follows. First, a set of general sentimental words is proposed. SentiWordNet was adopted; this is the most widely used dictionary and contains about 100,000 words such as nouns, verbs, adjectives, and adverbs with polarities from -1.0 (extremely negative) to 1.0 (extremely positive). Second, corpora on subjective wellbeing (SWB corpora) were obtained by crawling online text. A survey was conducted to prepare a learning dataset that includes an individual's opinion and the level of self-report wellness, such as stress and depression. The participants were asked to respond with their feelings about online news on two topics. Next, three data sources were extracted from the SWB corpora: demographic information, psychographic information, and the structural characteristics of the text (e.g., the number of words used in the text, simple statistics on the special characters used). These were considered to adjust the level of a specific SWB. Finally, a set of reasoning rules was generated for each wellbeing factor to estimate the SWB of an individual based on the text written by the individual. The experimental results suggested that using contextual polarity for each SWB factor (e.g., stress, depression) significantly improved the estimation accuracy compared to conventional sentiment analysis methods incorporating SentiWordNet. Even though literature is available on Korean sentiment analysis, such studies only used only a limited set of sentimental words. Due to the small number of words, many sentences are overlooked and ignored when estimating the level of sentiment. However, the proposed method can identify multiple sentiment-neutral words as sentiment words in the context of a specific SWB factor. The results also suggest that a specific type of senti-word dictionary containing contextual polarity needs to be constructed along with a dictionary based on common sense such as SenticNet. These efforts will enrich and enlarge the application area of sentic computing. The study is helpful to practitioners and managers of wellness services in that a couple of characteristics of unstructured text have been identified for improving SWB. Consistent with the literature, the results showed that the gender and age affect the SWB state when the individual is exposed to an identical queue from the online text. In addition, the length of the textual response and usage pattern of special characters were found to indicate the individual's SWB. These imply that better SWB measurement should involve collecting the textual structure and the individual's demographic conditions. In the future, the proposed method should be improved by automated identification of the contextual polarity in order to enlarge the vocabulary in a cost-effective manner.

Response Modeling for the Marketing Promotion with Weighted Case Based Reasoning Under Imbalanced Data Distribution (불균형 데이터 환경에서 변수가중치를 적용한 사례기반추론 기반의 고객반응 예측)

  • Kim, Eunmi;Hong, Taeho
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.29-45
    • /
    • 2015
  • Response modeling is a well-known research issue for those who have tried to get more superior performance in the capability of predicting the customers' response for the marketing promotion. The response model for customers would reduce the marketing cost by identifying prospective customers from very large customer database and predicting the purchasing intention of the selected customers while the promotion which is derived from an undifferentiated marketing strategy results in unnecessary cost. In addition, the big data environment has accelerated developing the response model with data mining techniques such as CBR, neural networks and support vector machines. And CBR is one of the most major tools in business because it is known as simple and robust to apply to the response model. However, CBR is an attractive data mining technique for data mining applications in business even though it hasn't shown high performance compared to other machine learning techniques. Thus many studies have tried to improve CBR and utilized in business data mining with the enhanced algorithms or the support of other techniques such as genetic algorithm, decision tree and AHP (Analytic Process Hierarchy). Ahn and Kim(2008) utilized logit, neural networks, CBR to predict that which customers would purchase the items promoted by marketing department and tried to optimized the number of k for k-nearest neighbor with genetic algorithm for the purpose of improving the performance of the integrated model. Hong and Park(2009) noted that the integrated approach with CBR for logit, neural networks, and Support Vector Machine (SVM) showed more improved prediction ability for response of customers to marketing promotion than each data mining models such as logit, neural networks, and SVM. This paper presented an approach to predict customers' response of marketing promotion with Case Based Reasoning. The proposed model was developed by applying different weights to each feature. We deployed logit model with a database including the promotion and the purchasing data of bath soap. After that, the coefficients were used to give different weights of CBR. We analyzed the performance of proposed weighted CBR based model compared to neural networks and pure CBR based model empirically and found that the proposed weighted CBR based model showed more superior performance than pure CBR model. Imbalanced data is a common problem to build data mining model to classify a class with real data such as bankruptcy prediction, intrusion detection, fraud detection, churn management, and response modeling. Imbalanced data means that the number of instance in one class is remarkably small or large compared to the number of instance in other classes. The classification model such as response modeling has a lot of trouble to recognize the pattern from data through learning because the model tends to ignore a small number of classes while classifying a large number of classes correctly. To resolve the problem caused from imbalanced data distribution, sampling method is one of the most representative approach. The sampling method could be categorized to under sampling and over sampling. However, CBR is not sensitive to data distribution because it doesn't learn from data unlike machine learning algorithm. In this study, we investigated the robustness of our proposed model while changing the ratio of response customers and nonresponse customers to the promotion program because the response customers for the suggested promotion is always a small part of nonresponse customers in the real world. We simulated the proposed model 100 times to validate the robustness with different ratio of response customers to response customers under the imbalanced data distribution. Finally, we found that our proposed CBR based model showed superior performance than compared models under the imbalanced data sets. Our study is expected to improve the performance of response model for the promotion program with CBR under imbalanced data distribution in the real world.

Pre-service mathematics teachers' noticing competency: Focusing on teaching for robust understanding of mathematics (예비 수학교사의 수학적 사고 중심 수업에 관한 노티싱 역량 탐색)

  • Kim, Hee-jeong
    • The Mathematical Education
    • /
    • v.61 no.2
    • /
    • pp.339-357
    • /
    • 2022
  • This study explores pre-service secondary mathematics teachers (PSTs)' noticing competency. 17 PSTs participated in this study as a part of the mathematics teaching method class. Individual PST's essays regarding the question 'what effective mathematics teaching would be?' that they discussed and wrote at the beginning of the course were collected as the first data. PSTs' written analysis of an expert teacher's teaching video, colleague PSTs' demo-teaching video, and own demo-teaching video were also collected and analyzed. Findings showed that most PSTs' noticing level improved as the class progressed and showed a pattern of focusing on each key aspect in terms of the Teaching for Robust Understanding of Mathematics (TRU Math) framework, but their reasoning strategies were somewhat varied. This suggests that the TRU Math framework can support PSTs to improve the competency of 'what to attend' among the noticing components. In addition, the instructional reasoning strategies imply that PSTs' noticing reasoning strategy was mostly related to their interpretation of noticing components, which should be also emphasized in the teacher education program.

Internationalization Strategy of the Fisheries - Processing Firms of Korea (우리나라 수산물가공기업의 국제화전략)

  • 하종욱;박영병;어윤양
    • The Journal of Fisheries Business Administration
    • /
    • v.22 no.2
    • /
    • pp.19-51
    • /
    • 1991
  • The objectives of the study are to look into the fisheries processing industry, analyze problems the industry has, and develop strategies for the industry to take care of the problems. The study was performed in two different dimensions : industry level and firm level. Nevertheless, the study focused on the following main problem areas raw material, production, technology development, internationalization, and managerial performance. The secondary data were utilized to analyze problems at the industry level, For analyzing the firm level situation, an empirical study by using a mail survey with a questionnaire was accomplished. The main problems found were as follows : First, difficulty in procuring raw material was the most serious and main problem. It was caused, externally, by the announcement of 200 nautical miles by most of fishery abundant countries and, internally, by drying fishery resources in the nation's coastal areas ; Second, the rate of fishery processing has been continuously increased and the degree of the processing has also been sophisticated, which implies the pattern of demand for the fishery has been changing widely and deeply. The industry, however, seemed to be unable to meet the consumers' satisfaction ; Third, with the importance of technology for ensuring the changing demands in the fishery processing industry, there has been little effort in research and development both at industry level and at the firm level ; Next, the industry has mainly involved in exporting in association with internationalization. Not to mention about foreign direct investment, technology transfer was not active ; Finaliy, most of firms were densely located in a few areas. The managerical performance in terms of main financial ratios still needs to be improved. Thus, strategies, which would take care of the repective problems, were developed. At industry level, the strategies were developed by reasoning mostly based on the findings from the literature survey. A scheme for internationalization of the firms was suggested. This was made by extracting the factors which would differentiate the firms' internationalization stages. In order to achieve this analysis, discriminant approach was employed. Despite the utility of the findings, it was mostly emphasized that harmonious efforts among government, the industry supporting institutions such as banks, and firms are needed for the successful operation of the strategies. Also, a list of areas for further study was provided especially in relation to the validity threatening parts of the study.

  • PDF

Investigating the Hierarchical Nature of Content and Cognitive Domains in the Mathematics Curriculum for Korean Middle School Students via Assessment Items (평가 문항을 활용한 중학교 수학 교육과정의 내용 및 인지행동의 위계성 조사)

  • Song, Mi-Young;Kim, Sun-Hee
    • School Mathematics
    • /
    • v.9 no.2
    • /
    • pp.223-240
    • /
    • 2007
  • The purpose of this study was to investigate the degree to which the middle school mathematics curriculum matched the item difficulty levels of representative mathematics items. The items used in this study were developed for the National Assessment of Educational Achievement. Ranks for difficulty values of the 60 multiple-choice item were calculated via both Classical Test Theory and Item Response Theory and correlated with the rank order of the mathematics content and cognitive domains sequence. There are six content domains; number and operation, algebra, measurement, figure, pattern and function, and probability and statistics. The cognitive domains include computation, understanding, reasoning and problem-solving. Results suggest a congruence between cognitive domain's sequence and item difficulty levels of items based on that sequence. This finding indicates that the linear or hierarchical assumptions concerning the sequence appears to be reasonable. The characteristics of items that were exceptions to this trend were addressed.

  • PDF

The Generalization of the Area of Internal Triangles for the GSP Use of Mathematically Gifted Students (중등 영재학생들의 GSP를 활용한 내분삼각형 넓이의 일반화)

  • Lee, Heon-Soo;Lee, Kwang-Ho
    • Journal of the Korean School Mathematics Society
    • /
    • v.15 no.3
    • /
    • pp.565-584
    • /
    • 2012
  • This study investigates how the GSP helps gifted and talented students understand geometric principles and concepts during the inquiry process in the generalization of the internal triangle, and how the students logically proceeded to visualize the content during the process of generalization. Four mathematically gifted students were chosen for the study. They investigated the pattern between the area of the original triangle and the area of the internal triangle with the ratio of each sides on m:n respectively. Digital audio, video and written data were collected and analyzed. From the analysis the researcher found four results. First, the visualization used the GSP helps the students to understand the geometric principles and concepts intuitively. Second, the GSP helps the students to develop their inductive reasoning skills by proving the various cases. Third, the lessons used GSP increases interest in apathetic students and improves their mathematical communication and self-efficiency.

  • PDF

An Effect of Analogy Scaffolding for Middle School Students' Explanatory Hypothesis Generation on Water Wave Interference Phenomenon (물결파의 간섭 현상에 대한 중학생들의 설명 가설 생성을 위한 비유추론 비계 전략의 효과)

  • Kim, Wonsook;Kim, Youngmin
    • Journal of The Korean Association For Science Education
    • /
    • v.37 no.6
    • /
    • pp.1015-1023
    • /
    • 2017
  • The purpose of this study is to investigate the process of development of explanatory hypothesis being generated by middle school students and the factors that affect the process in water wave interference phenomena by analogy scaffolding. For this purpose, the processes on how explanatory hypotheses were generated, revised, and then elaborated by analogy scaffolding were investigated and analyzed. The subjects for the study were 60 eighth grades students in one middle school divided into 12 groups according to their cognitive level measured by GALT. The research findings are as follows: First, it was found that there is a regular pattern in development of explanatory hypothesis by students about water wave phenomenon; second, the cognitive level of the student affected apparently on the students' hypothesis development process, however, it was also observed that students with inferior cognitive level could form a scientific explanatory hypothesis in the second stage of the scaffolding; third, the analogy scaffolding actually helped the students in formulating hypothesis. In conclusion, analogical reasoning can be a meaningful and powerful strategy for secondary school students to formulate explanatory hypothesis.