• Title/Summary/Keyword: Classification of Difficulty

Search Result 247, Processing Time 0.022 seconds

Analysis of Korea Earth Science Olympiad Items for the Enhancement of Item Quality (한국 지구과학 올림피아드 문항 분석을 통한 문항의 질 향상 방안)

  • Lee Ki-Young;Kim Chan-Jong
    • Journal of the Korean earth science society
    • /
    • v.26 no.6
    • /
    • pp.511-523
    • /
    • 2005
  • The purpose of this study is to analyze the 1st and 2nd Korea Earth Science Olympiad (KESO) items, in order to find informations to enhance item quality. To do this, internal and external item classification frameworks are developed. Item difficulty (P), discrimination index (DI), correlation, and reliability are estimated by using classical test theory. Generalizability is also estimated by applying the generalizability theory. The results of item classification are as follows: (1) ‘Geology’, ‘astronomy’ and ‘data analysis and interpretation’ are dominant in content and inquiry process domain, respectively. Nearly every item has textbook context. (2) There is no difference between the preliminary and final tests in terms of their thinking skills sections. (3) As a whole, the ratio of items with pictures is high in item representation. However, multiple-choice and short answer items are more common in preliminary competition, and essay type items are found more often in final competition. The ratio of simple items is high in middle school section and preliminary competition, but composite items are dominant in high school section and final competition. The findings of item analysis are as follows: (1) In the middle school section, P is low and DI is moderate. But in the high school section, there is a considerable differences between science high schools and other high schools in general. (2) The highest correlation is reported between the scores of meteorology domain and total score in middle school, whereas in high school astronomy domain and total score show the highest correlation. (3) General high school section show the highest Cronbach $\alpha$ and generalizability. (4) General high school section show acceptable generalizability coefficient (> 0.80), but middle and science high school section should increase the number of items to reach acceptable generalizability level.

A Study on GIS Component Classification considering Functional/Non-Functional Elements (기능적/비기능적 요소를 고려한 GIS 컴포넌트 분류에 관한 연구)

  • Jo, Yun-Won;Jo, Myung-Hee
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.5 no.3
    • /
    • pp.77-86
    • /
    • 2002
  • Recently software industry in GIS(geographic information system) becomes an interesting issue by performing a large scale of national GIS application development as well as even small unit of FMS(facility management system). Also, there exist many cases to combine GIS with various business domains such as MIS(marketing information system), CNS(car navigation system) and ITS(intelligent transportation system). In this situation, in order to develop an efficient and useful GIS application for a short term, there must be a deep consideration of not only developing GIS component but also managing GIS component. In fact, even though there exist many certain components having high reusability, excellent interoperability and good quality, their reusability may be reduced because of their difficulty to access in a certain repository. Therefore, it is important to classify components having common characteristic based on their particular rule with reflecting their functionality and non-functionality before cataloging them. Here, there are two non-functional classification categories discussed such as GIS content-dependent metadata and GIS content-independent metadata. This cataloged components will help application developers to select easily their desired components. Moreover, new components may be easily producted by modifying and combining previous components. Finally, the original goal of all this effort can be defined through obtaining high reusability and interoperability of GIS component.

  • PDF

Learning Method for Regression Model by Analysis of Relationship Between Input and Output Data with Periodicity (주기성을 갖는 입출력 데이터의 연관성 분석을 통한 회귀 모델 학습 방법)

  • Kim, Hye-Jin;Park, Ye-Seul;Lee, Jung-Won
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.7
    • /
    • pp.299-306
    • /
    • 2022
  • In recent, sensors embedded in robots, equipment, and circuits have become common, and research for diagnosing device failures by learning measured sensor data is being actively conducted. This failure diagnosis study is divided into a classification model for predicting failure situations or types and a regression model for numerically predicting failure conditions. In the case of a classification model, it simply checks the presence or absence of a failure or defect (Class), whereas a regression model has a higher learning difficulty because it has to predict one value among countless numbers. So, the reason that regression modeling is more difficult is that there are many irregular situations in which it is difficult to determine one output from a similar input when predicting by matching input and output. Therefore, in this paper, we focus on input and output data with periodicity, analyze the input/output relationship, and secure regularity between input and output data by performing sliding window-based input data patterning. In order to apply the proposed method, in this study, current and temperature data with periodicity were collected from MMC(Modular Multilevel Converter) circuit system and learning was carried out using ANN. As a result of the experiment, it was confirmed that when a window of 2% or more of one cycle was applied, performance of 97% or more of fit could be secured.

Increasing Accuracy of Classifying Useful Reviews by Removing Neutral Terms (중립도 기반 선택적 단어 제거를 통한 유용 리뷰 분류 정확도 향상 방안)

  • Lee, Minsik;Lee, Hong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.129-142
    • /
    • 2016
  • Customer product reviews have become one of the important factors for purchase decision makings. Customers believe that reviews written by others who have already had an experience with the product offer more reliable information than that provided by sellers. However, there are too many products and reviews, the advantage of e-commerce can be overwhelmed by increasing search costs. Reading all of the reviews to find out the pros and cons of a certain product can be exhausting. To help users find the most useful information about products without much difficulty, e-commerce companies try to provide various ways for customers to write and rate product reviews. To assist potential customers, online stores have devised various ways to provide useful customer reviews. Different methods have been developed to classify and recommend useful reviews to customers, primarily using feedback provided by customers about the helpfulness of reviews. Most shopping websites provide customer reviews and offer the following information: the average preference of a product, the number of customers who have participated in preference voting, and preference distribution. Most information on the helpfulness of product reviews is collected through a voting system. Amazon.com asks customers whether a review on a certain product is helpful, and it places the most helpful favorable and the most helpful critical review at the top of the list of product reviews. Some companies also predict the usefulness of a review based on certain attributes including length, author(s), and the words used, publishing only reviews that are likely to be useful. Text mining approaches have been used for classifying useful reviews in advance. To apply a text mining approach based on all reviews for a product, we need to build a term-document matrix. We have to extract all words from reviews and build a matrix with the number of occurrences of a term in a review. Since there are many reviews, the size of term-document matrix is so large. It caused difficulties to apply text mining algorithms with the large term-document matrix. Thus, researchers need to delete some terms in terms of sparsity since sparse words have little effects on classifications or predictions. The purpose of this study is to suggest a better way of building term-document matrix by deleting useless terms for review classification. In this study, we propose neutrality index to select words to be deleted. Many words still appear in both classifications - useful and not useful - and these words have little or negative effects on classification performances. Thus, we defined these words as neutral terms and deleted neutral terms which are appeared in both classifications similarly. After deleting sparse words, we selected words to be deleted in terms of neutrality. We tested our approach with Amazon.com's review data from five different product categories: Cellphones & Accessories, Movies & TV program, Automotive, CDs & Vinyl, Clothing, Shoes & Jewelry. We used reviews which got greater than four votes by users and 60% of the ratio of useful votes among total votes is the threshold to classify useful and not-useful reviews. We randomly selected 1,500 useful reviews and 1,500 not-useful reviews for each product category. And then we applied Information Gain and Support Vector Machine algorithms to classify the reviews and compared the classification performances in terms of precision, recall, and F-measure. Though the performances vary according to product categories and data sets, deleting terms with sparsity and neutrality showed the best performances in terms of F-measure for the two classification algorithms. However, deleting terms with sparsity only showed the best performances in terms of Recall for Information Gain and using all terms showed the best performances in terms of precision for SVM. Thus, it needs to be careful for selecting term deleting methods and classification algorithms based on data sets.

Risk factors affecting the difficulty of fiberoptic nasotracheal intubation

  • Rhee, Seung-Hyun;Yun, Hye Joo;Kim, Jieun;Karm, Myong-Hwan;Ryoo, Seung-Hwa;Kim, Hyun Jeong;Seo, Kwang-Suk
    • Journal of Dental Anesthesia and Pain Medicine
    • /
    • v.20 no.5
    • /
    • pp.293-301
    • /
    • 2020
  • Background: The success rate of intubation under direct laryngoscopy is greatly influenced by laryngoscopic grade using the Cormack-Lehane classification. However, it is not known whether grade under direct laryngoscopy can also affects the success rate of nasotracheal intubation using a fiberoptic bronchoscpe, so this study investigated the same. In addition, we investigated other factors that influence the success rate of fiberoptic nasotracheal intubation (FNI). Methods: FNI was performed by 18 anesthesiology residents under general anesthesia in patients over 15 years of age who underwent elective oral and maxillofacial operations. In all patients, the Mallampati grade was measured. Laryngeal view grade under direct laryngoscopy, and the degree of secretion and bleeding in the oral cavity was measured and divided into 3 grades. The time required for successful FNI was measured. If the intubation time was > 5 minutes, it was evaluated as a failure and the airway was managed by another method. The failure rate was evaluated using appropriate statistical method. Receiver operating characteristic (ROC) curves and area under the curve (AUC) were also measured. Results: A total of 650 patients were included in the study, and the failure rate of FNI was 4.5%. The patient's sex, age, height, weight, Mallampati, and laryngoscopic view grade did not affect the success rate of FNI (P > 0.05). BMI, the number of FNI performed by residents (P = 0.03), secretion (P < 0.001), and bleeding (P < 0.001) grades influenced the success rate. The AUCs of bleeding and secretion were 0.864 and 0.798, respectively, but the AUC of BMI, the number of FNI performed by residents, Mallampati, and laryngoscopic view grade were 0.527, 0.616, 0.614, and 0.544, respectively. Conclusion: Unlike in intubation under direct laryngoscopy, in the case of FNI, oral secretion and nasal bleeding had a significant effect on FNI difficulty than Mallampati grade or Laryngeal view grade.

Ensemble Learning with Support Vector Machines for Bond Rating (회사채 신용등급 예측을 위한 SVM 앙상블학습)

  • Kim, Myoung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.18 no.2
    • /
    • pp.29-45
    • /
    • 2012
  • Bond rating is regarded as an important event for measuring financial risk of companies and for determining the investment returns of investors. As a result, it has been a popular research topic for researchers to predict companies' credit ratings by applying statistical and machine learning techniques. The statistical techniques, including multiple regression, multiple discriminant analysis (MDA), logistic models (LOGIT), and probit analysis, have been traditionally used in bond rating. However, one major drawback is that it should be based on strict assumptions. Such strict assumptions include linearity, normality, independence among predictor variables and pre-existing functional forms relating the criterion variablesand the predictor variables. Those strict assumptions of traditional statistics have limited their application to the real world. Machine learning techniques also used in bond rating prediction models include decision trees (DT), neural networks (NN), and Support Vector Machine (SVM). Especially, SVM is recognized as a new and promising classification and regression analysis method. SVM learns a separating hyperplane that can maximize the margin between two categories. SVM is simple enough to be analyzed mathematical, and leads to high performance in practical applications. SVM implements the structuralrisk minimization principle and searches to minimize an upper bound of the generalization error. In addition, the solution of SVM may be a global optimum and thus, overfitting is unlikely to occur with SVM. In addition, SVM does not require too many data sample for training since it builds prediction models by only using some representative sample near the boundaries called support vectors. A number of experimental researches have indicated that SVM has been successfully applied in a variety of pattern recognition fields. However, there are three major drawbacks that can be potential causes for degrading SVM's performance. First, SVM is originally proposed for solving binary-class classification problems. Methods for combining SVMs for multi-class classification such as One-Against-One, One-Against-All have been proposed, but they do not improve the performance in multi-class classification problem as much as SVM for binary-class classification. Second, approximation algorithms (e.g. decomposition methods, sequential minimal optimization algorithm) could be used for effective multi-class computation to reduce computation time, but it could deteriorate classification performance. Third, the difficulty in multi-class prediction problems is in data imbalance problem that can occur when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed boundary and thus the reduction in the classification accuracy of such a classifier. SVM ensemble learning is one of machine learning methods to cope with the above drawbacks. Ensemble learning is a method for improving the performance of classification and prediction algorithms. AdaBoost is one of the widely used ensemble learning techniques. It constructs a composite classifier by sequentially training classifiers while increasing weight on the misclassified observations through iterations. The observations that are incorrectly predicted by previous classifiers are chosen more often than examples that are correctly predicted. Thus Boosting attempts to produce new classifiers that are better able to predict examples for which the current ensemble's performance is poor. In this way, it can reinforce the training of the misclassified observations of the minority class. This paper proposes a multiclass Geometric Mean-based Boosting (MGM-Boost) to resolve multiclass prediction problem. Since MGM-Boost introduces the notion of geometric mean into AdaBoost, it can perform learning process considering the geometric mean-based accuracy and errors of multiclass. This study applies MGM-Boost to the real-world bond rating case for Korean companies to examine the feasibility of MGM-Boost. 10-fold cross validations for threetimes with different random seeds are performed in order to ensure that the comparison among three different classifiers does not happen by chance. For each of 10-fold cross validation, the entire data set is first partitioned into tenequal-sized sets, and then each set is in turn used as the test set while the classifier trains on the other nine sets. That is, cross-validated folds have been tested independently of each algorithm. Through these steps, we have obtained the results for classifiers on each of the 30 experiments. In the comparison of arithmetic mean-based prediction accuracy between individual classifiers, MGM-Boost (52.95%) shows higher prediction accuracy than both AdaBoost (51.69%) and SVM (49.47%). MGM-Boost (28.12%) also shows the higher prediction accuracy than AdaBoost (24.65%) and SVM (15.42%)in terms of geometric mean-based prediction accuracy. T-test is used to examine whether the performance of each classifiers for 30 folds is significantly different. The results indicate that performance of MGM-Boost is significantly different from AdaBoost and SVM classifiers at 1% level. These results mean that MGM-Boost can provide robust and stable solutions to multi-classproblems such as bond rating.

A Study on Diagnostic Criteria of Noise-Induced Hearing Loss among Workers in an Iron Foundry (철강공장 근로자를 대상으로 살펴본 소음성 난청 진단기준에 관한 조사)

  • Kim, Ji-Yong;Lim, Hyun-Sul;Cheong, Hae-Kwan;Moon, Ok-Ryun
    • Journal of Preventive Medicine and Public Health
    • /
    • v.26 no.3 s.43
    • /
    • pp.371-386
    • /
    • 1993
  • This study was carried out to evaluate diagnostic criteria of noise-induced hearing loss (NIHL) among-workers in an iron foundry. Of 1,093 workers under the observation of noise-specific health examination, 184 workers were selected by way of first and second screening audiometric examination. A questionnaire survey, otological examinations, Rinne test and audiometric test were performed and the results were as follows ; The degree of hearing impairment in the left ear was more severe than in the right ear (p<0.05). The difference between hearing threshold of the first and the second hearing test at 1,000 Hz was about 5 dB with a narrow range of deviations while the difference at 4,000 Hz was about -7 dB with a wide range. Of the total study workers, 84.8% were tested within 15 hours away from noise exposure, and the rest after 16 hours. This study has identified that mean hearing loss at 4,000 Hz showed a significant statistical difference among the two study groups while mean hearing loss by 4-divided classification did not. The same phenomena were observed between the group with and without tinnitus and between the group with and without difficulty in hearing (p<0.05). Among 184 workers, 10 workers (5.4%) diagnosed as NIHL by old diagnostic criteria in contrast to 150 workers diagnosed as NIHL by the new diagnostic criteria. There was a significant difference between the two groups in the average hearing loss at 4,000 Hz and 4-divided classification (p<0.01), but there were no significant differences in age, the duration of employment, blood pressure and the duration wearing the personal hearing protector (p>0.05). If we apply Early Loss Index (ELI) method, some workers in younger age group diagnosed as NIHL by the new diagnostic criteria were fallen into within the normal range. In the mean time older age group show reverse results in contrast to the above finding. It is too early to confirm the value of the usage of the new diagnostic criteria in hearing examination. Further study is called for to verify the value of this criteria.

  • PDF

Discussions on the Accessibility of School Library DLS Catalogue Records - Focused on Literary Collections - (학교도서관 DLS 목록의 자료 접근성에 대한 논의 - 문학 분야 장서를 중심으로 -)

  • Kang, Bong-Suk;Jung, Youngmi
    • Journal of Korean Library and Information Science Society
    • /
    • v.50 no.4
    • /
    • pp.539-559
    • /
    • 2019
  • One of the fundamental roles of libraries is to provide users with efficient and easy retrieval of materials. Various discussions have been made at domestic and abroad to improve the accessibility of materials by category, user, and collection, and at the center of this is the issue of improving classification and cataloging systems. However, there are few studies in this area dealing with the data accessibility of the DLS catalog, which is a central tool for accessing domestic school library materials. This study started from the appeal of school library users to the difficulty of searching and accessing books, especially literature. This study is an exploratory study that attempts to derive problems by finding the causes of there difficulties from various aspects. To this study, we surveyed and analyzed the current status of school library collections, the data registration of the school library support system DLS, the subject accessibility of catalog records produced through this, and the recognition and opinions of school library professionals. As a result, school library collections were highly concentrated in the literature field, and it was found that there was not enough catalog bibliographic records to provide efficient access to these collections. In addition, it was found to be somewhat lacking through the DLS search function to compensate for this. Surveys of school librarians and librarians have also identified this problem, and a rich topic index and search keyword assignments have been drawn to the majority of opinions as a way to improve access to materials in school library catalogs. As a continuous discussion on this subject, the plan for improving access to school library materials will be more concrete through future user studies and new challenges for bookshelf classification.

Motion Study of Treatment Robot for Autistic Children Using Speech Data Classification Based on Artificial Neural Network (음성 분류 인공신경망을 활용한 자폐아 치료용 로봇의 지능화 동작 연구)

  • Lee, Jin-Gyu;Lee, Bo-Hee
    • Journal of IKEEE
    • /
    • v.23 no.4
    • /
    • pp.1440-1447
    • /
    • 2019
  • Currently, the prevalence of autism spectrum disorders in children is reported to be higher and shows various types of disorders. In particular, they are having difficulty in communication due to communication impairment in the area of social communication and need to be improved through training. Thus, this study proposes a method of acquiring voice information through a microphone mounted on a robot designed through preliminary research and using this information to make intelligent motions. An ANN(Artificial Neural Network) was used to classify the speech data into robot motions, and we tried to improve the accuracy by combining the Recurrent Neural Network based on Convolutional Neural Network. The preprocessing of input speech data was analyzed using MFCC(Mel-Frequency Cepstral Coefficient), and the motion of the robot was estimated using various data normalization and neural network optimization techniques. In addition, the designed ANN showed a high accuracy by conducting an experiment comparing the accuracy with the existing architecture and the method of human intervention. In order to design robot motions with higher accuracy in the future and to apply them in the treatment and education environment of children with autism.

Dust Explosion Characteristics of Multi-Walled Carbon Nano Tube (다중벽 탄소나노튜브의 분진폭발 특성)

  • Han, In Soo;Lee, Keun Won;Choi, Yi Rac
    • Korean Chemical Engineering Research
    • /
    • v.55 no.1
    • /
    • pp.40-47
    • /
    • 2017
  • Dust explosion hazards are always present when combustible dusts are manufactured or handled in the process. However, industries is experiencing difficulty in establishing chemical accident prevention measures because of insufficiency of information on dust explosion characteristics of combustible dust handled in industry. In this study, we investigated experimentally dust explosion characteristics of two kinds of multi-walled carbon nano tubes (MWCNT) different in particle size distribution and examined classification of dust explosion hazardous area for MWCNT manufacturing or handling process by applying the NFPA 499 code. As a result, $P_{max}$, $K_{st}$, LEL, MIE and MIT of MWCNT 1 having $124.2{\mu}m$ median diameter are obtained 6.3 bar, $56bar{\cdot}m/s$, $125g/m^3$, over 1000 mJ, and over $650^{\circ}C$. $P_{max}$, $K_{st}$, LEL, MIE and MIT of MWCNT 2 having $293.5{\mu}m$ median diameter are 6.2 bar, $42bar{\cdot}m/s$, $100g/m^3$, over 1000 mJ, and over $650^{\circ}C$, respectively. MWCNT 1, 2 are not categorized as combustible dust listed in the NFPA 499 Code for classification of dust explosion hazardous area because explosion severity and ignition sensitivity of MWCNT 1, 2 are below 0.35 and 0.01, respectively.