The Performance Bottleneck of Subsequence Matching in Time-Series Databases: Observation, Solution, and Performance Evaluation (시계열 데이타베이스에서 서브시퀀스 매칭의 성능 병목 : 관찰, 해결 방안, 성능 평가)

  • 김상욱
    • Journal of KIISE:Databases, v.30 no.4, pp.381-396, 2003
  • Subsequence matching is an operation that finds, in a time-series database, the subsequences whose changing patterns are similar to a given query sequence. This paper points out the performance bottleneck in subsequence matching and then proposes an effective method that significantly improves the performance of overall subsequence matching by resolving that bottleneck. First, we analyze the disk access and CPU processing times required during the index searching and post-processing steps through preliminary experiments. Based on the results, we show that the post-processing step is the main performance bottleneck in subsequence matching, and then claim that its optimization is a crucial issue overlooked in previous approaches. To resolve the bottleneck, we propose a simple but quite effective method that performs the post-processing step in the optimal way. By rearranging the order in which candidate subsequences are compared with a query sequence, our method completely eliminates the redundant disk accesses and CPU processing that occur in the post-processing step. We formally prove that our method is optimal and does not incur any false dismissal. We show the effectiveness of our method through extensive experiments. The results show that our method speeds up the post-processing step by 3.91 to 9.42 times on a data set of real-world stock sequences and by 4.97 to 5.61 times on data sets of a large volume of synthetic sequences. The results also show that our method reduces the share of the post-processing step in overall subsequence matching from about 90% to less than 70%. This implies that our method successfully resolves the performance bottleneck in subsequence matching. As a result, our method provides excellent performance in overall subsequence matching. The experimental results reveal that, compared with the previous method, it is 3.05 to 5.60 times faster on a data set of real-world stock sequences and 3.68 to 4.21 times faster on data sets of a large volume of synthetic sequences.
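
The abstract above does not spell out how the candidates are rearranged, so the following is only a minimal sketch of the general idea, under stated assumptions: candidates are sorted by (sequence id, offset) so that each disk page is fetched once. The names `PagedStore`, `post_process`, `PAGE_SIZE`, and `BUFFER_PAGES`, the LRU buffer, and the Euclidean distance are all hypothetical choices for illustration and are not taken from the paper.

```python
from collections import OrderedDict
from math import sqrt

PAGE_SIZE = 4      # hypothetical number of values stored per disk page
BUFFER_PAGES = 2   # hypothetical buffer capacity, in pages


class PagedStore:
    """Toy stand-in for disk-resident sequences that counts page reads."""

    def __init__(self, sequences):
        self.sequences = sequences
        self.buffer = OrderedDict()   # (seq_id, page_no) -> values, in LRU order
        self.reads = 0

    def _page(self, seq_id, page_no):
        key = (seq_id, page_no)
        if key in self.buffer:
            self.buffer.move_to_end(key)          # buffer hit, no disk read
        else:
            self.reads += 1                       # buffer miss costs one disk read
            start = page_no * PAGE_SIZE
            self.buffer[key] = self.sequences[seq_id][start:start + PAGE_SIZE]
            if len(self.buffer) > BUFFER_PAGES:
                self.buffer.popitem(last=False)   # evict the least recently used page
        return self.buffer[key]

    def subsequence(self, seq_id, offset, length):
        return [self._page(seq_id, pos // PAGE_SIZE)[pos % PAGE_SIZE]
                for pos in range(offset, offset + length)]


def euclidean(a, b):
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def post_process(store, candidates, query, tolerance, rearrange):
    """Verify every candidate against the query, optionally reordering first."""
    order = sorted(candidates) if rearrange else list(candidates)
    matches = set()
    for seq_id, offset in order:
        window = store.subsequence(seq_id, offset, len(query))
        if euclidean(window, query) <= tolerance:
            matches.add((seq_id, offset))
    return matches


# Two small synthetic sequences and a candidate list in arbitrary index order.
sequences = {0: [float(i % 5) for i in range(16)],
             1: [float((i * 3) % 7) for i in range(16)]}
query = [0.0, 1.0, 2.0]
candidates = [(0, 10), (1, 2), (0, 0), (1, 9), (0, 5), (1, 3), (0, 9), (0, 1)]

naive_store, sorted_store = PagedStore(sequences), PagedStore(sequences)
naive = post_process(naive_store, candidates, query, 0.5, rearrange=False)
ordered = post_process(sorted_store, candidates, query, 0.5, rearrange=True)
print("page reads, arrival order:", naive_store.reads)
print("page reads, sorted order: ", sorted_store.reads)
```

Both runs verify the same candidates, so the reordering cannot drop a true match; only the number of simulated page reads differs. In this toy setting the sorted order touches each needed page once because page numbers never decrease within a sequence.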

Steel Plate Faults Diagnosis with S-MTS (S-MTS를 이용한 강판의 표면 결함 진단)

  • Kim, Joon-Young;Cha, Jae-Min;Shin, Junguk;Yeom, Choongsub
    • Journal of Intelligence and Information Systems, v.23 no.1, pp.47-67, 2017
  • Steel plate faults are one of the important factors affecting the quality and price of steel plates. So far, many steelmakers have generally relied on visual inspection, which depends on an inspector's intuition or experience. Specifically, the inspector checks for steel plate faults by looking at the surface of the plates. However, the accuracy of this method is so low that judgment errors can exceed 30%. Therefore, an accurate steel plate fault diagnosis system has been continuously required in the industry. To meet this need, this study proposes a new steel plate fault diagnosis system using Simultaneous MTS (S-MTS), an advanced Mahalanobis Taguchi System (MTS) algorithm, to classify various surface defects of steel plates. MTS has generally been used to solve binary classification problems in various fields, but it has not been used for multi-class classification because of its low accuracy. The reason is that only one Mahalanobis space is established in MTS. In contrast, S-MTS is suitable for multi-class classification because it establishes an individual Mahalanobis space for each class. 'Simultaneous' implies comparing Mahalanobis distances at the same time. The proposed steel plate fault diagnosis system was developed in four main stages. In the first stage, after the reference groups and related variables are defined, steel plate fault data are collected and used to establish an individual Mahalanobis space for each reference group and to construct the full measurement scale. In the second stage, the Mahalanobis distances of the test groups are calculated based on the established Mahalanobis spaces of the reference groups. Then, the appropriateness of the spaces is verified by examining the separability of the Mahalanobis distances. In the third stage, orthogonal arrays and the dynamic-type Signal-to-Noise (SN) ratio are applied for variable optimization. The overall SN ratio gain is derived from the SN ratio and the SN ratio gain. If the derived overall SN ratio gain is negative, the variable should be removed; a variable with a positive gain may be considered worth keeping. Finally, in the fourth stage, the measurement scale is reconstructed from the selected useful variables. Next, an experimental test is carried out to verify the multi-class classification ability and to obtain the classification accuracy. If the accuracy is acceptable, the diagnosis system can be used for future applications. This study also compared the accuracy of the proposed steel plate fault diagnosis system with that of other popular classification algorithms, including Decision Tree, Multilayer Perceptron Neural Network (MLPNN), Logistic Regression (LR), Support Vector Machine (SVM), Tree Bagger Random Forest, Grid Search (GS), Genetic Algorithm (GA) and Particle Swarm Optimization (PSO). The steel plate faults dataset used in the study is taken from the University of California at Irvine (UCI) machine learning repository. As a result, the proposed S-MTS-based diagnosis system shows a classification accuracy of 90.79%, which is 6-27% higher than MLPNN, LR, GS, GA and PSO. Given that the accuracy of commercial systems is only about 75-80%, the proposed system has sufficient classification performance to be applied in the industry. In addition, the variable optimization process allows the proposed system to reduce the number of measurement sensors installed in the field. These results show that the proposed system not only diagnoses steel plate faults well but can also reduce operation and maintenance costs. In future work, we plan to apply the proposed system in the field to validate its actual effectiveness and to improve its accuracy based on the results.
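
The core S-MTS idea described in the abstract above, one Mahalanobis space per class and a simultaneous comparison of distances, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it is limited to two variables so the covariance matrix can be inverted by hand, it uses the raw covariance matrix rather than the standardized correlation-matrix form common in MTS, and the class labels, sample values, and function names are all hypothetical. The orthogonal-array and SN-ratio variable selection stage is not shown.

```python
def build_space(rows):
    """Build a 2-variable Mahalanobis space (mean and inverse covariance)."""
    n = len(rows)
    mx = sum(r[0] for r in rows) / n
    my = sum(r[1] for r in rows) / n
    sxx = sum((r[0] - mx) ** 2 for r in rows) / (n - 1)
    syy = sum((r[1] - my) ** 2 for r in rows) / (n - 1)
    sxy = sum((r[0] - mx) * (r[1] - my) for r in rows) / (n - 1)
    det = sxx * syy - sxy * sxy          # must be non-zero for the inverse to exist
    inverse = (syy / det, -sxy / det, sxx / det)
    return (mx, my), inverse


def mahalanobis_sq(x, space):
    """Squared Mahalanobis distance of point x from one class's space."""
    (mx, my), (a, b, c) = space
    dx, dy = x[0] - mx, x[1] - my
    return a * dx * dx + 2 * b * dx * dy + c * dy * dy


def classify(x, spaces):
    """Compare the distances to every class at once and pick the smallest."""
    return min(spaces, key=lambda label: mahalanobis_sq(x, spaces[label]))


# Hypothetical reference groups, one per fault class (made-up measurements).
reference_groups = {
    "class_a": [(1.0, 2.0), (1.5, 1.8), (0.8, 2.4), (1.2, 2.1)],
    "class_b": [(5.0, 6.0), (5.5, 6.3), (4.8, 5.6), (5.2, 6.4)],
}
spaces = {label: build_space(rows) for label, rows in reference_groups.items()}

for sample in [(1.1, 2.0), (5.1, 6.0)]:
    print(sample, "->", classify(sample, spaces))
```

A conventional single-space MTS would only report how far a sample lies from one reference group; keeping a separate space per class is what lets the nearest-distance rule handle more than two classes.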

Trends of Study and Classification of Reference on Occupational Health Management in Korea after Liberation (해방 이후 우리나라 산업보건관리에 관한 문헌분류 및 연구동향)

  • Ha, Eun-Hee;Park, Hye-Sook;Kim, Young-Bok;Song, Hyun-Jong
    • Journal of Preventive Medicine and Public Health, v.28 no.4 s.51, pp.809-844, 1995
  • The purposes of this study are to define the scope of occupational health management and to classify occupational health management through a review of related journals published in Korea from 1945 to 1994. The steps of this study were as follows: (1) Search of secondary references; (2) Collection and review of primary references; (3) Survey; and (4) Analysis and discussion. The results were as follows: 1. Most of the respondents had majored in occupational health (71.6%), worked in universities (68.3%), were male, and were over the age of 40. Seventy percent of the respondents agreed that a classification of occupational health management is necessary, and 10% disagreed. 2. After integrating the respondents' ideas, we reclassified the scope of occupational health management. It was defined in three parts: occupational health system, occupational health service, and others (such as assessment, epidemiology, cost-effectiveness analysis and so on). 3. The number of journals on occupational health management was 510. It increased slightly from 1986 and abruptly after 1991. The journals related to occupational health management were The Korean Journal of Occupational Medicine (18.2%), several kinds of Medical College Journal (17.0%), The Korean Journal Occupational Health (15.1%), The Korean Journal of Preventive Medicine (15.1%) and others (34.6%). As for the contents, the number of journals on occupational health management systems was 33 (6.5%) and on occupational health services 477 (93.5%). Of the journals on occupational health management systems, the number on the occupational health resource system was 15 (45.5%), occupational finance system 8 (24.2%), occupational health management system 6 (18.2%), occupational organization 3 (9.1%) and occupational health delivery system 1 (3.0%). Of the journals on occupational health services, the number on disease management was 269 (57.2%), health management 116 (24.7%), and working environmental management 85 (18.1%). As for the subjects, the number of journals on general workers was 185 (71.1%), followed by women workers, white-collar workers and so on. 4. Respondents made occupational health service (such as health management, working environmental management and health education) the first priority of occupational health management. Tied for second were quality analysis (such as education, training and job contents of occupational health managers) and occupational health systems (such as the recommendation of systems of occupational and general disease and occupational health organization). 5. Thirty-seven respondents suggested 48 ideas about future research on occupational health management. The results were as follows: (1) Study of occupational health service 40.5%; (2) Study of organization system 27.1%; (3) Study of occupational health system (e.g. information network) 8.3%; (4) Study of working condition 6.2%; and (5) Study of occupational health service analysis 4.2%.
