• Title/Summary/Keyword: Component mining

Search Result 142, Processing Time 0.029 seconds

Development of Active Data Mining Component for Web Database Applications (웹 데이터베이스 응용을 위한 액티브데이터마이닝 컴포넌트 개발)

  • Choi, Yong-Goo
    • Journal of Information Technology Applications and Management
    • /
    • v.15 no.2
    • /
    • pp.1-14
    • /
    • 2008
  • The distinguished prosperity of information technologies from great progress of e-business during the last decade has unavoidably made software development for active data mining to discovery hidden predictive information regarding business trends and behavior from vary large databases. Therefore this paper develops an active mining object(ADMO) component, which provides real-time predictive information from web databases. The ADMO component is to extended ADO(ActiveX Data Object) component to active data mining component based on COM(Component Object Model) for application program interface(API). ADMO component development made use of window script component(WSC) based on XML(eXtensible Markup Language). For the purpose of investigating the application environments and the practical schemes of the ADMO component, experiments for diverse practical applications were performed in this paper. As a result, ADMO component confirmed that it could effectively extract the analytic information of classification and aggregation from vary large databases for Web services.

  • PDF

Control of Electrically Excited Synchronous Motors with a Low Switching Frequency

  • Yuan, Qing-Qing;Wu, Xiao-Jie;Dai, Peng;Fu, Xiao
    • Journal of Power Electronics
    • /
    • v.12 no.4
    • /
    • pp.615-622
    • /
    • 2012
  • The switching frequency of the power electronic devices used in large synchronous motor drives is usually kept low (less than 1 kHz) to reduce the switching losses and to improve the converter power capability. However, this results in a couple of problems, e.g. an increase in the harmonic components of the stator current, and an undesired cross-coupling between the magnetization current component ($i_m$) and the torque component ($i_t$). In this paper, a novel complex matrix model of electrically excited synchronous motors (EESM) was established with a new control scheme for coping with the low switching frequency issues. First, a hybrid observer was proposed to identify the instantaneous fundamental component of the stator current, which results in an obvious reduction of both the total harmonic distortion (THD) and the low order harmonics. Then, a novel complex current controller was designed to realize the decoupling between $i_m$ and $i_t$. Simulation and experimental results verify the effectiveness of this novel control system for EESM drives.

Performance Analysis of Perturbation-based Privacy Preserving Techniques: An Experimental Perspective

  • Ritu Ratra;Preeti Gulia;Nasib Singh Gill
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.10
    • /
    • pp.81-88
    • /
    • 2023
  • In the present scenario, enormous amounts of data are produced every second. These data also contain private information from sources including media platforms, the banking sector, finance, healthcare, and criminal histories. Data mining is a method for looking through and analyzing massive volumes of data to find usable information. Preserving personal data during data mining has become difficult, thus privacy-preserving data mining (PPDM) is used to do so. Data perturbation is one of the several tactics used by the PPDM data privacy protection mechanism. In Perturbation, datasets are perturbed in order to preserve personal information. Both data accuracy and data privacy are addressed by it. This paper will explore and compare several perturbation strategies that may be used to protect data privacy. For this experiment, two perturbation techniques based on random projection and principal component analysis were used. These techniques include Improved Random Projection Perturbation (IRPP) and Enhanced Principal Component Analysis based Technique (EPCAT). The Naive Bayes classification algorithm is used for data mining approaches. These methods are employed to assess the precision, run time, and accuracy of the experimental results. The best perturbation method in the Nave-Bayes classification is determined to be a random projection-based technique (IRPP) for both the cardiovascular and hypothyroid datasets.

A Study on Forecasting Spare Parts Demand based on Data-Mining (데이터 마이닝 기반의 수리부속 수요예측 연구)

  • Kim, Jaedong;Lee, Hanjun
    • Journal of Internet Computing and Services
    • /
    • v.18 no.1
    • /
    • pp.121-129
    • /
    • 2017
  • Demand forecasting is one of the most critical tasks in defense logistics, because the failure of the task can bring about a huge waste of budget. Up to date, ROK-MND(Republic of Korea - Ministry of National Defense) has analyzed past component consumption data with time-series techniques to predict each component's demand. However, the accuracy of the prediction still needs to be improved. In our study, we attempted to find consumption pattern using data mining techniques. We gathered an 18,476 component consumption data first, and then derived diverse features to utilize them in identification of demanding patterns in the consumption data. The results show that our approach improves demand forecasting with higher accuracy.

Mitigating the ICA Attack against Rotation-Based Transformation for Privacy Preserving Clustering

  • Mohaisen, Abedelaziz;Hong, Do-Won
    • ETRI Journal
    • /
    • v.30 no.6
    • /
    • pp.868-870
    • /
    • 2008
  • The rotation-based transformation (RBT) for privacy preserving data mining is vulnerable to the independent component analysis (ICA) attack. This paper introduces a modified multiple-rotation-based transformation technique for special mining applications, mitigating the ICA attack while maintaining the advantages of the RBT.

  • PDF

Data-Mining Bootstrap Procedure with Potential Predictors in Forecasting Models: Evidence from Eight Countries in the Asia-Pacific Stock Markets

  • Lee, Hojin
    • East Asian Economic Review
    • /
    • v.23 no.4
    • /
    • pp.333-351
    • /
    • 2019
  • We use a data-mining bootstrap procedure to investigate the predictability test in the eight Asia-Pacific regional stock markets using in-sample and out-of-sample forecasting models. We address ourselves to the data-mining bias issues by using the data-mining bootstrap procedure proposed by Inoue and Kilian and applied to the US stock market data by Rapach and Wohar. The empirical findings show that stock returns are predictable not only in-sample but out-of-sample in Hong Kong, Malaysia, Singapore, and Korea with a few exceptions for some forecasting horizons. However, we find some significant disparity between in-sample and out-of-sample predictability in the Korean stock market. For Hong Kong, Malaysia, and Singapore, stock returns have predictable components both in-sample and out-of-sample. For the US, Australia, and Canada, we do not find any evidence of return predictability in-sample and out-of-sample with a few exceptions. For Japan, stock returns have a predictable component with price-earnings ratio as a forecasting variable for some out-of-sample forecasting horizons.

Use of Information Component (IC) and Relative Risk (RR) for Signal Detection of Drug Interactions of Clopidogrel : Data-mining Study Using Health Insurance Review & Assessment Service (HIRA) Claims Database (정보 성분과 상대위험도를 이용한 clopidogrel의 약물상호작용 시그널 검색 : 건강보험데이터베이스를 대상으로 한 데이터마이닝 연구)

  • Kim, Jin-Hyung;Choi, Chung-Am;Oh, Jung-Mi;Son, Sung-Ho;Shin, Wan-Gyoon
    • Korean Journal of Clinical Pharmacy
    • /
    • v.21 no.2
    • /
    • pp.90-99
    • /
    • 2011
  • Health Insurance Review & Assessment Service (HIRA) claims database has a high potential to detect signals of new drug interactions. The aim of this study was to evaluate the usefulness of information component (IC) and relative risk (RR) as a tool for signal detection, and to analyze the possible drug interactions caused by clopidogrel using HIRA claims database. This study was performed in elderly patients over 65 years of age who administered clopidogrel from January 2005 to June 2006 in South Korea. Serious Adverse Events (SAEs) as drug interactions of clopidogrel were defined as any ambulatory hospitalization for ischemic diseases within comcomitant medication period of clopidogrel. Information Component (IC) and Relative Risk (RR) were calculated to compare the proportion of drug-SAE pairs in order to select drug specific SAEs. IC and RR signals of clopidogrel drug interaction were screened when IC's 95% confidence interval was greater than 0 and RR's 95% confidence interval was greater than 1 respectively. All detected signals were compared to references such as $Micromedex^{(R)}$ and 2010 Drug Interaction $Facts^{TM}$. Sensitivity, specificity, positive predicted value and negative predicted value were used to evaluate usefulness of this method. Among 13,252,930 cases of elderly patients who co-administered clopidogrel and other drugs, 47,485 cases were detected as SAE. Of these, one-hundred nine cases were detected by the IC-based data-mining approach and ninety one cases were detected by the RR-based data-mining approach. Total One-hundred sixty three unrecognized signals were detected by IC or RR. Twelve signals from IC-based data-mining (57.1%) were corresponded with drug interactions from references and eight signals from RR-based data-mining (38.1%) were corresponded with drug interactions from references. These signals include proton pump inhibitors, calcium channel blockers and HMG CoA reductase Inhibitors, which were known to affect CYP450 metabolism. Further studies using HIRA claims database are necessary to develop appropriate data-mining measure.

Analysis of Injuries in the Ghanaian Mining Industry and Priority Areas for Research

  • Stemn, Eric
    • Safety and Health at Work
    • /
    • v.10 no.2
    • /
    • pp.151-165
    • /
    • 2019
  • Background: Despite improvements in safety performance, the number and severity of mining-related injuries remain high and unacceptable, indicating that further reduction can be achieved. This study examines occupational accident statistics of the Ghanaian mining industry and identifies priority areas, warranting intervention measures and further investigations. Methods: A total of 202 fatal and nonfatal injury reports over a 10-year period were obtained from five mines and the Inspectorate Division of the Minerals Commission of Ghana, and they were analyzed. Results: Results of the analyses show that the involvement of mining equipment, the task being performed, the injury type, and the mechanism of injury remain as priorities. For instance, mining equipment was associated with 85% of all injuries and 90% of all fatalities, with mobile equipment, component/part, and hand tools being the leading equipment types. In addition, mechanics/repairmen, truck operators, and laborers were the most affected ones, and the most dangerous activities included maintenance, operating mobile equipment, and clean up/clearing. Conclusion: Results of this analysis will enable authorities of mines to develop targeted interventions to improve their safety performance. To improve the safety of the mines, further research and prevention efforts are recommended.

Mining Association Rules of Credit Card Delinquency of Bank Customers in Large Databases

  • Lee, Young-Chan;Shin, Soo-Il
    • Journal of Intelligence and Information Systems
    • /
    • v.9 no.2
    • /
    • pp.135-154
    • /
    • 2003
  • Credit scoring system (CSS) starts from an analysis of delinquency trend of each individual or industry. This paper conducts a research on credit card delinquency of bank customers as a preliminary step for building effective credit scoring system to prevent excess loan or bad credit status. To serve this purpose, we use association rules as a rule generating data mining technique. Specifically, we generate sets of rules of customers who are in bad credit status because of delinquency by association rule mining. We expect that the sets of rules generated by association rule mining could act as an estimator of good or bad credit status classifier and basic component of early warning system.

  • PDF