• Title/Summary/Keyword: Standard Dataset

Search Result 195, Processing Time 0.025 seconds

Efficient Transformer Dissolved Gas Analysis and Classification Method (효율적인 변압기 유중가스 분석 및 분류 방법)

  • Cho, Yoon-Jeong;Kim, Jae-Young;Kim, Jong-Myon
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.8 no.3
    • /
    • pp.563-570
    • /
    • 2018
  • This paper proposes an efficient dissolved gas analysis(DGA) and classification method of an oil-filled transformer using machine learning algorithms to solve problems inherent in IEC 60599. In IEC 60599, a certain diagnosis criteria do not exist, and duplication area is existed. Thus, it is difficult to make a decision without any experts since the IEC 60599 standard can not support analysis and classification of gas date of a power transformer in that criteria. To address these issue. we propose a dissolved gas analysis(DGA) and classification method using a machine learning algorithm. We evaluate the performance of the proposed method using support vector machines with dissolved gas dataset extracted from a power transformer in the real industry. To validate the performance of the proposed method, we compares the proposed method with the IEC 60599 standard. Experimental results show that the proposed method outperforms the IEC 60599 in the classification accuracy.

Assessing reproductive performance and predictive models for litter size in Landrace sows under tropical conditions

  • Praew Thiengpimol;Skorn Koonawootrittriron;Thanathip Suwanasopee
    • Animal Bioscience
    • /
    • v.37 no.8
    • /
    • pp.1333-1344
    • /
    • 2024
  • Objective: Litter size and piglet loss at birth significantly impact piglet production and are closely associated with sow parity. Understanding how these traits vary across different parities is crucial for effective herd management. This study investigates the patterns of the number of born alive piglets (NBA), number of piglet losses (NPL), and the proportion of piglet losses (PPL) at birth in Landrace sows under tropical conditions. Additionally, it aims to identify the most suitable model for describing these patterns. Methods: A dataset comprising 2,322 consecutive reproductive records from 258 Landrace sows, spanning parities from 1 to 9, was analyzed. Modeling approaches including 2nd and 3rd degree polynomial models, the Wood gamma function, and a longitudinal model were applied at the individual level to predict NBA, NPL, and PPL. The choice of the best-fitting model was determined based on the lowest mean and standard deviation of the difference between predicted and actual values, Akaike information criterion (AIC), and Bayesian information criterion (BIC). Results: Sow parity significantly influenced NBA, NPL, and PPL (p<0.0001). NBA increased until the 4th parity and then declined. In contrast, NPL and PPL decreased until the 2nd parity and then steadily increased until the 8th parity. The 2nd and 3rd degree polynomials, and longitudinal models showed no significant differences in predicting NBA, NPL, and PPL (p>0.05). The 3rd degree polynomial model had the lowest prediction standard deviation and yielded the smallest AIC and BIC. Conclusion: The 3rd degree polynomial model offers the most suitable description of NBA, NPL, and PPL patterns. It holds promise for applications in genetic evaluations to enhance litter size and reduce piglet loss at birth in sows. These findings highlight the importance of accounting for sow parity effects in swine breeding programs, particularly in tropical conditions, to optimize piglet production and sow performance.

Multiple Instance Mamdani Fuzzy Inference

  • Khalifa, Amine B.;Frigui, Hichem
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.15 no.4
    • /
    • pp.217-231
    • /
    • 2015
  • A novel fuzzy learning framework that employs fuzzy inference to solve the problem of Multiple Instance Learning (MIL) is presented. The framework introduces a new class of fuzzy inference systems called Multiple Instance Mamdani Fuzzy Inference Systems (MI-Mamdani). In multiple instance problems, the training data is ambiguously labeled. Instances are grouped into bags, labels of bags are known but not those of individual instances. MIL deals with learning a classifier at the bag level. Over the years, many solutions to this problem have been proposed. However, no MIL formulation employing fuzzy inference exists in the literature. Fuzzy logic is powerful at modeling knowledge uncertainty and measurements imprecision. It is one of the best frameworks to model vagueness. However, in addition to uncertainty and imprecision, there is a third vagueness concept that fuzzy logic does not address quiet well, yet. This vagueness concept is due to the ambiguity that arises when the data have multiple forms of expression, this is the case for multiple instance problems. In this paper, we introduce multiple instance fuzzy logic that enables fuzzy reasoning with bags of instances. Accordingly, a MI-Mamdani that extends the standard Mamdani inference system to compute with multiple instances is introduced. The proposed framework is tested and validated using a synthetic dataset suitable for MIL problems. Additionally, we apply the proposed multiple instance inference to fuse the output of multiple discrimination algorithms for the purpose of landmine detection using Ground Penetrating Radar.

Analyzing Errors in Bilingual Multi-word Lexicons Automatically Constructed through a Pivot Language

  • Seo, Hyeong-Won;Kim, Jae-Hoon
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.39 no.2
    • /
    • pp.172-178
    • /
    • 2015
  • Constructing a bilingual multi-word lexicon is confronted with many difficulties such as an absence of a commonly accepted gold-standard dataset. Besides, in fact, there is no everybody's definition of what a multi-word unit is. In considering these problems, this paper evaluates and analyzes the context vector approach which is one of a novel alignment method of constructing bilingual lexicons from parallel corpora, by comparing with one of general methods. The approach builds context vectors for both source and target single-word units from two parallel corpora. To adapt the approach to multi-word units, we identify all multi-word candidates (namely noun phrases in this work) first, and then concatenate them into single-word units. As a result, therefore, we can use the context vector approach to satisfy our need for multi-word units. In our experimental results, the context vector approach has shown stronger performance over the other approach. The contribution of the paper is analyzing the various types of errors for the experimental results. For the future works, we will study the similarity measure that not only covers a multi-word unit itself but also covers its constituents.

A Hybrid Query Disambiguation Adaptive Approach for Web Information Retrieval

  • Ibrahim, Roliana;Kamal, Shahid;Ghani, Imran;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.7
    • /
    • pp.2468-2487
    • /
    • 2015
  • In web searching, trustable and precise results are greatly affected by the inherent uncertainty in the input queries. Queries submitted to search engines are by nature ambiguous and constitute a significant proportion of the instances given to web search engines. Ambiguous queries pose real challenges for the web search engines due to versatility of information. Temporal based approaches whereas somehow reduce the uncertainty in queries but still lack to provide results according to users aspirations. Web search science has created an interest for the researchers to incorporate contextual information for resolving the uncertainty in search results. In this paper, we propose an Adaptive Disambiguation Approach (ADA) of hybrid nature that makes use of both the temporal and contextual information to improve user experience. The proposed hybrid approach presents the search results to the users based on their location and temporal information. A Java based prototype of the systems is developed and evaluated using standard dataset to determine its efficacy in terms of precision, accuracy, recall, and F1-measure. Supported by experimental results, ADA demonstrates better results along all the axes as compared to temporal based approaches.

Application of expert systems in prediction of flexural strength of cement mortars

  • Gulbandilar, Eyyup;Kocak, Yilmaz
    • Computers and Concrete
    • /
    • v.18 no.1
    • /
    • pp.1-16
    • /
    • 2016
  • In this study, an Artificial Neural Network (ANN) and Adaptive Network-based Fuzzy Inference Systems (ANFIS) prediction models for flexural strength of the cement mortars have been developed. For purpose of constructing this models, 12 different mixes with 144 specimens of the 2, 7, 28 and 90 days flexural strength experimental results of cement mortars containing pure Portland cement (PC), blast furnace slag (BFS), waste tire rubber powder (WTRP) and BFS+WTRP used in training and testing for ANN and ANFIS were gathered from the standard cement tests. The data used in the ANN and ANFIS models are arranged in a format of four input parameters that cover the Portland cement, BFS, WTRP and age of samples and an output parameter which is flexural strength of cement mortars. The ANN and ANFIS models have produced notable excellent outputs with higher coefficients of determination of $R^2$, RMS and MAPE. For the testing of dataset, the $R^2$, RMS and MAPE values for the ANN model were 0.9892, 0.1715 and 0.0212, respectively. Furthermore, the $R^2$, RMS and MAPE values for the ANFIS model were 0.9831, 0.1947 and 0.0270, respectively. As a result, in the models, the training and testing results indicated that experimental data can be estimated to a superior close extent by the ANN and ANFIS models.

Improving CMD Areal Density Analysis: Algorithms and Strategies

  • Wilson, R.E.
    • Journal of Astronomy and Space Sciences
    • /
    • v.31 no.2
    • /
    • pp.121-130
    • /
    • 2014
  • Essential ideas, successes, and difficulties of Areal Density Analysis (ADA) for color-magnitude diagrams (CMD's) of resolved stellar populations are examined, with explanation of various algorithms and strategies for optimal performance. A CMD-generation program computes theoretical datasets with simulated observational error and a solution program inverts the problem by the method of Differential Corrections (DC) so as to compute parameter values from observed magnitudes and colors, with standard error estimates and correlation coefficients. ADA promises not only impersonal results, but also significant saving of labor, especially where a given dataset is analyzed with several evolution models. Observational errors and multiple star systems, along with various single star characteristics and phenomena, are modeled directly via the Functional Statistics Algorithm (FSA). Unlike Monte Carlo, FSA is not dependent on a random number generator. Discussions include difficulties and overall requirements, such as need for fast evolutionary computation and realization of goals within machine memory limits. Degradation of results due to influence of pixelization on derivatives, Initial Mass Function (IMF) quantization, IMF steepness, low Areal Densities ($\mathcal{A}$), and large variation in $\mathcal{A}$ are reduced or eliminated through a variety of schemes that are explained sufficiently for general application. The Levenberg-Marquardt and MMS algorithms for improvement of solution convergence are contained within the DC program. An example of convergence, which typically is very good, is shown in tabular form. A number of theoretical and practical solution issues are discussed, as are prospects for further development.

Feature Weighting in Projected Clustering for High Dimensional Data (고차원 데이타에 대한 투영 클러스터링에서 특성 가중치 부여)

  • Park, Jong-Soo
    • Journal of KIISE:Databases
    • /
    • v.32 no.3
    • /
    • pp.228-242
    • /
    • 2005
  • The projected clustering seeks to find clusters in different subspaces within a high dimensional dataset. We propose an algorithm to discover near optimal projected clusters without user specified parameters such as the number of output clusters and the average cardinality of subspaces of projected clusters. The objective function of the algorithm computes projected energy, quality, and the number of outliers in each process of clustering. In order to minimize the projected energy and to maximize the quality in clustering, we start to find best subspace of each cluster on the density of input points by comparing standard deviations of the full dimension. The weighting factor for each dimension of the subspace is used to get id of probable error in measuring projected distances. Our extensive experiments show that our algorithm discovers projected clusters accurately and it is scalable to large volume of data sets.

Locally-Weighted Polynomial Neural Network for Daily Short-Term Peak Load Forecasting

  • Yu, Jungwon;Kim, Sungshin
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.16 no.3
    • /
    • pp.163-172
    • /
    • 2016
  • Electric load forecasting is essential for effective power system planning and operation. Complex and nonlinear relationships exist between the electric loads and their exogenous factors. In addition, time-series load data has non-stationary characteristics, such as trend, seasonality and anomalous day effects, making it difficult to predict the future loads. This paper proposes a locally-weighted polynomial neural network (LWPNN), which is a combination of a polynomial neural network (PNN) and locally-weighted regression (LWR) for daily shortterm peak load forecasting. Model over-fitting problems can be prevented effectively because PNN has an automatic structure identification mechanism for nonlinear system modeling. LWR applied to optimize the regression coefficients of LWPNN only uses the locally-weighted learning data points located in the neighborhood of the current query point instead of using all data points. LWPNN is very effective and suitable for predicting an electric load series with nonlinear and non-stationary characteristics. To confirm the effectiveness, the proposed LWPNN, standard PNN, support vector regression and artificial neural network are applied to a real world daily peak load dataset in Korea. The proposed LWPNN shows significantly good prediction accuracy compared to the other methods.

Chinese Corporate Leverage Determinants

  • Ferrarini, Benno;Hinojales, Marthe;Scaramozzino, Pasquale
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.4 no.1
    • /
    • pp.5-18
    • /
    • 2017
  • Total debt in the People's Republic of China surged to nearly 290% as a ratio to GDP by the second quarter of 2016, mostly on account of non-financial corporate debt. The outpouring of credit to stem the impact of the global financial crisis accentuated industrial overcapacity in traditional sectors, such as steel, cement, and energy, while feeding asset bubbles in the property, equity and bond markets. At the Chinese corporate level, this has translated into weakened fundamentals and a fall in industrial profits, particularly of SOEs. As debtors struggle to service interest payments, non-performing loans (NPLs) have been on the rise. This paper assesses the financial fragility of the Chinese economy by looking at risk factors in the non-financial sector. We apply quantile regressions to a dataset containing all Chinese listed companies in Standard & Poor's IQ Capital database. We find higher sensitivity over time of corporate leverage to some of its key determinants, particularly for firms at the upper margin of the distribution. In particular, profitability increasingly acts as a curb on corporate leverage. At a time of falling profitability across the Chinese non-financial corporate sector, this eases the brake on leverage and may contribute to its continuing increase.