• 제목/요약/키워드: Data Input Approach

검색결과 806건 처리시간 0.03초

Quadratic Loss Support Vector Interval Regression Machine for Crisp Input-Output Data

  • Hwang, Chang-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • 제15권2호
    • /
    • pp.449-455
    • /
    • 2004
  • Support vector machine (SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate interval regression models for crisp input-output data. The proposed method is based on quadratic loss SVM, which implements quadratic programming approach giving more diverse spread coefficients than a linear programming one. The proposed algorithm here is model-free method in the sense that we do not have to assume the underlying model function. Experimental result is then presented which indicate the performance of this algorithm.

  • PDF

데이터 바이닝을 이용한 로버스트 설계 모형의 최적화 (Optimization of Robust Design Model using Data Mining)

  • 정혜진;구본철
    • 산업경영시스템학회지
    • /
    • 제30권2호
    • /
    • pp.99-105
    • /
    • 2007
  • According to the automated manufacturing processes followed by the development of computer manufacturing technologies, products or quality characteristics produced on the processes have measured and recorded automatically. Much amount of data daily produced on the processes may not be efficiently analyzed by current statistical methodologies (i.e., statistical quality control and statistical process control methodologies) because of the dimensionality associated with many input and response variables. Although a number of statistical methods to handle this situation, there is room for improvement. In order to overcome this limitation, we integrated data mining and robust design approach in this research. We find efficiently the significant input variables that connected with the interesting response variables by using the data mining technique. And we find the optimum operating condition of process by using RSM and robust design approach.

산업 R&D 성과의 시간지연에 관한 분석 (A Study on the Time-lag of Industrial R&D Output)

  • 이재하;권철신
    • 기술혁신연구
    • /
    • 제7권1호
    • /
    • pp.176-186
    • /
    • 1999
  • This paper starts out by reviewing the literature that in different ways utilizes patent data as an output of Research & Development (R&D) investment. The main focus, however, is an analysis of time-lag between industrial R&D input and its output. To achieve this research's purpose, the basic data associated with the industrial R&D input (expenditure, researchers) and output (applied patent and utilities) for the past 15 years, from 1980 to 1994, in the areas of electrical-electronic, mechanical and chemical industries have been collected. And the raw input data were altered into real flow data (but stock data) using Laspeyres approach and analyzed using multiple regression analysis, especially stepwise regression analysis. The result of this study can be summarized as follows: a) The time-lag; between industrial R&D input and its output is within 1 to 3 years. b) The time-lag: of patents was longer than that of utility models. c) The time-lag: in electrical-electronic, chemical industry was longer than that of the mechanical industry.

  • PDF

An Approach to Applying Multiple Linear Regression Models by Interlacing Data in Classifying Similar Software

  • Lim, Hyun-il
    • Journal of Information Processing Systems
    • /
    • 제18권2호
    • /
    • pp.268-281
    • /
    • 2022
  • The development of information technology is bringing many changes to everyday life, and machine learning can be used as a technique to solve a wide range of real-world problems. Analysis and utilization of data are essential processes in applying machine learning to real-world problems. As a method of processing data in machine learning, we propose an approach based on applying multiple linear regression models by interlacing data to the task of classifying similar software. Linear regression is widely used in estimation problems to model the relationship between input and output data. In our approach, multiple linear regression models are generated by training on interlaced feature data. A combination of these multiple models is then used as the prediction model for classifying similar software. Experiments are performed to evaluate the proposed approach as compared to conventional linear regression, and the experimental results show that the proposed method classifies similar software more accurately than the conventional model. We anticipate the proposed approach to be applied to various kinds of classification problems to improve the accuracy of conventional linear regression.

Data-based Stability Analysis for MIMO Linear Time-invariant Discrete-time Systems

  • Park, Un-Sik;Ikeda, Masao
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2005년도 ICCAS
    • /
    • pp.680-684
    • /
    • 2005
  • This paper presents a data-based stability analysis of a MIMO linear time-invariant discrete-time system, as an extension of the previous results for a SISO system. In the MIMO case, a similar discussion as in the case of a SISO system is also applied, except that an augmented input and output space is considered whose dimension is determined in relation to both the orders of the input and output vectors and the numbers of inputs and outputs. As certain subspaces of the input and output space, both output data space and closed-loop data space are defined, which contain all the behaviors of a system, respectively, with zero input in open-loop and with a control input in closed-loop. Then, we can derive the data-based stability conditions, in which the open-loop stability can be checked by using a data matrix whose column vectors span the output data space and the closed-loop stability can also be checked by using a data matrix whose column vectors span the closed-loop data space.

  • PDF

Neural and MTS Algorithms for Feature Selection

  • Su, Chao-Ton;Li, Te-Sheng
    • International Journal of Quality Innovation
    • /
    • 제3권2호
    • /
    • pp.113-131
    • /
    • 2002
  • The relationships among multi-dimensional data (such as medical examination data) with ambiguity and variation are difficult to explore. The traditional approach to building a data classification system requires the formulation of rules by which the input data can be analyzed. The formulation of such rules is very difficult with large sets of input data. This paper first describes two classification approaches using back-propagation (BP) neural network and Mahalanobis distance (MD) classifier, and then proposes two classification approaches for multi-dimensional feature selection. The first one proposed is a feature selection procedure from the trained back-propagation (BP) neural network. The basic idea of this procedure is to compare the multiplication weights between input and hidden layer and hidden and output layer. In order to simplify the structure, only the multiplication weights of large absolute values are used. The second approach is Mahalanobis-Taguchi system (MTS) originally suggested by Dr. Taguchi. The MTS performs Taguchi's fractional factorial design based on the Mahalanobis distance as a performance metric. We combine the automatic thresholding with MD: it can deal with a reduced model, which is the focus of this paper In this work, two case studies will be used as examples to compare and discuss the complete and reduced models employing BP neural network and MD classifier. The implementation results show that proposed approaches are effective and powerful for the classification.

국부 유사사상의 퍼지통합에 기반한 비선형사상의 식별 (Identification of Nonlinear Mapping based on Fuzzy Integration of Local Affine Mappings)

  • 최진영;최종호
    • 전자공학회논문지B
    • /
    • 제32B권5호
    • /
    • pp.812-820
    • /
    • 1995
  • This paper proposes an approach of identifying nonlinear mappings from input/output data. The approach is based on the universal approximation by the fuzzy integration of local affine mappings. A connectionist model realizing the universal approximator is suggested by using a processing unit based on both the radial basis function and the weighted sum scheme. In addition, a learning method with self-organizing capability is proposed for the identifying of nonlinear mapping relationships with the given input/output data. To show the effectiveness of our approach, the proposed model is applied to the function approximation and the prediction of Mackey-Glass chaotic time series, and the performances are compared with other approaches.

  • PDF

A 3D TEXTURE SYNTHESIS APPROACH

  • Su, Ya-Lin;Chang, Chin-Chen;Shih, Zen-Chung
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2009년도 IWAIT
    • /
    • pp.28-31
    • /
    • 2009
  • In this paper, a new approach for solid texture synthesis from input volume data is presented. In the pre-process, feature vectors and a similarity set were constructed for input volume data. The feature vectors were used to construct neighboring vectors for more accurate neighborhood matching. The similarity set which recorded 3 candidates for each voxel helped more effective neighborhood matching. In the synthesis process, the pyramid synthesis method was used to synthesize solid textures from coarse to fine level. The results of the proposed approach were satisfactory.

  • PDF

A Network Partition Approach for MFD-Based Urban Transportation Network Model

  • Xu, Haitao;Zhang, Weiguo;zhuo, Zuozhang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제14권11호
    • /
    • pp.4483-4501
    • /
    • 2020
  • Recent findings identified the scatter and shape of MFD (macroscopic fundamental diagram) is heavily influenced by the spatial distribution of link density in a road network. This implies that the concept of MFD can be utilized to divide a heterogeneous road network with different degrees of congestion into multiple homogeneous subnetworks. Considering the actual traffic data is usually incomplete and inaccurate while most traffic partition algorithms rely on the completeness of the data, we proposed a three-step partitioned algorithm called Iso-MB (Isoperimetric algorithm - Merging - Boundary adjustment) permitting of incompletely input data in this paper. The proposed algorithm was implemented and verified in a simulated urban transportation network. The existence of well-defined MFD in each subnetwork was revealed and discussed and the selection of stop parameter in the isoperimetric algorithm was explained and dissected. The effectiveness of the approach to the missing input data was also demonstrated and elaborated.

항만투자의 유효성 분석 - congestion모형 접근 - (A Study on the Effectiveness of Port Investment: Congestion Model Approach)

  • 박노경
    • 한국항만경제학회:학술대회논문집
    • /
    • 한국항만경제학회 2003년도 정기학술대회지
    • /
    • pp.221-242
    • /
    • 2003
  • The purpose of this paper is to investigate the effectiveness of port investment which is one of the important elements for measuring the port efficiency by using congestion approach of DEA(Data Envelopment Analysis). Congestion is said to be present when increases in inputs result in ouput reductions. Congestion approach takes the forms of weak input disposability and strong input disposability. Empirical analysis by using congestion approach in this paper identified inefficiencies in the inputs including port investment, and indicated inefficient ports like the ports of Sokcho, Gunsan, Pohang, and Seoguipo which shows the large amount of slacks with congestion especially in terms of port investment. Therefore these ports should examine the reason about the inefficiency of port investment. The main policy implication based on the findings of this study is that The Ministry of Maritime Affairs & Fisheries in Korea should introduce congestion approach when the amount of port investment to each port is decided.

  • PDF