• Title/Summary/Keyword: data partition

Search Result 415, Processing Time 0.035 seconds

Near infrared spectroscopy for classification of apples using K-mean neural network algorism

  • Muramatsu, Masahiro;Takefuji, Yoshiyasu;Kawano, Sumio
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1131-1131
    • /
    • 2001
  • To develop a nondestructive quality evaluation technique of fruits, a K-mean algorism is applied to near infrared (NIR) spectroscopy of apples. The K-mean algorism is one of neural network partition methods and the goal is to partition the set of objects O into K disjoint clusters, where K is assumed to be known a priori. The algorism introduced by Macqueen draws an initial partition of the objects at random. It then computes the cluster centroids, assigns objects to the closest of them and iterates until a local minimum is obtained. The advantage of using neural network is that the spectra at the wavelengths having absorptions against chemical bonds including C-H and O-H types can be selected directly as input data. In conventional multiple regression approaches, the first wavelength is selected manually around the absorbance wavelengths as showing a high correlation coefficient between the NIR $2^{nd}$ derivative spectrum and Brix value with a single regression. After that, the second and following wavelengths are selected statistically as the calibration equation shows a high correlation. Therefore, the second and following wavelengths are selected not in a NIR spectroscopic way but in a statistical way. In this research, the spectra at the six wavelengths including 900, 904, 914, 990, 1000 and 1016nm are selected as input data for K-mean analysis. 904nm is selected because the wavelength shows the highest correlation coefficients and is regarded as the absorbance wavelength. The others are selected because they show relatively high correlation coefficients and are revealed as the absorbance wavelengths against the chemical structures by B. G. Osborne. The experiment was performed with two phases. In first phase, a reflectance was acquired using fiber optics. The reflectance was calculated by comparing near infrared energy reflected from a Teflon sphere as a standard reference, and the $2^{nd}$ derivative spectra were used for K-mean analysis. Samples are intact 67 apples which are called Fuji and cultivated in Aomori prefecture in Japan. In second phase, the Brix values were measured with a commercially available refractometer in order to estimate the result of K-mean approach. The result shows a partition of the spectral data sets of 67 samples into eight clusters, and the apples are classified into samples having high Brix value and low Brix value. Consequently, the K-mean analysis realized the classification of apples on the basis of the Brix values.

  • PDF

Statistical Information-Based Hierarchical Fuzzy-Rough Classification Approach (통계적 정보기반 계층적 퍼지-러프 분류기법)

  • Son, Chang-S.;Seo, Suk-T.;Chung, Hwan-M.;Kwon, Soon-H.
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.6
    • /
    • pp.792-798
    • /
    • 2007
  • In this paper, we propose a hierarchical fuzzy-rough classification method based on statistical information for maximizing the performance of pattern classification and reducing the number of rules without learning approaches such as neural network, genetic algorithm. In the proposed method, statistical information is used for extracting the partition intervals of antecedent fuzzy sets at each layer on hierarchical fuzzy-rough classification systems and rough sets are used for minimizing the number of fuzzy if-then rules which are associated with the partition intervals extracted by statistical information. To show the effectiveness of the proposed method, we compared the classification results(e.g. the classification accuracy and the number of rules) of the proposed with those of the conventional methods on the Fisher's IRIS data. From the experimental results, we can confirm the fact that the proposed method considers only statistical information of the given data is similar to the classification performance of the conventional methods.

Transport Characteristics of Alcohol Solutes through Copolymer Hydrogel Membranes Containing Poly(2-Hydroxyethylmethacrylate) (Poly(2-Hydroxyethylmethacrylate)를 포함한 공중합체 수화겔막에 대한 알콜용질의 투과특성)

  • Park, Yu Mi;Kim, Eun Sik;Seong, Yong Gil
    • Journal of the Korean Chemical Society
    • /
    • v.34 no.4
    • /
    • pp.377-383
    • /
    • 1990
  • Three kinds of hydrogel membranes were prepared by the copolymerization of 2-hydroxyethylmethacrylate (HEMA) with acrylamide, N, N-dimethylamide and methylmethacrylate in the presence of solvent and crosslinker respectively. The equilibrium water content, relative permeability and partition coefficient of the membranes for alcohol solutes were measured. It has been found that the permeation of organic solute occurs through the water-filled regions in the hydrogel membrane, and that the gpermeability coefficient of organic solute depends on the molecular size. But the permeability of organic solute was controlled by the interaction of solute-membrane at the low water content. By the partition data, it has been shown that the partition of solute is only controlled by hydrophobic interaction between solute and membrane. The diffusion coefficient data were interpreted on the basis of water-solute interaction. It has been found that the diffusion of organic solute is determined by the free volume of water in the membrane, and that hardly depends on polarity-polarizability and hydrogen bonding ability between water and solute.

  • PDF

Characteristics of Fuzzy Inference Systems by Means of Partition of Input Spaces in Nonlinear Process (비선형 공정에서의 입력 공간 분할에 의한 퍼지 추론 시스템의 특성 분석)

  • Park, Keon-Jun;Lee, Dong-Yoon
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.3
    • /
    • pp.48-55
    • /
    • 2011
  • In this paper, we analyze the input-output characteristics of fuzzy inference systems according to the division of entire input spaces and the fuzzy reasoning methods to identify the fuzzy model for nonlinear process. And fuzzy model is expressed by identifying the structure and parameters of the system by means of input variables, fuzzy partition of input spaces, and consequence polynomial functions. In the premise part of the rules Min-Max method using the minimum and maximum values of input data set and C-Means clustering algorithm forming input data into the hard clusters are used for identification of fuzzy model and membership function is used as a series of triangular membership function. In the consequence part of the rules fuzzy reasoning is conducted by two types of inferences. The identification of the consequence parameters, namely polynomial coefficients, of the rules are carried out by the standard least square method. And lastly, we use gas furnace process which is widely used in nonlinear process and we evaluate the performance for this nonlinear process.

A Path Partitioning Technique for Indexing XML Data (XML 데이타 색인을 위한 경로 분할 기법)

  • 김종익;김형주
    • Journal of KIISE:Databases
    • /
    • v.31 no.3
    • /
    • pp.320-330
    • /
    • 2004
  • Query languages for XML use paths in a data graph to represent queries. Actually, paths in a data graph are used as a basic constructor of an XML query. User can write more expressive Queries by using Patterns (e.g. regular expressions) for paths. There are many identical paths in a data graph because of the feature of semi-structured data. Current researches for indexing XML utilize identical paths in a data graph, but such an index can grow larger than source data graph and cannot guarantee efficient access path. In this paper we propose a partitioning technique that can partition all the paths in a data graph. We develop an index graph that can find appropriate partitions for a path query efficiently. The size of our index graph can be adjusted regardless of the source data. So, we can significantly improve the cost for index graph traversals. In the performance study, we show our index much faster than other graph based indexes.

A Physiologically Based Pharmacokinetic Model for Absorption and Distribution of Imatinib in Human Body

  • Chowdhury, Mohammad Mahfuz;Kim, Do-Hyun;Ahn, Jeong-Keun
    • Bulletin of the Korean Chemical Society
    • /
    • v.32 no.11
    • /
    • pp.3967-3972
    • /
    • 2011
  • A whole body physiologically based pharmacokinetic (PBPK) model was applied to investigate absorption, distribution, and physiologic variations on pharmacokinetics of imatinib in human body. Previously published pharmacokinetic data of the drug after intravenous (i.v.) infusion and oral administration were simulated by the PBPK model. Oral dose absorption kinetics were analyzed by adopting a compartmental absorption and transit model in gut section. Tissue/plasma partition coefficients of drug after i.v. infusion were also used for oral administration. Sensitivity analysis of the PBPK model was carried out by taking parameters that were commonly subject to variation in human. Drug concentration in adipose tissue was found to be higher than those in other tissues, suggesting that adipose tissue plays a role as a storage tissue for the drug. Variations of metabolism in liver, body weight, and blood/plasma partition coefficient were found to be important factors affecting the plasma concentration profile of drug in human body.

A Study on Preference for telecommuting Center design Criteria (텔레커뮤팅 센터의 실내공간계획요소에 대한 선호 조사 연구)

  • 하미경;권미연
    • Korean Institute of Interior Design Journal
    • /
    • no.20
    • /
    • pp.91-97
    • /
    • 1999
  • Telecommuting becomes a new form of work according to the development of information technology. Therefore, the purpose of this study is to provide basic data for the interior space planning of telecommuting centers by means of surveying office workers' opinions. The major findings of this research are as follows. The opinion about whether to use telecommuting center if provided is showed highly positively. In the matter of space type of telecommuting center, 'mixing type I (open plan office but division with high partition)' is the most preferred, the next, is 'closed type'. The most preferred type of workstation is 'independent type', the next is 'X type' and the third is 'link type'. Preferred partition height is '1,300-1,500mm'. When planning telecommuting center, the most important element of space plan is 'size and layout of workstation' and the second is 'private space for confidential work'. In public workplace, the most important element 'refreshment space', and the second one is 'mailing system'.

  • PDF

A Study on Indexing Method using Text Partition (텍스트분할에 의한 색인방법 연구)

  • 강무영;이상구
    • Journal of the Korean Society for information Management
    • /
    • v.16 no.4
    • /
    • pp.75-94
    • /
    • 1999
  • Indexing is a prerequisite function for the information retrieval system in order to retrieve the information of the documents effectively which are saved in database. As a digital data increases in accordance with the development of a computer, the numbers of literatures to be saved in database have also been increased in a large volume. To retrieve such documents of large volume, a lot of system resources and processing time will be required. In this paper, we suggest a advanced indexing method using text partition. This method can retrieve the documents of large volume in short processing time. We applied this suggested indexing method to real information retrieval system, and proved its excellent functions through the demonstration.

  • PDF

A New Learning Algorithm for Neuro-Fuzzy Modeling Using Self-Constructed Clustering

  • Kim, Sung-Suk;Kwak, Keun-Chang;Kim, Sung-Soo;Ryu, Jeong-Woong
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2005.06a
    • /
    • pp.1254-1259
    • /
    • 2005
  • In this paper, we proposed a learning algorithm for the neuro-fuzzy modeling using a learning rule to adapt clustering. The proposed algorithm includes the data partition, assigning the rule into the process of partition, and optimizing the parameters using predetermined threshold value in self-constructing algorithm. In order to improve the clustering, the learning method of neuro-fuzzy model is extended and the learning scheme has been modified such that the learning of overall model is extended based on the error-derivative learning. The effect of the proposed method is presented using simulation compare with previous ones.

  • PDF

Convective Heat Transfer in Ventilated Space wit=h Various Partitions

  • Bae, Kangyoul;Chung, Hanshik;Jeong, Hyomin
    • Journal of Mechanical Science and Technology
    • /
    • v.16 no.5
    • /
    • pp.676-682
    • /
    • 2002
  • The laminar convective heat transfer in ventilated space with various horizontal partitions was studied numerically and experimentally For the numerical study, the governing equations were solved by using a finite volume method for various numbers Re, Gr, Pr and partition numbers. The experimental study was conducted by using a holographic interferometer. The isotherms and velocity vectors have been presented for various parameters. As the number and length of partition increased, convective heat transfer decreased. Based on the numerical data, correlation equations were obtained for the mean Nusselt number in term of Gr/Re$^2$. In the region of Gr/Re$^2$$\leq$ 1, the mean Nusselt number was small, but in the region of Gr/Re$^2$> 1, the mean Nusselt number was constant.