• Title/Summary/Keyword: Vector Analysis

Search Result 3,500, Processing Time 0.026 seconds

Clustering Analysis by Customer Feature based on SOM for Predicting Purchase Pattern in Recommendation System (추천시스템에서 구매 패턴 예측을 위한 SOM기반 고객 특성에 의한 군집 분석)

  • Cho, Young Sung;Moon, Song Chul;Ryu, Keun Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.2
    • /
    • pp.193-200
    • /
    • 2014
  • Due to the advent of ubiquitous computing environment, it is becoming a part of our common life style. And tremendous information is cumulated rapidly. In these trends, it is becoming a very important technology to find out exact information in a large data to present users. Collaborative filtering is the method based on other users' preferences, can not only reflect exact attributes of user but also still has the problem of sparsity and scalability, though it has been practically used to improve these defects. In this paper, we propose clustering method by user's features based on SOM for predicting purchase pattern in u-Commerce. it is necessary for us to make the cluster with similarity by user's features to be able to reflect attributes of the customer information in order to find the items with same propensity in the cluster rapidly. The proposed makes the task of clustering to apply the variable of featured vector for the user's information and RFM factors based on purchase history data. To verify improved performance of proposing system, we make experiments with dataset collected in a cosmetic internet shopping mall.

Development of Personalized Recommendation System using RFM method and k-means Clustering (RFM기법과 k-means 기법을 이용한 개인화 추천시스템의 개발)

  • Cho, Young-Sung;Gu, Mi-Sug;Ryu, Keun-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.6
    • /
    • pp.163-172
    • /
    • 2012
  • Collaborative filtering which is used explicit method in a existing recommedation system, can not only reflect exact attributes of item but also still has the problem of sparsity and scalability, though it has been practically used to improve these defects. This paper proposes the personalized recommendation system using RFM method and k-means clustering in u-commerce which is required by real time accessablity and agility. In this paper, using a implicit method which is is not used complicated query processing of the request and the response for rating, it is necessary for us to keep the analysis of RFM method and k-means clustering to be able to reflect attributes of the item in order to find the items with high purchasablity. The proposed makes the task of clustering to apply the variable of featured vector for the customer's information and calculating of the preference by each item category based on purchase history data, is able to recommend the items with efficiency. To estimate the performance, the proposed system is compared with existing system. As a result, it can be improved and evaluated according to the criteria of logicality through the experiment with dataset, collected in a cosmetic internet shopping mall.

Predicting Power Generation Patterns Using the Wind Power Data (풍력 데이터를 이용한 발전 패턴 예측)

  • Suh, Dong-Hyok;Kim, Kyu-Ik;Kim, Kwang-Deuk;Ryu, Keun-Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.11
    • /
    • pp.245-253
    • /
    • 2011
  • Due to the imprudent spending of the fossil fuels, the environment was contaminated seriously and the exhaustion problems of the fossil fuels loomed large. Therefore people become taking a great interest in alternative energy resources which can solve problems of fossil fuels. The wind power energy is one of the most interested energy in the new and renewable energy. However, the plants of wind power energy and the traditional power plants should be balanced between the power generation and the power consumption. Therefore, we need analysis and prediction to generate power efficiently using wind energy. In this paper, we have performed a research to predict power generation patterns using the wind power data. Prediction approaches of datamining area can be used for building a prediction model. The research steps are as follows: 1) we performed preprocessing to handle the missing values and anomalous data. And we extracted the characteristic vector data. 2) The representative patterns were found by the MIA(Mean Index Adequacy) measure and the SOM(Self-Organizing Feature Map) clustering approach using the normalized dataset. We assigned the class labels to each data. 3) We built a new predicting model about the wind power generation with classification approach. In this experiment, we built a forecasting model to predict wind power generation patterns using the decision tree.

Molecular cloning of casein gane which is expressed in mammary glands (유선조직에서 특이적으로 발현되는 카제인 유전자의 클로닝(I))

  • Choe, In-Ho;Bae, Bong-Jin;Lee, Chang-Su
    • Journal of the Society of Cosmetic Scientists of Korea
    • /
    • v.21 no.1
    • /
    • pp.53-66
    • /
    • 1995
  • The gene for ${\gamma}$-casein, a milk protein, is a member of a family of casein gene which is expressed in mammary glands of the animal during the late gestation and lactation periods binder the influence of various hormones. In order to elucidate tile mechanisms b)'which hormones regulate the coordinate induction of milk protein genes, the mouse ${\gamma}$-casein gene was isolated and characterized. The ${\gamma}$-casein gene was screened from a mouse genomic library constructed in bacteriophage EMBL3 with the ${\gamma}$-casein CDNA used as probe and one clone was obtained. The ${\gamma}$-casein CDNA as probe was partially sequenced and contained ATG start codon and 5'-noncoding region. The cloned genomic DNA was digested with Sal I restriction enzyme, by which the insert DNA can be isolated from EMBL3 vector. Three DNA bands were observed and the size of DNAs was approximately 28kb, 14kb and 9Kb, respectively Accordingly the size of the insert DNA was calculated with approximately 23Kb. The result of Southern blot analysis, however, showed that the cloned genomic DNA was not hybridized with the synthetic oligonucleotides (40 mer) of cDNA 5'-end region, but it was hybridized with the y -casein CDNA. This means that tile cloned y -casein gene may not contain its promoter region. The ${\gamma}$ -casein genomic DNA containing the promoter region has been screening from mouse genomic library with oligonucleotides of CDNA 5'-end region as probe, and twenty-nine clones was obtained.

  • PDF

Predicting Stock Liquidity by Using Ensemble Data Mining Methods

  • Bae, Eun Chan;Lee, Kun Chang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.6
    • /
    • pp.9-19
    • /
    • 2016
  • In finance literature, stock liquidity showing how stocks can be cashed out in the market has received rich attentions from both academicians and practitioners. The reasons are plenty. First, it is known that stock liquidity affects significantly asset pricing. Second, macroeconomic announcements influence liquidity in the stock market. Therefore, stock liquidity itself affects investors' decision and managers' decision as well. Though there exist a great deal of literature about stock liquidity in finance literature, it is quite clear that there are no studies attempting to investigate the stock liquidity issue as one of decision making problems. In finance literature, most of stock liquidity studies had dealt with limited views such as how much it influences stock price, which variables are associated with describing the stock liquidity significantly, etc. However, this paper posits that stock liquidity issue may become a serious decision-making problem, and then be handled by using data mining techniques to estimate its future extent with statistical validity. In this sense, we collected financial data set from a number of manufacturing companies listed in KRX (Korea Exchange) during the period of 2010 to 2013. The reason why we selected dataset from 2010 was to avoid the after-shocks of financial crisis that occurred in 2008. We used Fn-GuidPro system to gather total 5,700 financial data set. Stock liquidity measure was computed by the procedures proposed by Amihud (2002) which is known to show best metrics for showing relationship with daily return. We applied five data mining techniques (or classifiers) such as Bayesian network, support vector machine (SVM), decision tree, neural network, and ensemble method. Bayesian networks include GBN (General Bayesian Network), NBN (Naive BN), TAN (Tree Augmented NBN). Decision tree uses CART and C4.5. Regression result was used as a benchmarking performance. Ensemble method uses two types-integration of two classifiers, and three classifiers. Ensemble method is based on voting for the sake of integrating classifiers. Among the single classifiers, CART showed best performance with 48.2%, compared with 37.18% by regression. Among the ensemble methods, the result from integrating TAN, CART, and SVM was best with 49.25%. Through the additional analysis in individual industries, those relatively stabilized industries like electronic appliances, wholesale & retailing, woods, leather-bags-shoes showed better performance over 50%.

Evaluation of Optimal Condition for Recombinant Bacterial Ghost Vaccine Production with Four Different Antigens of Streptococcus iniae-enolase, GAPDH, sagA, piaA (연쇄구균증 항원-enolase, GAPDH, sagA, piaA에 대한 재조합 고스트 박테리아 백신의 생산 최적화)

  • Ra, Chae-Hun;Kim, Yeong-Jin;Son, Chang-Woo;Jung, Dae-Young;Kim, Sung-Koo
    • Journal of Life Science
    • /
    • v.19 no.7
    • /
    • pp.845-851
    • /
    • 2009
  • A vector harboring double cassettes; a heterologous gene expression cassette of pHCE-InaN-antigen and a ghost formation cassette of pAPR-cI-E lysis 37 SDM was constructed and introduced to E. coli DH5a. For the production of a bacterial ghost vaccine, bacterial ghosts from E. coli / Streptococcus iniae with four different types of antigens - enolase, GAPDH, sagA and piaA - were produced by the optimization of fermentation parameters such as a glucose concentration of 1 g/l, agitation of 300 rpm and aeration of 1 vvm. Efficiency of ghost bacteria formation was evaluated with cultures of OD$_{600}$=1.0, 2.0 and 3.0. The efficiency of the ghost bacteria formation was 99.54, 99.67, 99.99 and 99.99% with inductions at OD$_{600}$=3.0, 1.0, 2.0 and 1.0 for E. coli/S. iniae antigens enolase, piaA, GAPDH and sagA, respectively. Ghost bacteria as a vaccine was harvested by centrifugation. The antigen protein expressions were analyzed by SDS-PAGE and western blot analysis, and the molecular weights of the enolase, piaA, GAPDH and sagA were 78, 26, 67 and 26 kDa, respectively. The molecular weights of the expressed antigens were consistent with theoretical sizes obtained from the amino acid sequences.

A Newly Identified Glutaminase-Free L-Asparaginase (L-ASPG86) from the Marine Bacterium Mesoflavibacter zeaxanthinifaciens

  • Lee, Su-Jin;Lee, Youngdeuk;Park, Gun-Hoo;Umasuthan, Navaneethaiyer;Heo, Soo-Jin;Zoysa, Mahanama De;Jung, Won-Kyo;Lee, Dae-Won;Kim, Hanjun;Kang, Do-Hyung;Oh, Chulhong
    • Journal of Microbiology and Biotechnology
    • /
    • v.26 no.6
    • /
    • pp.1115-1123
    • /
    • 2016
  • L-Asparaginase (E.C. 3.5.1.1) is an enzyme involved in asparagine hydrolysis and has the potential to effect leukemic cells and various other cancer cells. We identified the L-asparaginase gene (L-ASPG86) in the genus Mesoflavibacter, which consists of a 1,035 bp open reading frame encoding 344 amino acids. Following phylogenetic analysis, the deduced amino acid sequence of L-ASPG86 (L-ASPG86) was grouped as a type I asparaginase with respective homologs in Escherichia coli and Yersinia pseudotuberculosis. The L-ASPG86 gene was cloned into the pET-16b vector to express the respective protein in E. coli BL21 (DE3) cells. Recombinant L-asparaginase (r-L-ASPG86) showed optimum conditions at 37-40℃, pH 9. Moreover, r-L-ASPG86 did not exhibit glutaminase activity. In the metal ions test, its enzymatic activity was highly improved upon addition of 5 mM manganese (3.97-fold) and magnesium (3.35-fold) compared with the untreated control. The specific activity of r-L-ASPG86 was 687.1 units/mg under optimum conditions (37℃, pH 9, and 5 mM MnSO4).

Accelerated Growth of Corynebacterium glutamicum by Up-Regulating Stress-Responsive Genes Based on Transcriptome Analysis of a Fast-Doubling Evolved Strain

  • Park, Jihoon;Lee, SuRin;Lee, Min Ju;Park, Kyunghoon;Lee, Seungki;Kim, Jihyun F.;Kim, Pil
    • Journal of Microbiology and Biotechnology
    • /
    • v.30 no.9
    • /
    • pp.1420-1429
    • /
    • 2020
  • Corynebacterium glutamicum, an important industrial strain, has a relatively slower reproduction rate. To acquire a growth-boosted C. glutamicum, a descendant strain was isolated from a continuous culture after 600 generations. The isolated descendant C. glutamicum, JH41 strain, was able to double 58% faster (td=1.15 h) than the parental type strain (PT, td=1.82 h). To understand the factors boosting reproduction, the transcriptomes of JH41 and PT strains were compared. The mRNAs involved in respiration and TCA cycle were upregulated. The intracellular ATP of the JH41 strain was 50% greater than the PT strain. The upregulation of NCgl1610 operon (a putative dyp-type heme peroxidase, a putative copper chaperone, and a putative copper importer) that presumed to role in the assembly and redox control of cytochrome c oxidase was found in the JH41 transcriptome. Plasmid-driven expression of the operon enabled the PT strain to double 19% faster (td=1.82 h) than its control (td=2.17 h) with 14% greater activity of cytochrome c oxidase and 27% greater intracellular ATP under the oxidative stress conditions. Upregulations of genes those might enhance translation fitness were also found in the JH41 transcriptome. Plasmid-driven expressions of NCgl0171 (encoding a cold-shock protein) and NCgl2435 (encoding a putative peptidyl-tRNA hydrolase) enabled the PT to double 22% and 32% faster than its control, respectively (empty vector: td=1.93 h, CspA: td=1.58 h, and Pth: td=1.44 h). Based on the results, the factors boosting growth rate in C. gluctamicum were further discussed in the viewpoints of cellular energy state, oxidative stress management, and translation.

A Study of the Retrovirus-Mediated Transgenic Chicken Production on Chicken Embryos (닭 수정란에서 Retrovirus를 이용한 형질전환 닭 생산 연구)

  • Byun S. J.;Park C.;Kim S. W.;Park J. K.;Chang W. K.;Yang B. S.;Kim T. Y.;Sohn S. H.;Kim S. H.;Jeon I. S.
    • Korean Journal of Poultry Science
    • /
    • v.32 no.4
    • /
    • pp.225-229
    • /
    • 2005
  • Microinjection of recombinant retrovirus beneath the blastoderm of non-incubated chicken embryo is now the most widespread method for generating transgenic chickens, but transgenesis rates are very low. So to improve this problem, we first introduced retrovirus vector carrying RSV-GFP gene to an one-cell embryo culture system. To investigate whether retrovirus could work on an one-cell chicken embryo, we microinjected the concentrated retrovirus stocks into the germinal disc of one cell or stage-X chicken embryos. Analysis of reporter gene expression on day 4 embryos showed that GFP expression was observed in the only stage-X chicken embryo but was not in the one-cell embryo group. These results suggest that retrovirus system is the most efficient method to generate transgenic chickens in the stage-X embryo.

Suicidal gene therapy with rabbit cytochrome P450 4B1/2-aminoanthracene or 4-ipomeanol system in human colon cancer cell

  • Jang, Su Jin;Kang, Joo Hyun;Moon, Byung Seok;Lee, Yong Jin;Kim, Kwang Il;Lee, Tae Sup;Choe, Jae Gol;Lim, Sang Moo
    • Journal of Radiopharmaceuticals and Molecular Probes
    • /
    • v.1 no.2
    • /
    • pp.118-122
    • /
    • 2015
  • Suicidal gene therapy is based on the transduction of tumor cells with "suicide" genes encoding for prodrug-activating enzymes that render target cells susceptible to prodrug treatment. Suicidal gene therapy results in the death of tumor with the expression of gene encoding enzyme that converts non-toxic prodrug into cytotoxic product. Cytochrome P450 4B1 (CYP4B1) activates 4-ipomeanol (4-IPO) or 2-aminoanthracene (2-AA) to cytotoxic furane epoxide and unsaturated dialdehyde intermediate.In this study, therapeutic effects of suicidal gene therapy with rabbit CYP4B1/2-AA or 4-IPO system were evaluated in HT-29 (human colon cancer cell). pcDNA-CYP4B1 vector was transfected into HT-29 by lipofection and stable transfectant was selected by treatment of hygromycin ($500{\mu}g/mL$) for 3 weeks. Reverse transcription polymerase chain reaction (RT-PCR) analysis was performed for confirmation of CYP4B1 expression in CYP4B1 gene transduced cell. The cytotoxic effects of CYP4B1 transduced cell were determined using dye-exclusion assay after treatment of 2-AA or 4-IPO for 96 hrs. Dye-exclusion assay showed that $IC_{50}$ of HT-29 and CYP4B1 transduced HT-29 was 0.01 mM and 0.003 mM after 4-IPO or 2-AA treatment at 96 hrs exposure, respectively. In conclusion, CYP4B1 based prodrug gene therapy probably have the potential for treatment of colorectal adenocarcinoma.