Search | Korea Science

Multi-objective Genetic Algorithm for Variable Selection in Linear Regression Model and Application (선형회귀모델의 변수선택을 위한 다중목적 유전 알고리즘과 응용)

Kim, Dong-Il;Park, Cheong-Sool;Baek, Jun-Geol;Kim, Sung-Shick
- Journal of the Korea Society for Simulation / v.18 no.4 / pp.137-148 / 2009
The purpose of this study is to implement a variable selection algorithm that helps construct a reliable linear regression model. If all candidate variables are used to construct a linear regression model, the significance of the model decreases and the 'curse of dimensionality' arises. Moreover, if the number of observations is less than the number of variables (the dimension), the regression model cannot be constructed. Because of these problems, we treat variable selection as a combinatorial optimization problem and apply a GA (Genetic Algorithm) to it. Typical measures of statistical significance are $R^2$, the F-value of the regression model, the t-values of the regression coefficients, and the standard error of the estimates. We design the GA to optimize multiple objective functions, because the statistical significance of a model cannot be assessed by a single measure. We perform experiments on simulation data designed to cover various kinds of situations. The results show better performance than LARS (Least Angle Regression), an algorithm for solving variable selection problems. We also modify the algorithm to solve a portfolio selection problem, in which a portfolio is constructed by selecting stocks. We conclude that the algorithm is able to solve real problems.
https://doi.org/10.9709/JKSS.2009.18.4.137
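
As a rough illustration of treating variable selection as a combinatorial optimization problem, the sketch below runs a small genetic algorithm over binary inclusion masks. It is not the authors' multi-objective algorithm: for brevity it scores each mask with a single measure, adjusted $R^2$, and the function names, population size, mutation rate, and selection scheme are all assumptions made for this example.

```python
import numpy as np


def adjusted_r2(X, y, mask):
    """Adjusted R^2 of an OLS fit that uses only the columns where mask is True."""
    n = len(y)
    k = int(mask.sum())
    if k == 0 or n - k - 1 <= 0:
        return -np.inf  # empty model, or too few observations to fit
    A = np.column_stack([np.ones(n), X[:, mask]])  # intercept + selected columns
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    ss_res = float(resid @ resid)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    r2 = 1.0 - ss_res / ss_tot
    return 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)


def ga_select(X, y, pop_size=30, generations=40, p_mut=0.05, seed=0):
    """Single-objective GA over inclusion masks (hypothetical simplification)."""
    rng = np.random.default_rng(seed)
    p = X.shape[1]
    pop = rng.random((pop_size, p)) < 0.5  # random initial masks
    best_mask, best_fit = pop[0].copy(), -np.inf
    for _ in range(generations):
        fit = np.array([adjusted_r2(X, y, m) for m in pop])
        i = int(fit.argmax())
        if fit[i] > best_fit:
            best_fit, best_mask = float(fit[i]), pop[i].copy()
        # Binary tournament selection: the fitter of two random individuals wins.
        a = rng.integers(pop_size, size=pop_size)
        b = rng.integers(pop_size, size=pop_size)
        parents = pop[np.where(fit[a] >= fit[b], a, b)]
        # Uniform crossover with a randomly permuted partner.
        partners = parents[rng.permutation(pop_size)]
        take_parent = rng.random((pop_size, p)) < 0.5
        children = np.where(take_parent, parents, partners)
        # Bit-flip mutation.
        children ^= rng.random((pop_size, p)) < p_mut
        children[0] = best_mask  # elitism: always keep the best mask seen so far
        pop = children
    return best_mask, best_fit
```

Calling `ga_select(X, y)` on an `(n, p)` design matrix returns a boolean mask of length `p` and its adjusted $R^2$. A multi-objective variant in the spirit of the abstract would replace the scalar fitness with a vector of measures (for example $R^2$, F-value, and standard error) and a dominance-based selection step.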

Deletion Timing of Cic Alleles during Hematopoiesis Determines the Degree of Peripheral CD4⁺ T Cell Activation and Proliferation

Guk-Yeol Park;Gil-Woo Lee;Soeun Kim;Hyebeen Hong;Jong Seok Park;Jae-Ho Cho;Yoontae Lee
- IMMUNE NETWORK / v.20 no.5 / pp.43.1-43.11 / 2020
Capicua (CIC) is a transcriptional repressor that regulates several developmental processes. CIC deficiency results in lymphoproliferative autoimmunity accompanied by expansion of CD44^hiCD62L^lo effector/memory and follicular Th cell populations. Deletion of Cic alleles in hematopoietic stem cells (Vav1-Cre-mediated knockout of Cic) causes more severe autoimmunity than that caused by the knockout of Cic in CD4⁺CD8⁺ double positive thymocytes (Cd4-Cre-mediated knockout of Cic). In this study, we compared splenic CD4⁺ T cell activation and proliferation between whole immune cell-specific Cic-null (Cic^f/f;Vav1-Cre) and T cell-specific Cic-null (Cic^f/f;Cd4-Cre) mice. Hyperactivation and hyperproliferation of CD4⁺ T cells were more apparent in Cic^f/f;Vav1-Cre mice than in Cic^f/f;Cd4-Cre mice. Cic^f/f;Vav1-Cre CD4⁺ T cells more rapidly proliferated and secreted larger amounts of IL-2 upon TCR stimulation than did Cic^f/f;Cd4-Cre CD4⁺ T cells, while the TCR stimulation-induced activation of the TCR signaling cascade and calcium flux were comparable between them. Mixed wild-type and Cic^f/f;Vav1-Cre bone marrow chimeras also exhibited more apparent hyperactivation and hyperproliferation of Cic-deficient CD4⁺ T cells than did mixed wild-type and Cic^f/f;Cd4-Cre bone marrow chimeras. Taken together, our data demonstrate that CIC deficiency at the beginning of T cell development endows peripheral CD4⁺ T cells with enhanced T cell activation and proliferative capability.
https://doi.org/10.4110/in.2020.20.e43

A prognosis discovering lethal-related genes in plants for target identification and inhibitor design (식물 치사관련 유전자를 이용하는 신규 제초제 작용점 탐색 및 조절물질 개발동향)

Hwang, I.T.;Lee, D.H.;Choi, J.S.;Kim, T.J.;Kim, B.T.;Park, Y.S.;Cho, K.Y.
- The Korean Journal of Pesticide Science / v.5 no.3 / pp.1-11 / 2001
New technologies will have a large impact on the discovery of new herbicide sites of action. Genomics, combinatorial chemistry, and bioinformatics help take advantage of serendipity through the sequencing of huge numbers of genes or the synthesis of large numbers of chemical compounds. There are approximately $10^{30}$ to $10^{50}$ possible molecules in molecular space, of which only a fraction have been synthesized. Combining this potential with future access to 50,000 plant genes raises the probability of discovering new herbicidal sites of action. If 0.1, 1.0, or 10% of the total genes in a typical plant are valid herbicide targets, a plant with 50,000 genes would provide about 50, 500, and 5,000 targets, respectively. However, only 11 herbicide targets have been identified and commercialized. The successful design of novel herbicides depends on careful consideration of a number of factors, including target enzyme selection and validation, inhibitor design, and metabolic fate. Biochemical information can be used to identify enzymes whose inhibition produces lethal phenotypes. The identification of a lethal target site is an important step in this approach. An examination of the characteristics of known targets provides crucial insight into the definition of a lethal target. Recently, antisense RNA suppression of enzyme translation has been used to determine the genes required for toxicity, and it offers a strategy for identifying lethal target sites. After the identification of a lethal target, detailed knowledge such as the enzyme kinetics and the protein structure may be used to design potent inhibitors. Various types of inhibitors may be designed for a given enzyme. Strategies for the selection of new enzyme targets giving the desired physiological response upon partial inhibition include identification of chemical leads, lethal mutants, and the use of antisense technology.
Enzyme inhibitors having agrochemical utility can be categorized into six major groups: ground-state analogues, group-specific reagents, affinity labels, suicide substrates, reaction intermediate analogues, and extraneous site inhibitors. In this review, examples of each category, and their advantages and disadvantages, are discussed. Target identification and construction of a potent inhibitor may not, in themselves, lead to an effective herbicide. The desired in vivo activity, uptake and translocation, and metabolism of the inhibitor should be studied in detail to assess the full potential of the target. Strategies for delivery of the compound to the target enzyme and avoidance of premature detoxification may include a proherbicidal approach, especially when inhibitors are highly charged or when selective detoxification or activation can be exploited. Utilization of differences in detoxification or activation between weeds and crops may lead to enhanced selectivity. Without a full appreciation of each of these facets of herbicide design, the chances for success with the target- or enzyme-driven approach are reduced.

Comparison of Association Rule Learning and Subgroup Discovery for Mining Traffic Accident Data (교통사고 데이터의 마이닝을 위한 연관규칙 학습기법과 서브그룹 발견기법의 비교)

Kim, Jeongmin;Ryu, Kwang Ryel
- Journal of Intelligence and Information Systems / v.21 no.4 / pp.1-16 / 2015
Traffic accidents have been one of the major causes of death worldwide for the last several decades. According to World Health Organization statistics, approximately 1.24 million deaths occurred on the world's roads in 2010. To reduce future traffic accidents, multipronged approaches have been adopted, including traffic regulations, injury-reducing technologies, and driver training programs. Records of traffic accidents are generated and maintained for this purpose. To make these records meaningful and effective, it is necessary to analyze the relationship between traffic accidents and related factors such as vehicle design, road design, weather, and driver behavior. Insights derived from such analyses can be used in accident prevention. Traffic accident data mining is the activity of finding useful knowledge about such relationships that is not well known and that may interest the user. Many studies on mining accident data have been reported over the past two decades. Most of them focus on predicting the risk of accidents from accident-related factors. Supervised learning methods such as decision trees, logistic regression, k-nearest neighbors, and neural networks are used for this prediction. However, the prediction models derived from these algorithms are too complex for humans to understand, because the main purpose of these algorithms is prediction, not explanation of the data. Some studies use unsupervised clustering algorithms to divide the data into several groups, but the derived groups themselves are still not easy for humans to understand, so additional analysis is necessary. Rule-based learning methods are adequate when we want to derive a comprehensible form of knowledge about the target domain. They derive a set of if-then rules that represent the relationship between the target feature and the other features.
Rules are fairly easy for humans to understand and can therefore help provide insight and comprehensible results. Association rule learning methods and subgroup discovery methods are representative rule-based learning methods for descriptive tasks. These two kinds of algorithms have been used in a wide range of areas, including transaction analysis, accident data analysis, detection of statistically significant patient risk groups, and discovery of key persons in social communities. We use both the association rule learning method and the subgroup discovery method to discover useful patterns in a traffic accident dataset consisting of many features, including driver profile, accident location, accident type, vehicle information, and regulation violations. The association rule learning method, which is an unsupervised learning method, searches for frequent item sets in the data and translates them into rules. In contrast, the subgroup discovery method is a supervised learning method that discovers rules for user-specified concepts that satisfy a certain degree of generality and unusualness. Depending on which aspect of the data we focus on, we may combine multiple relevant features of interest into a synthetic target feature and give it to the rule learning algorithms. After a set of rules is derived, postprocessing steps make the rule set more compact and easier to understand by removing uninteresting or redundant rules. We conducted a set of experiments mining our traffic accident data in both unsupervised and supervised modes to compare these rule-based learning algorithms. Experiments with the traffic accident data reveal that association rule learning, in its pure unsupervised mode, can discover some hidden relationships among the features.
Under the supervised learning setting with a combinatorial target feature, however, the subgroup discovery method finds good rules much more easily than the association rule learning method, which requires considerable effort to tune its parameters.
https://doi.org/10.13088/jiis.2015.21.4.001
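
The contrast between the two approaches can be made concrete with the quality measures they commonly rely on. The sketch below computes support and confidence for an association rule, and weighted relative accuracy (WRAcc), a widely used subgroup discovery measure that trades off generality against unusualness. The toy records, the feature names, and the choice of WRAcc are assumptions for illustration; the abstract does not state which measures the authors used.

```python
def covers(record, conditions):
    """True if the record satisfies every feature=value condition."""
    return all(record.get(key) == value for key, value in conditions.items())


def support_confidence(records, antecedent, consequent):
    """Support and confidence of the rule antecedent -> consequent."""
    n = len(records)
    n_a = sum(covers(r, antecedent) for r in records)
    n_ac = sum(covers(r, antecedent) and covers(r, consequent) for r in records)
    support = n_ac / n
    confidence = n_ac / n_a if n_a else 0.0
    return support, confidence


def wracc(records, conditions, target):
    """WRAcc = P(cond) * (P(target | cond) - P(target))."""
    n = len(records)
    n_c = sum(covers(r, conditions) for r in records)
    if n_c == 0:
        return 0.0  # an empty subgroup carries no information
    n_t = sum(covers(r, target) for r in records)
    n_ct = sum(covers(r, conditions) and covers(r, target) for r in records)
    generality = n_c / n                 # how much of the data the subgroup covers
    unusualness = n_ct / n_c - n_t / n   # shift of the target rate inside the subgroup
    return generality * unusualness


# Hypothetical toy accident records (not from the paper's dataset).
records = [
    {"weather": "rain", "road": "curve", "severe": True},
    {"weather": "rain", "road": "straight", "severe": True},
    {"weather": "clear", "road": "curve", "severe": False},
    {"weather": "clear", "road": "straight", "severe": False},
    {"weather": "rain", "road": "curve", "severe": True},
    {"weather": "clear", "road": "curve", "severe": True},
    {"weather": "clear", "road": "straight", "severe": False},
    {"weather": "rain", "road": "straight", "severe": False},
]

sup, conf = support_confidence(records, {"weather": "rain"}, {"severe": True})
quality = wracc(records, {"weather": "rain"}, {"severe": True})
print(sup, conf, quality)  # 0.375 0.75 0.125
```

In the association rule setting every frequent item set is a candidate and thresholds on support and confidence must be tuned, whereas a subgroup discovery search ranks candidate conditions directly against one user-specified target.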

Context Prediction Using Right and Wrong Patterns to Improve Sequential Matching Performance for More Accurate Dynamic Context-Aware Recommendation (보다 정확한 동적 상황인식 추천을 위해 정확 및 오류 패턴을 활용하여 순차적 매칭 성능이 개선된 상황 예측 방법)

Kwon, Oh-Byung
- Asia pacific journal of information systems / v.19 no.3 / pp.51-67 / 2009
Developing an agile recommender system for nomadic users has been regarded as a promising application in mobile and ubiquitous settings. To increase the quality of personalized recommendation in terms of accuracy and elapsed time, correctly estimating the future context of the user is crucial. Traditionally, time series analysis and Markovian processes have been adopted for such forecasting. However, these methods are not adequate for predicting context data, because most context data are represented on a nominal scale. To resolve these limitations, the alignment-prediction algorithm has been suggested for context prediction, especially for predicting future context from low-level context. Recently, an ontological approach has been proposed for guided context prediction without context history. However, due to the variety of context information, acquiring sufficient context prediction knowledge a priori is not easy in most service domains. Hence, the purpose of this paper is to propose a novel context prediction methodology that does not require a priori knowledge and that increases accuracy and decreases elapsed time for service response. To do so, we have developed a new pattern-based context prediction approach. First of all, a set of individual rules is derived from each context attribute using context history. Then a pattern consisting of the results of reasoning with the individual rules is developed for pattern learning. If at least one context property matches, say R, then regard the pattern as right. If the pattern is new, add the right pattern, set the value of mismatched properties = 0, freq = 1 and w(R, 1). Otherwise, increase the frequency of the matched right pattern by 1 and then set w(R, freq). After training finishes, if the frequency is greater than a threshold value, save the right pattern in the knowledge base. On the other hand, if at least one context property matches, say W, then regard the pattern as wrong.
If the pattern is new, modify the result into the wrong answer, add right pattern, and set frequency to 1 and w(W, 1). Otherwise, increase the matched wrong pattern's frequency by 1 and then set w(W, freq). After training finishes, if the frequency is greater than a threshold level, save the wrong pattern in the knowledge base. Context prediction is then performed with combinatorial rules as follows. First, identify the current context. Second, find matching patterns among the right patterns. If no pattern matches, find a matching pattern among the wrong patterns. If a matching pattern is still not found, choose the one context property whose predictability is higher than that of any other property. To show the feasibility of the proposed methodology, we collected actual context history from travelers who had visited the largest amusement park in Korea. As a result, 400 context records were collected in 2009. We randomly selected 70% of the records as training data and used the rest as testing data. To examine the performance of the methodology, prediction accuracy and elapsed time were chosen as measures. We compared the performance with case-based reasoning (CBR) and voting methods. Through a simulation test, we conclude that our methodology is clearly better than the CBR and voting methods in terms of accuracy and elapsed time. This shows that the methodology is relatively valid and scalable. As a second round of the experiment, we compared a full model to a partial model. A full model uses both right and wrong patterns for reasoning about the future context. A partial model, on the other hand, performs the reasoning only with right patterns, as is generally done in the legacy alignment-prediction method. It turned out that the full model is better than the partial model in terms of accuracy, while the partial model is better in terms of elapsed time.
As a last experiment, we took into consideration potential privacy problems that might arise among the users. To mitigate such concerns, we excluded context properties such as date of tour and user profile attributes such as gender and age. The outcome shows that preserving privacy comes at an acceptable cost. The contributions of this paper are as follows. First, academically, we have improved sequential matching methods in terms of prediction accuracy and service time by considering individual rules for each context property and learning from wrong patterns. Second, the proposed method is found to be quite effective for privacy-preserving applications, which are frequently required by B2C context-aware services; a privacy-preserving system that applies the proposed method can also decrease elapsed time. Hence, the method is very practical for establishing privacy-preserving context-aware services. Our future research issues, taking into account some limitations of this paper, can be summarized as follows. First, user acceptance or usability will be tested with actual users in order to prove the value of the prototype system. Second, we will apply the proposed method to more general application domains, as this paper focused on tourism in an amusement park.

The Effect of Photomodulation in Human Dermal Fibroblasts (피부 섬유아세포에서 광자극의 효과)

Kim, Mi Na;Kwak, Taek Jong;Kang, Nae Gyu;Lee, Sang Hwa;Park, Sun Gyoo;Lee, Cheon Koo
- Journal of the Society of Cosmetic Scientists of Korea / v.41 no.4 / pp.325-331 / 2015
Skin is exposed to sunlight or artificial indoor light on a daily basis. Apart from ultraviolet (UV) light, the solar light reaching the earth's surface consists of 50% visible light and 45% infrared (IR). The negative effects of UV, including UVB and UVA, have been steadily investigated over the last decades. However, little is known about the effects of visible or IR light. In this study, we irradiated human dermal fibroblasts using light-emitting diodes (LEDs) to investigate the optimal parameters for enhancing cell growth and collagen synthesis. We found that red light of 630 nm and green light of 520 nm enhance cell proliferation, whereas irradiation with purple and blue light exerts toxic effects. To examine the response of the fibroblasts to irradiation time and light intensity, cells were exposed to red or green light with intensities from 0.05 to $0.75\,\mathrm{mW/cm^2}$. Procollagen secretion increased 1.4-fold after 10 min of irradiation, while 30 min of treatment decreased the collagen synthesis of dermal fibroblasts. Treatment with red light at $0.3\,\mathrm{mW/cm^2}$ and green light at 0.15 and $0.3\,\mathrm{mW/cm^2}$ enhanced collagen mRNA levels. Lastly, we investigated the combinatorial effect of red and green light on dermal fibroblasts. Sequential irradiation with red and green light increased the number of fibroblasts more efficiently than single-light treatment. On the other hand, exposure to red light alone was more effective for enhancing collagen secretion. Our study showed that specific light parameters accelerated cell proliferation, gene expression, and collagen secretion in human dermal fibroblasts. In conclusion, we demonstrate that light exposure with specific parameters has beneficial effects on the function of dermal fibroblasts, and we suggest the possibility of its cosmetic and clinical application.
https://doi.org/10.15230/SCSK.2015.41.4.325

Antioxidant Effects of Cysteine-containing Peptides of Different Lengths in Human HaCaT Keratinocytes Exposed to Hydrogen Peroxide (과산화수소에 노출된 인간 각질형성세포에서 길이가 다른 시스테인 함유 펩타이드의 항산화 효과)

Jae Won Ha;Joon Yong Choi;Yong Chool Boo
- Journal of the Society of Cosmetic Scientists of Korea / v.49 no.3 / pp.193-201 / 2023
Hydrogen peroxide (H₂O₂) is a type of reactive oxygen species (ROS) that causes oxidative stress in cells and affects cell growth, proliferation, senescence, and death. The purpose of this study is to find active peptides that attenuate the cytotoxicity of H₂O₂. A positional scanning synthetic tetrapeptide combinatorial library was screened to predict the sequences of potentially active peptides. By comparing the effects of peptide pools on H₂O₂-induced death of human keratinocytes (HaCaT cells), various active peptide sequences were predicted. In particular, peptides containing a cysteine (C) residue were predicted to be active. In follow-up experiments, the cytotoxicity and activity of cysteine-containing peptides of different lengths, namely C-NH₂, CC-NH₂, CCC-NH₂, and CCCC-NH₂, were examined. C-NH₂ and CC-NH₂ showed no significant cytotoxicity up to 1.0 mM, but CCC-NH₂ and CCCC-NH₂ showed relatively strong cytotoxicity. C-NH₂ and CC-NH₂ alleviated H₂O₂-induced cytotoxicity. CC-NH₂ was more cytoprotective than C-NH₂, C, N-acetyl cysteine (NAC), and glutathione (GSH). When intracellular ROS was measured by flow cytometry, H₂O₂ increased ROS production, and CC-NH₂ suppressed ROS production more effectively than C-NH₂ and as effectively as C, NAC, and GSH. This study suggests that, among the cysteine-containing peptides of different lengths, CC-NH₂ has an antioxidant property that safely and effectively alleviates H₂O₂-induced cytotoxicity and ROS production.
https://doi.org/10.15230/SCSK.2023.49.3.193

Memory Organization for a Fuzzy Controller.

Jee, K.D.S.;Poluzzi, R.;Russo, B.
- Proceedings of the Korean Institute of Intelligent Systems Conference / 1993.06a / pp.1041-1043 / 1993
Fuzzy logic based control theory has gained much interest in the industrial world, thanks to its ability to formalize and solve in a very natural way many problems that are very difficult to quantify at an analytical level. This paper presents a solution for handling membership functions inside hardware circuits. The proposed hardware structure optimizes the memory size by using a particular form of the vectorial representation. The process of memorizing fuzzy sets, i.e. their membership functions, has always been one of the more problematic issues for hardware implementation, due to the quite large memory space that is needed. To simplify such an implementation, it is common [1,2,8,9,10,11] to limit the membership functions either to those having a triangular or trapezoidal shape, or to a predefined shape. These kinds of functions can cover a large spectrum of applications with limited memory usage, since they can be memorized by specifying very few parameters (height, base, critical points, etc.). This, however, results in a loss of computational power due to computation on the intermediate points. A solution to this problem is obtained by discretizing the universe of discourse U, i.e. by fixing a finite number of points and memorizing the values of the membership functions at such points [3,10,14,15]. Such a solution provides satisfactory computational speed and very high precision of definition, and gives users the opportunity to choose membership functions of any shape. However, it can also waste a significant amount of memory. It is indeed possible that, for each of the given fuzzy sets, many elements of the universe of discourse have a membership value equal to zero. It has also been noticed that in almost all cases the common points among fuzzy sets, i.e. points with non-null membership values, are very few.
More specifically, in many applications, for each element u of U, there exist at most three fuzzy sets for which the membership value is not null [3,5,6,7,12,13]. Our proposal is based on such hypotheses. Moreover, we use a technique that, even though it does not restrict the shapes of membership functions, strongly reduces the computational time for the membership values and optimizes the function memorization. Figure 1 shows a term set whose characteristics are common for fuzzy controllers and to which we will refer in the following. This term set has a universe of discourse with 128 elements (so as to have a good resolution), 8 fuzzy sets that describe the term set, and 32 levels of discretization for the membership values. Clearly, the numbers of bits necessary for the given specifications are 5 for 32 truth levels, 3 for 8 membership functions, and 7 for 128 levels of resolution. The memory depth is given by the dimension of the universe of discourse (128 in our case) and is represented by the memory rows. The length of a word of memory is defined by: Length = nfm × (dm(m) + dm(fm)), where nfm is the maximum number of non-null values in every element of the universe of discourse, dm(m) is the dimension of the values of the membership function m, and dm(fm) is the dimension of the word to represent the index of the highest membership function. In our case, then, Length = 3 × (5 + 3) = 24. The memory dimension is therefore 128*24 bits. If we had chosen to memorize all values of the membership functions, we would have needed to memorize on each memory row the membership value of each element. The fuzzy sets word dimension is 8*5 bits. Therefore, the dimension of the memory would have been 128*40 bits. Consistent with our hypothesis, in fig. 1 each element of the universe of discourse has a non-null membership value on at most three fuzzy sets.
Focusing on the elements 32, 64, and 96 of the universe of discourse, they will be memorized as follows: The computation of the rule weights is done by comparing those bits that represent the index of the membership function with the word of the program memory. The output bus of the Program Memory (μCOD) is given as input to a comparator (Combinatory Net). If the index is equal to the bus value, then one of the non-null weights derives from the rule and is produced as output; otherwise the output is zero (fig. 2). It is clear that the memory dimension of the antecedent is reduced in this way, since only non-null values are memorized. Moreover, the time performance of the system is equivalent to the performance of a system using vectorial memorization of all weights. The dimensioning of the word is influenced by some parameters of the input variable. The most important parameter is the maximum number of membership functions (nfm) having a non-null value in each element of the universe of discourse. From our study in the field of fuzzy systems, we see that typically nfm ≤ 3 and there are at most 16 membership functions. At any rate, such a value can be increased up to the physical dimensional limit of the antecedent memory. A less important role in the optimization process of the word dimension is played by the number of membership functions defined for each linguistic term. The table below shows the required word dimension as a function of such parameters and compares our proposed method with the method of vectorial memorization [10]. Summing up, the characteristics of our method are: Users are not restricted to membership functions with specific shapes. The number of fuzzy sets and the resolution of the vertical axis have a very small influence on increasing memory space. Weight computations are done by a combinatorial network, and therefore the time performance of the system is equivalent to that of the vectorial method.
The number of non-null membership values on any element of the universe of discourse is limited. Such a constraint is usually not very restrictive, since many controllers obtain good precision with only three non-null weights. The method briefly described here has been adopted by our group in the design of an optimized version of the coprocessor described in [10].
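
The word-length arithmetic and the compact storage scheme in this abstract can be sketched in software as follows. The function names are hypothetical, the example membership values are made up, and the lookup is only a software stand-in for the hardware comparator, not the authors' circuit.

```python
def bits_for(levels):
    """Number of bits needed to encode `levels` distinct values."""
    return (levels - 1).bit_length()


def compact_word_length(nfm, truth_levels, n_sets):
    """Length = nfm * (dm(m) + dm(fm)): each slot holds a value plus a set index."""
    return nfm * (bits_for(truth_levels) + bits_for(n_sets))


def full_word_length(truth_levels, n_sets):
    """Word length when every fuzzy set's membership value is stored per element."""
    return n_sets * bits_for(truth_levels)


def pack_row(memberships, nfm):
    """Keep only the non-null (set index, value) pairs for one element u of U."""
    pairs = [(index, value) for index, value in enumerate(memberships) if value != 0]
    if len(pairs) > nfm:
        raise ValueError("more than nfm non-null membership values on this element")
    return pairs


def lookup(row, set_index):
    """Comparator stand-in: stored value if the index matches, otherwise zero."""
    for index, value in row:
        if index == set_index:
            return value
    return 0


# Figures quoted in the abstract: 128 elements, 8 fuzzy sets, 32 truth levels, nfm = 3.
depth = 128
compact = compact_word_length(nfm=3, truth_levels=32, n_sets=8)
full = full_word_length(truth_levels=32, n_sets=8)
print(depth * compact, depth * full)  # 3072 5120

# Hypothetical membership values of one element across the 8 fuzzy sets.
row = pack_row([0, 0, 12, 31, 5, 0, 0, 0], nfm=3)
print(lookup(row, 3), lookup(row, 0))  # 31 0
```

With these figures the compact layout needs 128*24 = 3072 bits against 128*40 = 5120 bits for full vectorial storage, matching the numbers in the text.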
