• Title/Summary/Keyword: attribute tree


Candidate Marker Identification from Gene Expression Data with Attribute Value Discretization and Negation (속성값 이산화 및 부정값 허용을 하는 의사결정트리 기반의 유전자 발현 데이터의 마커 후보 식별)

  • Lee, Kyung-Mi;Lee, Keon-Myung
    • Journal of the Korean Institute of Intelligent Systems / v.21 no.5 / pp.575-580 / 2011
  • With growing expectations for personalized medicine, it is becoming important to analyze medical information from a molecular biology perspective. Gene expression data are a representative source for observing the microscopic phenomena of biological activity. A major concern in gene expression data analysis is identifying markers that can be used to predict disease occurrence, progression, or recurrence at the molecular level. Existing methods for identifying candidate markers depend mainly on statistical hypothesis testing. This paper proposes a search method based on decision tree induction to identify candidate markers that consist of multiple genes. The proposed method discretizes numeric expression levels into three categorical values and allows the genes of a candidate marker to be expressed by the negation of a categorical value as well as by the value itself. Because it is desirable for markers to include a certain number of genes, the method is designed to search for candidate markers with a restricted number of genes.
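
The discretization and negation idea in the abstract above can be illustrated with a small sketch. The cut points, the category names (`low`, `mid`, `high`), and the helper names `discretize` and `matches` are assumptions made for illustration; the abstract does not say how the authors choose thresholds or encode conditions.

```python
from typing import Dict, List, Sequence


def discretize(values: Sequence[float], lo_cut: float, hi_cut: float) -> List[str]:
    """Map each numeric expression level to one of three categories.

    lo_cut and hi_cut are hypothetical thresholds supplied by the caller.
    """
    out = []
    for v in values:
        if v < lo_cut:
            out.append("low")
        elif v < hi_cut:
            out.append("mid")
        else:
            out.append("high")
    return out


def matches(sample: Dict[str, str], gene: str, level: str, negated: bool = False) -> bool:
    """Test the condition 'gene is level', or 'gene is not level' when negated."""
    hit = sample[gene] == level
    return (not hit) if negated else hit
```

A decision tree node that allows negation could then branch on a test such as `matches(sample, "g1", "high", negated=True)`, which holds for both `low` and `mid` samples.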

How different is a web site that many people visit?-focused on the Plastic Surgery Websites in Korea (많은 사람이 방문하는 웹 사이트는 무엇이 다를까? - 2011년 성형외과 웹 사이트의 경우 -)

  • Cho, Yeong-Bin;Kim, Chae-Bogk
    • Management & Information Systems Review / v.32 no.1 / pp.43-62 / 2013
  • To identify the characteristics of high-traffic websites, 37 frequently visited plastic surgery websites were compared with 69 benchmark sites in the same industry. We selected 36 objectively measurable website attributes from existing studies and built a data set of 36 attributes for each of the 106 websites. Multiple discriminant analysis (MDA) and a decision tree technique were used to find the attributes that clearly separate the two groups. The results show that the discriminating attributes fall into three categories: 'Community', 'Mobile', and 'Up to date'. We therefore conclude that frequently visited plastic surgery websites are community-centric rather than content-centric, respond quickly to the shift to the mobile environment, and are kept up to date. The methodology employed in this study provides an efficient way to improve visitor satisfaction with plastic surgery websites.


Security Elevation of XML Document Using DTD Digital Signature (DTD 전자서명을 이용한 XML문서의 보안성 향상)

  • Park, Dou-Joon;Min, Hye-Lan;Lee, Joon
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / v.9 no.1 / pp.1080-1083 / 2005
  • A DTD can be regarded as metadata that defines the meaning of the data expressed in an XML document. If the DTD information is damaged, the security of the XML documents that rely on it is therefore at risk. Rather than attaching a digital signature to the XML document during transmission and reception, this research proposes attaching a digital signature to the DTD. The DTD file is first read to the end and parsed, and the extracted element and attribute entities are stored in a hash table. Once parsing is finished, the hash table is read and a message digest is computed. The digital signature is then created from the digest with a private key. A problem arises during signing: because the message digest process does not account for changes in order, an entirely different digest value can be produced. This was solved by creating the DTD's digital signature using DOM, which can represent the standard structure and the document as a tree.
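
The ordering problem mentioned in the abstract above can be illustrated with a minimal sketch that computes a digest independent of declaration order. This is not the paper's DOM-based method: the simplified regular expression, the sort-based canonicalization, the choice of SHA-256, and the function name `dtd_digest` are all assumptions. The signing step with a private key is omitted.

```python
import hashlib
import re

# Simplified matcher for <!ELEMENT ...> and <!ATTLIST ...> declarations.
# A real DTD parser handles comments, entities, and conditional sections.
DECL_RE = re.compile(r"<!(ELEMENT|ATTLIST)\s+([^>]*)>")


def dtd_digest(dtd_text: str) -> str:
    """Return a SHA-256 hex digest that ignores declaration order."""
    decls = []
    for kind, body in DECL_RE.findall(dtd_text):
        # Collapse whitespace so formatting differences do not change the digest.
        decls.append(kind + " " + " ".join(body.split()))
    # Sorting gives every ordering of the same declarations one canonical form.
    canonical = "\n".join(sorted(decls))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

Two DTD files that contain the same declarations in a different order yield the same digest, while a change to any declaration yields a different one.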


A design of VHDL compiler front-end for the VHDL-to-C mapping (VHDL-to-C 사상을 위한 VHDL 컴파일러 전반부의 설계)

  • 공진흥;고형일
    • The Journal of Korean Institute of Communications and Information Sciences / v.22 no.12 / pp.2834-2851 / 1997
  • This paper describes the design and implementation of a VHDL compiler front-end that aims to support the full set of the VHDL '87 and '93 LRM and to carry out the preprocessing for VHDL-to-C mapping. The VHDL compiler front-end includes 1) a symbol tree of analyzed data that represents the hierarchy, scope and visibility, overloading and homographs, and concurrent multiple stacks in VHDL; 2) data structures and supporting routines that handle objects, types and subtypes, and attributes and operations in VHDL; and 3) analysis of concurrent/sequential statements and behavioral/structural descriptions, together with semantic tokens and the propagation of symbols and types, to improve the registration and retrieval of analyzed data. In experiments with the Validation Suite, the VHDL compiler front-end supported the full specification of the VHDL LRM '87 and '93; in experiments to assess the performance of the semantic token in VHDL hierarchy/visibility/concurrency/semantic checking, an improvement of about 20-30% was achieved.


A Study on Selecting Key Opcodes for Malware Classification and Its Usefulness (악성코드 분류를 위한 중요 연산부호 선택 및 그 유용성에 관한 연구)

  • Park, Jeong Been;Han, Kyung Soo;Kim, Tae Gune;Im, Eul Gyu
    • Journal of KIISE / v.42 no.5 / pp.558-565 / 2015
  • Recently, the number of new malware samples and malware variants has increased dramatically. As a result, the time needed to analyze malware and the effort required of malware analysts have also increased. Malware classification helps analysts reduce the overhead of malware analysis and is useful for studying malware genealogy. In this paper, we propose a set of key opcodes for classifying malware. In our experiments, we selected the top 10 opcodes as key opcodes, and using them decreased the training time of a supervised learning algorithm by 91% while preserving classification accuracy.

Automated Detecting and Tracing for Plagiarized Programs using Gumbel Distribution Model (굼벨 분포 모델을 이용한 표절 프로그램 자동 탐색 및 추적)

  • Ji, Jeong-Hoon;Woo, Gyun;Cho, Hwan-Gue
    • The KIPS Transactions:PartA / v.16A no.6 / pp.453-462 / 2009
  • Studies on software plagiarism detection, prevention, and judgment have become widespread owing to the growing interest in, and importance of, protecting and authenticating software intellectual property. Many previous studies focused on comparing all pairs of submitted codes by using attribute counting, token patterns, program parse trees, and similarity measuring algorithms. It is important to provide a clear-cut model for distinguishing plagiarism from collaboration. This paper proposes a source code clustering algorithm that uses a probability model based on an extreme value distribution. First, we propose an asymmetric distance measure pdist($P_a$, $P_b$) to measure the similarity of $P_a$ and $P_b$. We then construct the Plagiarism Direction Graph (PDG) for a given program set using pdist($P_a$, $P_b$) as edge weights, and transform the PDG into a Gumbel Distance Graph (GDG) model, since we found that the distribution of pdist($P_a$, $P_b$) scores is similar to the well-known Gumbel distribution. Second, we newly define pseudo-plagiarism, a kind of virtual plagiarism forced by a very strong functional requirement in the specification. We conducted experiments with 18 groups of programs (more than 700 source codes) collected from the ICPC (International Collegiate Programming Contest) and KOI (Korean Olympiad for Informatics) programming contests. The experiments showed that most plagiarized codes could be detected with high sensitivity and that our algorithm successfully separated real plagiarism from pseudo-plagiarism.
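
To make the Gumbel model in the abstract above more concrete, the sketch below fits a Gumbel (maximum) distribution to a list of scores by the method of moments and evaluates how unusual a given score is. The abstract does not describe the fitting procedure or how the GDG is built, so the method-of-moments fit, the use of the upper tail, and the function names are assumptions.

```python
import math
from statistics import mean, stdev

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def fit_gumbel(scores):
    """Method-of-moments estimate of Gumbel location (mu) and scale (beta)."""
    beta = stdev(scores) * math.sqrt(6) / math.pi
    mu = mean(scores) - EULER_GAMMA * beta
    return mu, beta


def gumbel_cdf(x, mu, beta):
    """CDF of the Gumbel (maximum) distribution: exp(-exp(-(x - mu) / beta))."""
    return math.exp(-math.exp(-(x - mu) / beta))


def upper_tail(x, mu, beta):
    """Probability of a score at least as large as x under the fitted model."""
    return 1.0 - gumbel_cdf(x, mu, beta)
```

A pair of programs whose score has a very small `upper_tail` value under the fitted distribution would be a candidate for closer inspection; whether large or small pdist values indicate plagiarism depends on the paper's definition, which the abstract does not give.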

Developmental disability Diagnosis Assessment Systems Implementation using Multimedia Authorizing Tool (멀티미디어 저작도구를 이용한 발달장애 진단.평가 시스템 구현연구)

  • Byun, Sang-Hea;Lee, Jae-Hyun
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship / v.3 no.1 / pp.57-72 / 2008
  • This paper brings together the application field of computing and the views of specialists on developmental disability diagnosis and assessment data in order to construct a developmental disability diagnosis and assessment system. Such a system must continuously supply the domain-specific information that specialists hold. A specialist system for developmental disability diagnosis and assessment needs multimedia data processing that is more specialized, and automated, for the classification, diagnosis, and decision-making involved. The system studied in this paper can provide quick feedback on results, reduce recording and calculation mistakes, and shorten examination time, and it is efficient in that non-specialists with little training can use it easily. However, the multimedia information that is essential to building such a system has many kinds of attributes, so when a person must describe all of the diagnosis and assessment information, a great amount of work is involved, and the description of the same data can differ from one administrator to another. Because of these problems, content-based search technology, which retrieves related information from the content of the target data, was applied to the multimedia data processing for developmental disability diagnosis and assessment data.
In the meantime, the typical approach to interactive data processing that supports fast image search extracts the features of the data as an N-dimensional vector, stores it in a database as an N-dimensional value, and uses a tree-based index structure to search for relevant data on that basis. However, these techniques do not exactly match the purpose of developmental disability diagnosis and assessment, because they were developed for application fields that use low-dimensional data, such as spatial databases or geographic information systems. Therefore, this study investigated a new storage structure and index mechanism that support fast search over large volumes of such data.


The State Attribute and Grade Influence Structure for the RC Bridge Deck Slabs by Information Entropy (정보 엔트로피에 의한 RC 교량 상판의 상태속성 및 등급 영향 구조 분석)

  • Hwang, Jin-Ha;Park, Jong-Hoi;An, Seoung-Su
    • Journal of the Computational Structural Engineering Institute of Korea / v.23 no.1 / pp.61-71 / 2010
  • This study analyzes the attributes related to the health condition of RC deck slabs to help identify and rate the safety level of bridges. According to related reports, the state assessment of the outward condition of a bridge is an important and critical part of rating its overall structural safety. Careful identification of the various state attributes therefore makes field inspection and structural diagnosis more effective. This study analyzes, by inductive reasoning, the influence of the state attributes on the evaluation classes and the relationships among them, which improves the understanding and performance of evaluation work and supports a logical approach to state assessment. Applying the ID3 algorithm to a case set constructed from field reports identifies the main attributes and the precedence governing the assessment, and derives the decision hierarchy for the state assessment.
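
ID3, the algorithm named in the abstract above, ranks attributes by information gain, that is, by how much splitting on an attribute reduces the entropy of the class labels. The sketch below shows that calculation. The row format (a list of dicts) and the function names are assumptions for illustration, and the abstract does not list the actual state attributes or grades used in the study.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Entropy reduction obtained by splitting rows (dicts) on one attribute."""
    base = entropy([r[target] for r in rows])
    groups = {}
    for r in rows:
        groups.setdefault(r[attribute], []).append(r[target])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder


def best_attribute(rows, attributes, target):
    """ID3 selects the attribute with the highest information gain at each node."""
    return max(attributes, key=lambda a: information_gain(rows, a, target))
```

Applying `best_attribute` recursively to each subset of rows produces the decision hierarchy; the order in which attributes are chosen corresponds to the precedence the abstract refers to.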

Advanced Improvement for Frequent Pattern Mining using Bit-Clustering (비트 클러스터링을 이용한 빈발 패턴 탐사의 성능 개선 방안)

  • Kim, Eui-Chan;Kim, Kye-Hyun;Lee, Chul-Yong;Park, Eun-Ji
    • Journal of Korea Spatial Information System Society / v.9 no.1 / pp.105-115 / 2007
  • Data mining extracts interesting knowledge from large databases. Among the many data mining techniques, research has concentrated primarily on clustering and association rules. Clustering, an active research topic, mainly deals with analyzing spatial and attribute data, while association rule techniques deal with identifying frequent patterns. An improved Apriori algorithm that uses an existing bit-clustering algorithm has previously been proposed. In an effort to identify an alternative to Apriori, we investigated FP-Growth and examined the possibility of adopting bit-clustering to solve the problems with FP-Growth. FP-Growth using bit-clustering demonstrated better performance than the existing method. We used chess data for the pattern mining evaluation and built FP-Trees with different minimum support values. With high minimum support values, the results were similar to those of the existing techniques. In other cases, however, the proposed technique performed better than the existing technique. The proposed technique is therefore considered to lead to higher performance. In addition, a method for applying bit-clustering to GML data was proposed.
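
The role of minimum support in frequent pattern mining can be illustrated with a generic bitmap-based support count. This is not the paper's bit-clustering algorithm or FP-Growth itself, neither of which the abstract specifies in detail; it only shows how bit vectors can represent which transactions contain an item. The function names are hypothetical.

```python
def item_bitmaps(transactions):
    """Map each item to an int whose bit i is set if transaction i contains it."""
    bitmaps = {}
    for i, items in enumerate(transactions):
        for item in items:
            bitmaps[item] = bitmaps.get(item, 0) | (1 << i)
    return bitmaps


def support(itemset, bitmaps, n_transactions):
    """Count the transactions that contain every item of itemset via bitwise AND."""
    mask = (1 << n_transactions) - 1  # start with all transactions selected
    for item in itemset:
        mask &= bitmaps.get(item, 0)
    return bin(mask).count("1")


def frequent_items(bitmaps, n_transactions, min_support):
    """Return the items whose support count meets the minimum support threshold."""
    return sorted(
        item for item in bitmaps
        if support([item], bitmaps, n_transactions) >= min_support
    )
```

Raising `min_support` shrinks the set of frequent items, which is why the size of the resulting pattern structure, and the running time, vary with the threshold.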


A Study on Yunqi Climate (運氣氣候) through analysis of Meteorological research data in Korea (한국(韓國) 기상자료(氣象資料)의 분석(分析)을 통(通)한 운기(運氣) 기후(氣候)에 관(關)한 연구(硏究))

  • Park, Chan-Young;Kim, Ki-Wook;Park, Hyun-Kook
    • The Journal of Dong Guk Oriental Medicine / v.8 no.2 / pp.1-24 / 2000
  • In this climate study, the climatic characteristics described by Yunqi(運氣) theory were compared with meteorological observation data. 1. The average wind velocity, temperature, rainfall, and humidity of Seoul from 1954 to 1983 were compared with Yunqi(運氣). Fire-qi(火氣) and Moisture-qi(濕氣) matched the attribute of Taiyun(大運). Cold-qi(寒氣) showed some relationship. Dry-qi(燥氣) and Wind-qi(風氣) did not match. Regarding the relationship between the Spirit-of-official-sky(司天之氣) and climate, the two matched when Moisture-soil(濕土) was added and showed some relationship when King-fire(君火) was added, but they did not match when Wind-tree(風木), Dry-metal(燥金), Buble-fire(相火), or Cold-water(寒水) was added. 2. According to rainfall observation data for Seoul covering the last 180 years, with respect to Taiyun(大運), the frequency was high when Water-Yun(水運) was greatly in excess and Fire-Yun(火運) was deficient, and, in the case of Official-sky(司天), when Wind-Tree(風木) was added. When Soil-Yun(土運) was greatly in excess and when Moisture-soil(濕土) was added to Official-sky(司天), the rainfall did not match. 3. The frequency of abnormal climate occurrences on the Korean Peninsula from 1564 to 1863 was compared with Yunqi-promotion-weak(運氣盛衰), Yunqi-Harmony(運氣同化), and Yunqi-soft-attacking(運氣順逆). They did not match except in the case of Yunqi-Harmony(運氣同化). 4. In 1984, the year of Kap-ga(甲子年), there were some cases in which the climate predicted by the theory did not exactly match the real climate, but there were many correspondences between the observations of the meteorological office and the predictions from the Yun-qi-sang-hab(運氣相合) theory. 5. Because real-world meteorological phenomena and analyses based on the Yunqi(運氣) hypothesis showed no relationship with each other, some doctors rejected the mechanical application of Yunqi(運氣).
However, the thinking of the doctors who rejected Yunqi(運氣) also advanced the theory by bringing deeper insight, so it was not only negative. 6. Geographically, the climate of China, where Yunqi theory originated, differs from that of Korea, so some observation errors should be considered. On the basis of this thesis, I hope that Korean Yunqi theory will be developed further.
