• Title/Summary/Keyword: Similarity Searches

Search Result 57, Processing Time 0.03 seconds

Identification and Characterization of the Causal Organism of Gummy Stem Blight in the Muskmelon (Cucumis melo L.)

  • Choi, In-Young;Choi, Jang-Nam;Choi, Dong-Chil;Sharma, Praveen Kumar;Lee, Wang-Hyu
    • Mycobiology
    • /
    • v.38 no.3
    • /
    • pp.166-170
    • /
    • 2010
  • Gummy stem blight is a major foliar disease of muskmelon (Cucumis melo L.). In this study, morphological characteristics and rDNA internal transcribed spacer (ITS) sequences were analyzed to identify the causal organism of this disease. Morphological examination of the Jeonbuk isolate revealed that the percentage of monoseptal conidia ranged from 0% to 10%, and the average length $\times$ width of the conidia was 70 ($\pm$ 0.96) $\times$ 32.0 ($\pm$ 0.15) ${\mu}m$ on potato dextrose agar. The BLAST analysis showed nucleotide gaps of 1/494, 2/492, and 1/478 with identities of 485/492 (98%), 492/494 (99%), 491/494 (99%), and 476/478 (99%). The similarity in sequence identity between the rDNA ITS region of the Jeonbuk isolate and other Didymella bryoniae from BLAST searches of GenBank was 100% and was 95.0% within the group. Nucleotide sequences of the rDNA ITS region from pure culture ranged from 98.2% to 99.8%. Phylogenetic analysis with related species of D. bryoniae revealed that D. bryoniae is a monophyletic group distinguishable from other Didymella spp., including Ascochyta pinodes, Mycosphaerella pinodes, M. zeae-maydis, D. pinodes, D. applanata, D. exigua, D. rabiei, D. lentis, D. fabae, and D. vitalbina. Phylogenetic analysis, based on rDNA ITS sequence, clearly distinguished D. bryoniae and Didymella spp. from the 10 other species studied. This study identified the Jeonbuk isolate to be D. bryoniae.

Fractal Compression using Range Block Coherence (레인지 블록 유사성을 이용한 프랙탈 압축)

  • Kim, Young-Bong;Lee, Yun-Jung
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.27 no.2
    • /
    • pp.117-122
    • /
    • 2000
  • The fractal image compression is based on the self-similarity that some area in an image is very similar to others. This compression method offers high compression ratio and fast decompression, but it has very long encoding time. To cut-off the encoding time, most researches give a restriction on domain blocks to be compared with a range block or make an effective search sequence of the domain blocks for a range block. However, most of them take much encoding time yet. In this research, we propose an algorithm that greatly reduces the encoding time by considering the coherence between range blocks. This algorithm first classifies all range blocks into some groups using the coherence between range blocks, and then searches corresponding domain blocks only for the key block of each group. If this scheme is joined in a prior work of the other fractal compression algorithm, it will give a great effectiveness to encoding time.

  • PDF

Changes in the Attitudes of Doctors toward Cooperative Practices between Western Medicine and Traditional Korean Medicine - A Systematic Review in Korean Literature - (한.양방 협진에 대한 의사들의 인식변화 - 국내 문헌에 대한 체계적 고찰 -)

  • Min, Hyun-Ju;Ryu, Ji-Seon;Yun, Young-Ju
    • Journal of Society of Preventive Korean Medicine
    • /
    • v.16 no.1
    • /
    • pp.15-29
    • /
    • 2012
  • Background : With the increase of cooperative practices (CP) between conventional western medicine and traditional Korean medicine, there have been lots of researches on the status of CP and the attitude of doctors. Objective : Since most of the research is cross-sectional, this study aims to figure out the changes in the attitude of doctors toward CP through systematic review. Method : Systematic literature searches were performed on several databases in Korea. They were categorized according to the respondents and question items and analyzed by the context of questions, similarity of respondents and measurement scale. And we analyzed the changes of response regarding to doctors' awareness and attitude to CP. Results : Thirteen survey studies including attitude of doctors toward CP were selected. These studies were conducted between 1997 and 2009 and the number of respondents of each study ranged from 20 to 702. There has been increasing awareness of CP among doctors ; however the positive responses on the necessity of CP has decreased. Regarding the type of illness effectively treated employing CP, there was a shift from neurovascular to musculoskeletal and immune diseases. Most of the studies listed different approaches to disease, prejudice of health care providers and inadequate legal system as major obstacles against CP. Conclusion : In spite of the increase of CP in the last 20 years, there has not been marked positive change in the doctors' attitude toward CP. To promote CP, it is required to confirm the effectiveness of CP through disease models and change the medical legislation policies on CP.

Algorithm for Predicting Functionally Equivalent Proteins from BLAST and HMMER Searches

  • Yu, Dong Su;Lee, Dae-Hee;Kim, Seong Keun;Lee, Choong Hoon;Song, Ju Yeon;Kong, Eun Bae;Kim, Jihyun F.
    • Journal of Microbiology and Biotechnology
    • /
    • v.22 no.8
    • /
    • pp.1054-1058
    • /
    • 2012
  • In order to predict biologically significant attributes such as function from protein sequences, searching against large databases for homologous proteins is a common practice. In particular, BLAST and HMMER are widely used in a variety of biological fields. However, sequence-homologous proteins determined by BLAST and proteins having the same domains predicted by HMMER are not always functionally equivalent, even though their sequences are aligning with high similarity. Thus, accurate assignment of functionally equivalent proteins from aligned sequences remains a challenge in bioinformatics. We have developed the FEP-BH algorithm to predict functionally equivalent proteins from protein-protein pairs identified by BLAST and from protein-domain pairs predicted by HMMER. When examined against domain classes of the Pfam-A seed database, FEP-BH showed 71.53% accuracy, whereas BLAST and HMMER were 57.72% and 36.62%, respectively. We expect that the FEP-BH algorithm will be effective in predicting functionally equivalent proteins from BLAST and HMMER outputs and will also suit biologists who want to search out functionally equivalent proteins from among sequence-homologous proteins.

Gene Microarray Analysis for Porcine Adipose Tissue: Comparison of Gene Expression between Chinese Xiang Pig and Large White

  • Guo, W.;Wang, S.H.;Cao, H.J.;Xu, K.;Zhang, J.;Du, Z.L.;Lu, W.;Feng, J.D.;Li, N.;Wu, C.H.;Zhang, L.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.21 no.1
    • /
    • pp.11-18
    • /
    • 2008
  • We created a cDNA microarray representing approximately 3,500 pig genes for functional genomic studies. The array elements were selected from 6,494 cDNA clones identified in a large-scale expressed sequence tag (EST) project. These cDNA clones came from normalized and subtracted porcine adipose tissue cDNA libraries. Sequence similarity searches of the 3,426 ESTs represented on the array using BLASTN identified 2,790 (81.4%) as putative human orthologs, with the remainder consisting of "novel" genes or highly divergent orthologs. We used the gene microarray to profile transcripts expressed by adipose tissue of fatty Chinese Xiang pig (XP) and muscley Large White (LW). Microarray analysis of RNA extracted from adipose tissue of fatty XP and muscley LW identified 81 genes that were differently expressed two fold or more. Transcriptional differences of four of these genes, adipocyte fatty acid binding protein (aP2), stearyl-CoA desaturase (SCD), sterol regulatory element binding transcription factor 1 (SREBF1) and lipoprotein lipase (LPL) were confirmed using SYBR Green quantitative RT-PCR technology. Our results showed that high expression of SCD and SREBF1 may be one of the reasons that larger fat deposits are observed in the XP. In addition, our findings also illustrate the potential power of microarrays for understanding the molecular mechanisms of porcine development, disease resistance, nutrition, fertility and production traits.

국가연구개발사업 평가에서 사회연결망 분석 활용 방안

  • Gi, Ji-Hun
    • Proceedings of the Korea Technology Innovation Society Conference
    • /
    • 2017.11a
    • /
    • pp.129-129
    • /
    • 2017
  • In planning and evaluating government R&D programs, one of the first steps is to understand the government's current R&D investment portfolio - which fields or topics the government is now investing in in R&D. Analysis methods of an investment portfolio of government R&D tend traditionally to rely on keyword searches or ad-hoc two-dimensional classifications. The main drawback of these approaches is their limited ability to account for the characteristics of the whole government investment in R&D and the role of individual R&D program in it, which tends to depend on the relationship with other programs. This paper suggests a new method for mapping and analyzing government investment in R&D using a combination of methods from natural language processing (NLP) and network analysis. The NLP enables us to build a network of government R&D programs whose links are defined as similarity in R&D topics. Then methods from network analysis show the characteristics of government investment in R&D, including major investment fields, unexplored topics, and key R&D programs which play a role like a hub or a bridge in the network of R&D programs, which are difficult to be identified by conventional methods. These insights can be utilized in planning a new R&D program, in reviewing its proposal, or in evaluating the performance of R&D programs. The utilized (filtered) Korean text corpus consists of hundreds of R&D program descriptions in the budget requests for fiscal year 2017 submitted by government departments to the Korean Ministry of Strategy and Finance.

  • PDF

Sequence Stream Indexing Method using DFT and Bitmap in Sequence Data Warehouse (시퀀스 데이터웨어하우스에서 이산푸리에변환과 비트맵을 이용한 시퀀스 스트림 색인 기법)

  • Son, Dong-Won;Hong, Dong-Kweon
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.22 no.2
    • /
    • pp.181-186
    • /
    • 2012
  • Recently there has been many active researches on searching similar sequences from data generated with the passage of time. Those data are classified as time series data or sequence data and have different semantics from scalar data of traditional databases. In this paper similar sequence search retrieves sequences that have a similar trend of value changes. At first we have transformed the original sequences by applying DFT. The converted data are more suitable for trend analysis and they require less number of attributes for sequence comparisons. In addition we have developed a region-based query and we applied bitmap indexes which could show better performance in data warehouse. We have built bitmap indexes with varying number of attributes and we have found the least cost query plans for efficient similar sequence searches.

KBUD: The Korea Brain UniGene Database

  • Jeon, Yeo-Jin;Oh, Jung-Hwa;Yang, Jin-Ok;Kim, Nam-Soon
    • Genomics & Informatics
    • /
    • v.3 no.3
    • /
    • pp.86-93
    • /
    • 2005
  • Human brain EST data provide important clues for our understanding of the molecular biology associated with the function of the normal brain and the molecular pathophysiology with brain disorders. To systematically and efficiently study the function and disorders of the human brain, 45,773 human brain ESTs were collected from 27 human brain cDNA libraries, which were constructed from normal brains and brain disorders such as brain tumors, Parkinson's disease (PO) and epilepsy. An analysis of 45,773 human brain ESTs using our EST analysis pipeline resulted in 38,396 high-quality ESTs and 35,906 ESTs, which were coalesced into 8,246 unique gene clusters, showing a significant similarity to known genes in the human RefSeq, human mRNAs and UniGene database. In addition, among 8,246 gene clusters, 4,287 genes ($52\%$) were found to contain full-length cONA clones. To facilitate the extraction of useful information in collected these human brain ESTs, we developed a user-friendly interface system, the Korea Brain Unigene Database (KBUD). The KBUD web interface allows access to our human brain data through three major search modes, the BioCarta pathway, keywords and BLAST searches. Each result when viewed in KBUD offers comprehensive information concerning the analyzed human brain ESTs provided by our data as well as data linked to various other publiC databases. The user-friendly developed KBUD, the first world-wide web interface for human brain EST data with ESTs of human brain disorders as well as normal brains, will be a helpful system for developing a better understanding of the underlying mechanisms of the normal brain well as brain disorders. The KBUD system is freely accessible at http://kugi.kribb.re.kr/KU/cgi -bin/brain. pI.

A Study on the Design of Case-based Reasoning Office Knowledge Recommender System for Office Professionals (사례기반추론을 이용한 사무지식 추천시스템)

  • Kim, Myong-Ok;Na, Jung-Ah
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.3
    • /
    • pp.131-146
    • /
    • 2011
  • It is becoming more essential than ever for office professionals to become competent in information collection/gathering and problem solving in today's global business society. In particular, office professionals do not only assist simple chores but are also forced to make decisions as quickly and efficiently as possible in problematic situations that can end in either profit or loss to their company. Since office professionals rely heavily on their tacit knowledge to solve problems that arise in everyday business situations, it is truly helpful and efficient to refer to similar business cases from the past and share or reuse such previous business knowledge for better performance results. Case-based reasoning(CBR) is a problem-solving method which utilizes previous similar cases to solve problems. Through CBR, the closest case to the current business situation can be searched and retrieved from the case or knowledge base and can be referred to for a new solution. This reduces the time and resources needed and increase success probability. The main purpose of this study is to design a system called COKRS(Case-based reasoning Office Knowledge Recommender System) and develop a prototype for it. COKRS manages cases and their meta data, accepts key words from the user and searches the casebase for the most similar past case to the input keyword, and communicates with users to collect information about the quality of the case provided and continuously apply the information to update values on the similarity table. Core concepts like system architecture, definition of a case, meta database, similarity table have been introduced, and also an algorithm to retrieve all similar cases from past work history has also been proposed. In this research, a case is best defined as a work experience in office administration. However, defining a case in office administration was not an easy task in reality. We surveyed 10 office professionals in order to get an idea of how to define a case in office administration and found out that in most cases any type of office work is to be recorded digitally and/or non-digitally. Therefore, we have defined a record or document case as for COKRS. Similarity table was composed of items of the result of job analysis for office professionals conducted in a previous research. Values between items of the similarity table were initially set to those from researchers' experiences and literature review. The results of this study could also be utilized in other areas of business for knowledge sharing wherever it is necessary and beneficial to share and learn from past experiences. We expect this research to be a reference for researchers and developers who are in this area or interested in office knowledge recommendation system based on CBR. Focus group interview(FGI) was conducted with ten administrative assistants carefully selected from various areas of business. They were given a chance to try out COKRS in an actual work setting and make some suggestions for future improvement. FGI has identified the user-interface for saving and searching cases for keywords as the most positive aspect of COKRS, and has identified the most urgently needed improvement as transforming tacit knowledge and knowhow into recorded documents more efficiently. Also, the focus group has mentioned that it is essential to secure enough support, encouragement, and reward from the company and promote positive attitude and atmosphere for knowledge sharing for everybody's benefit in the company.

An Intelligence Support System Research on KTX Rolling Stock Failure Using Case-based Reasoning and Text Mining (사례기반추론과 텍스트마이닝 기법을 활용한 KTX 차량고장 지능형 조치지원시스템 연구)

  • Lee, Hyung Il;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.47-73
    • /
    • 2020
  • KTX rolling stocks are a system consisting of several machines, electrical devices, and components. The maintenance of the rolling stocks requires considerable expertise and experience of maintenance workers. In the event of a rolling stock failure, the knowledge and experience of the maintainer will result in a difference in the quality of the time and work to solve the problem. So, the resulting availability of the vehicle will vary. Although problem solving is generally based on fault manuals, experienced and skilled professionals can quickly diagnose and take actions by applying personal know-how. Since this knowledge exists in a tacit form, it is difficult to pass it on completely to a successor, and there have been studies that have developed a case-based rolling stock expert system to turn it into a data-driven one. Nonetheless, research on the most commonly used KTX rolling stock on the main-line or the development of a system that extracts text meanings and searches for similar cases is still lacking. Therefore, this study proposes an intelligence supporting system that provides an action guide for emerging failures by using the know-how of these rolling stocks maintenance experts as an example of problem solving. For this purpose, the case base was constructed by collecting the rolling stocks failure data generated from 2015 to 2017, and the integrated dictionary was constructed separately through the case base to include the essential terminology and failure codes in consideration of the specialty of the railway rolling stock sector. Based on a deployed case base, a new failure was retrieved from past cases and the top three most similar failure cases were extracted to propose the actual actions of these cases as a diagnostic guide. In this study, various dimensionality reduction measures were applied to calculate similarity by taking into account the meaningful relationship of failure details in order to compensate for the limitations of the method of searching cases by keyword matching in rolling stock failure expert system studies using case-based reasoning in the precedent case-based expert system studies, and their usefulness was verified through experiments. Among the various dimensionality reduction techniques, similar cases were retrieved by applying three algorithms: Non-negative Matrix Factorization(NMF), Latent Semantic Analysis(LSA), and Doc2Vec to extract the characteristics of the failure and measure the cosine distance between the vectors. The precision, recall, and F-measure methods were used to assess the performance of the proposed actions. To compare the performance of dimensionality reduction techniques, the analysis of variance confirmed that the performance differences of the five algorithms were statistically significant, with a comparison between the algorithm that randomly extracts failure cases with identical failure codes and the algorithm that applies cosine similarity directly based on words. In addition, optimal techniques were derived for practical application by verifying differences in performance depending on the number of dimensions for dimensionality reduction. The analysis showed that the performance of the cosine similarity was higher than that of the dimension using Non-negative Matrix Factorization(NMF) and Latent Semantic Analysis(LSA) and the performance of algorithm using Doc2Vec was the highest. Furthermore, in terms of dimensionality reduction techniques, the larger the number of dimensions at the appropriate level, the better the performance was found. Through this study, we confirmed the usefulness of effective methods of extracting characteristics of data and converting unstructured data when applying case-based reasoning based on which most of the attributes are texted in the special field of KTX rolling stock. Text mining is a trend where studies are being conducted for use in many areas, but studies using such text data are still lacking in an environment where there are a number of specialized terms and limited access to data, such as the one we want to use in this study. In this regard, it is significant that the study first presented an intelligent diagnostic system that suggested action by searching for a case by applying text mining techniques to extract the characteristics of the failure to complement keyword-based case searches. It is expected that this will provide implications as basic study for developing diagnostic systems that can be used immediately on the site.