• Title/Summary/Keyword: extraction techniques

Search Result 894, Processing Time 0.024 seconds

Natural language processing techniques for bioinformatics

  • Tsujii, Jun-ichi
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2003.10a
    • /
    • pp.3-3
    • /
    • 2003
  • With biomedical literature expanding so rapidly, there is an urgent need to discover and organize knowledge extracted from texts. Although factual databases contain crucial information the overwhelming amount of new knowledge remains in textual form (e.g. MEDLINE). In addition, new terms are constantly coined as the relationships linking new genes, drugs, proteins etc. As the size of biomedical literature is expanding, more systems are applying a variety of methods to automate the process of knowledge acquisition and management. In my talk, I focus on the project, GENIA, of our group at the University of Tokyo, the objective of which is to construct an information extraction system of protein - protein interaction from abstracts of MEDLINE. The talk includes (1) Techniques we use fDr named entity recognition (1-a) SOHMM (Self-organized HMM) (1-b) Maximum Entropy Model (1-c) Lexicon-based Recognizer (2) Treatment of term variants and acronym finders (3) Event extraction using a full parser (4) Linguistic resources for text mining (GENIA corpus) (4-a) Semantic Tags (4-b) Structural Annotations (4-c) Co-reference tags (4-d) GENIA ontology I will also talk about possible extension of our work that links the findings of molecular biology with clinical findings, and claim that textual based or conceptual based biology would be a viable alternative to system biology that tends to emphasize the role of simulation models in bioinformatics.

  • PDF

Comparison of the Effects of Blending and Juicing on the Phytochemicals Contents and Antioxidant Capacity of Typical Korean Kernel Fruit Juices

  • Pyo, Young-Hee;Jin, Yoo-Jeong;Hwang, Ji-Young
    • Preventive Nutrition and Food Science
    • /
    • v.19 no.2
    • /
    • pp.108-114
    • /
    • 2014
  • Four Korean kernel fruit (apple, pear, persimmon, and mandarin orange) juices were obtained by household processing techniques (i.e., blending, juicing). Whole and flesh fractions of each fruit were extracted by a blender or a juicer and then examined for phytochemical content (i.e., organic acids, polyphenol compounds). The antioxidant capacity of each juice was determined by ferric reducing antioxidant power (FRAP) and 2,2-diphenyl-1-picrylhydrazyl (DPPH) assays. Results revealed that juices that had been prepared by blending whole fruits had stronger antioxidant activities and contained larger amounts of phenolic compounds than juices that had been prepared by juicing the flesh fraction of the fruit. However, the concentration of ascorbic acid in apple, pear, and mandarin orange juices was significantly (P<0.05) higher in juice that had been processed by juicing, rather than blending. The juices with the highest ascorbic acid (233.9 mg/serving), total polyphenols (862.3 mg gallic acid equivalents/serving), and flavonoids (295.1 mg quercetin equivalents/serving) concentrations were blended persimmon juice, blended mandarin orange juice, and juiced apple juice, respectively. These results indicate that juice extraction techniques significantly (P<0.05) influences the phytochemical levels and antioxidant capacity of fruit juices.

Future Prospects and Health Benefits of Functional Ingredients from Marine Bio-resources: A review

  • Samarakoon, Kalpa W.;Elvitigala, Don Anushka Sandaruwan;Lakmal, H.H. Chaminda;Kim, Young-Mog;Jeon, You-Jin
    • Fisheries and Aquatic Sciences
    • /
    • v.17 no.3
    • /
    • pp.275-290
    • /
    • 2014
  • The marine ecosystem represents a vast and dynamic array of bio-resources attributed with its huge diversity and considered as potential untapped reservoirs for the development of functional foods for future health markets. Basically, marine microorganisms, sponges, algae, invertebrates such as crustaceans and mollusks along with marine fish species can be considered as marine bio-resources, which can be utilized to obtain different health benefits for humans, directly or after processing. Most of the bio-molecular components, such as lipids and proteins from these marine bio-resources, which can be extracted in large scale using the modern and advanced biotechnological approaches, are suitable drug candidates for the pharmaceutical industry as well as functional food ingredients for the food industry. Moreover, the furtherance of high throughput molecular biological techniques has already been incorporated with identification, mining and extraction of molecular components from marine bio-resources. In this review, potential marine bio-resources with respect to their extractable bio-molecules were described in details, while explaining the present and prospective methods of identification and extraction, which are integrated with advanced techniques in modern biotechnology. In addition, this provides an overview of future trends in marine biotechnology.

Implementation of Digital Image Processing for Coastline Extraction from Synthetic Aperture Radar Imagery

  • Lee, Dong-Cheon;Seo, Su-Young;Lee, Im-Pyeong;Kwon, Jay-Hyoun;Tuell, Grady H.
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.25 no.6_1
    • /
    • pp.517-528
    • /
    • 2007
  • Extraction of the coastal boundary is important because the boundary serves as a reference in the demarcation of maritime zones such as territorial sea, contiguous zone, and exclusive economic zone. Accurate nautical charts also depend on well established, accurate, consistent, and current coastline delineation. However, to identify the precise location of the coastal boundary is a difficult task due to tidal and wave motions. This paper presents an efficient way to extract coastlines by applying digital image processing techniques to Synthetic Aperture Radar (SAR) imagery. Over the past few years, satellite-based SAR and high resolution airborne SAR images have become available, and SAR has been evaluated as a new mapping technology. Using remotely sensed data gives benefits in several aspects, especially SAR is largely unaffected by weather constraints, is operational at night time over a large area, and provides high contrast between water and land areas. Various image processing techniques including region growing, texture-based image segmentation, local entropy method, and refinement with image pyramid were implemented to extract the coastline in this study. Finally, the results were compared with existing coastline data derived from aerial photographs.

Hybrid Intelligent Web Recommendation Systems Based on Web Data Mining and Case-Based Reasoning

  • Kim, Jin-Sung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.3
    • /
    • pp.366-370
    • /
    • 2003
  • In this research, we suggest a hybrid intelligent Web recommendation systems based on Web data mining and case-based reasoning (CBR). One of the important research topics in the field of Internet business is blending artificial intelligence (AI) techniques with knowledge discovering in database (KDD) or data mining (DM). Data mining is used as an efficient mechanism in reasoning for association knowledge between goods and customers' preference. In the field of data mining, the features, called attributes, are often selected primary for mining the association knowledge between related products. Therefore, most of researches, in the arena of Web data mining, used association rules extraction mechanism. However, association rules extraction mechanism has a potential limitation in flexibility of reasoning. If there are some goods, which were not retrieved by association rules-based reasoning, we can't present more information to customer. To overcome this limitation case, we combined CBR with Web data mining. CBR is one of the AI techniques and used in problems for which it is difficult to solve with logical (association) rules. A Web-log data gathered in real-world Web shopping mall was given to illustrate the quality of the proposed hybrid recommendation mechanism. This Web shopping mall deals with remote-controlled plastic models such as remote-controlled car, yacht, airplane, and helicopter. The experimental results showed that our hybrid recommendation mechanism could reflect both association knowledge and implicit human knowledge extracted from cases in Web databases.

Automatic Intrapulse Modulated LPI Radar Waveform Identification (펄스 내 변조 저피탐 레이더 신호 자동 식별)

  • Kim, Minjun;Kong, Seung-Hyun
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.21 no.2
    • /
    • pp.133-140
    • /
    • 2018
  • In electronic warfare(EW), low probability of intercept(LPI) radar signal is a survival technique. Accordingly, identification techniques of the LPI radar waveform have became significant recently. In this paper, classification and extracting parameters techniques for 7 intrapulse modulated radar signals are introduced. We propose a technique of classifying intrapulse modulated radar signals using Convolutional Neural Network(CNN). The time-frequency image(TFI) obtained from Choi-William Distribution(CWD) is used as the input of CNN without extracting the extra feature of each intrapulse modulated radar signals. In addition a method to extract the intrapulse radar modulation parameters using binary image processing is introduced. We demonstrate the performance of the proposed intrapulse radar waveform identification system. Simulation results show that the classification system achieves a overall correct classification success rate of 90 % or better at SNR = -6 dB and the parameter extraction system has an overall error of less than 10 % at SNR of less than -4 dB.

Footprint extraction of urban buildings with LIDAR data

  • Kanniah, Kasturi Devi;Gunaratnam, Kasturi;Mohd, Mohd Ibrahim Seeni
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.113-119
    • /
    • 2003
  • Building information is extremely important for many applications within the urban environment. Sufficient techniques and user-friendly tools for information extraction from remotely sensed imagery are urgently needed. This paper presents an automatic and manual approach for extracting footprints of buildings in urban areas from airborne Light Detection and Ranging (LIDAR) data. First a digital surface model (DSM) was generated from the LIDAR point data. Then, objects higher than the ground surface are extracted using the generated DSM. Based on general knowledge on the study area and field visits, buildings were separated from other objects. The automatic technique for extracting the building footprints was based on different window sizes and different values of image add backs, while the manual technique was based on image segmentation. A comparison was then made to see how precise the two techniques are in detecting and extracting building footprints. Finally, the results were compared with manually digitized building reference data to conduct an accuracy assessment and the result shows that LIDAR data provide a better shape characterization of each buildings.

  • PDF

Remote Sensing Application for the Mineralized Zone Using Landsat TM Data (LANSAT TM자료에 의한 광화대조사 응용기법개발)

  • 姜必鍾;智光薰;曺民肇;崔映燮;Choi, Young Sup
    • Korean Journal of Remote Sensing
    • /
    • v.2 no.2
    • /
    • pp.79-94
    • /
    • 1986
  • TM data, which have better resolution in spatial and spectral than MSS data, were used for this study, and several Image Processing Techniques (IPT) were examined for finding the best IPT to fit to lineament extraction and mineralized zone mapping. The Ryeongnam area was selected as test area, because the area is one of major mineralized zones in Korea and its hydrothermal alteration zone is wider and deeper than other areas. The spatial filtering method is most optimum one for limeament extraction: that is, the directional spatial filtering is most efficient to detect N-S, E-W direction lineaments on the image, and the high boost filtering can be applied for mapping all direction lineaments. The ratio method was selected for detecting altered zone. It is possible to make several tens combinations in ratio with 7 bands of TM data, but considering spectral characteristics of each band of TM to the geological meterials and vegetation, the band 4/band 3(A), band 5/band 7(B), and B/A ratio methods were chosen among them. The 5/7 ratio image did not show clearly the altered area due to noise from vegetation cover, so the 4/3 ratio imae was used for trying to decrease the effect of vegetation. As a result the B/A ratio image showed quite nicely the altered zone of the test area. In conclusion, the spatial filtering is the best image processing techniques for lineament mapping, and the B/A ratio image in TM data is useful for the mineralized zone mapping.

Comparison of Performance Factors for Automatic Classification of Records Utilizing Metadata (메타데이터를 활용한 기록물 자동분류 성능 요소 비교)

  • Young Bum Gim;Woo Kwon Chang
    • Journal of the Korean Society for information Management
    • /
    • v.40 no.3
    • /
    • pp.99-118
    • /
    • 2023
  • The objective of this study is to identify performance factors in the automatic classification of records by utilizing metadata that contains the contextual information of records. For this study, we collected 97,064 records of original textual information from Korean central administrative agencies in 2022. Various classification algorithms, data selection methods, and feature extraction techniques are applied and compared with the intent to discern the optimal performance-inducing technique. The study results demonstrated that among classification algorithms, Random Forest displayed higher performance, and among feature extraction techniques, the TF method proved to be the most effective. The minimum data quantity of unit tasks had a minimal influence on performance, and the addition of features positively affected performance, while their removal had a discernible negative impact.

Improving the Cyber Security over Banking Sector by Detecting the Malicious Attacks Using the Wrapper Stepwise Resnet Classifier

  • Damodharan Kuttiyappan;Rajasekar, V
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.6
    • /
    • pp.1657-1673
    • /
    • 2023
  • With the advancement of information technology, criminals employ multiple cyberspaces to promote cybercrime. To combat cybercrime and cyber dangers, banks and financial institutions use artificial intelligence (AI). AI technologies assist the banking sector to develop and grow in many ways. Transparency and explanation of AI's ability are required to preserve trust. Deep learning protects client behavior and interest data. Deep learning techniques may anticipate cyber-attack behavior, allowing for secure banking transactions. This proposed approach is based on a user-centric design that safeguards people's private data over banking. Here, initially, the attack data can be generated over banking transactions. Routing is done for the configuration of the nodes. Then, the obtained data can be preprocessed for removing the errors. Followed by hierarchical network feature extraction can be used to identify the abnormal features related to the attack. Finally, the user data can be protected and the malicious attack in the transmission route can be identified by using the Wrapper stepwise ResNet classifier. The proposed work outperforms other techniques in terms of attack detection and accuracy, and the findings are depicted in the graphical format by employing the Python tool.