• Title/Summary/Keyword: data extract

Development of an Organism-specific Protein Interaction Database with Supplementary Data from the Web Sources (다양한 웹 데이터를 이용한 특정 유기체의 단백질 상호작용 데이터베이스 개발)

  • Hwang, Doo-Sung
    • The KIPS Transactions:PartD / v.9D no.6 / pp.1091-1096 / 2002
  • This paper presents the development of a protein interaction database. The system has three characteristics. First, it maintains not only experimentally collected interaction data but also genomic information for the proteins. Second, it extracts details on interacting proteins through purpose-built wrappers. Third, it is wrapper-based, extracting biologically meaningful data from various web sources and integrating them into a relational database. The system adopts a layered, modular architecture built on a wrapper-mediator approach to resolve the syntactic and semantic heterogeneity among multiple data sources. Currently, the system has wrapped the relevant data for about 40% of roughly 11,500 proteins on average from various accessible sources. The wrapper-mediator approach makes protein interaction data comprehensive and useful by supporting data interoperability and integration. The database under development will be useful for mining further knowledge and for the analysis of human life in proteomics studies.
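
The wrapper-mediator idea described in this abstract can be illustrated with a minimal sketch. Everything below is hypothetical and not taken from the paper: the two source formats, the `ProteinRecord` schema, and the function names are assumptions chosen only to show how per-source wrappers hide syntactic differences and a mediator merges the results under one identifier.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProteinRecord:
    """Common schema that every wrapper maps its source into (hypothetical)."""
    protein_id: str
    name: str
    source: str


def wrap_source_a(raw: dict) -> ProteinRecord:
    # Hypothetical source A delivers dicts with "acc" and "desc" keys.
    return ProteinRecord(raw["acc"].upper(), raw["desc"].strip(), "source_a")


def wrap_source_b(line: str) -> ProteinRecord:
    # Hypothetical source B delivers tab-separated "id<TAB>name" lines.
    protein_id, name = line.split("\t")
    return ProteinRecord(protein_id.upper(), name.strip(), "source_b")


def mediate(records):
    """Group records from all wrappers by their normalized protein id."""
    merged = {}
    for record in records:
        merged.setdefault(record.protein_id, []).append(record)
    return merged


records = [
    wrap_source_a({"acc": "p12345", "desc": " kinase A "}),
    wrap_source_b("P12345\tKinase A (alt. name)"),
    wrap_source_b("q99999\ttransporter B"),
]
merged = mediate(records)
```

In this sketch the mediator only groups records. A real system would also reconcile conflicting fields (semantic heterogeneity) before loading them into relational tables.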

The application of GIS and RS for extracting Sumjin Watershed hydrologic-parameter (섬진강 유역 수문인자 추출을 위한 GIS와 RS의 활용)

  • 김지은;이근상;조기성;장영률
    • Spatial Information Research / v.8 no.2 / pp.257-274 / 2000
  • Recently, the natural environment has come under pressure from rapid population growth and industrialization, and the capacity and pollution of water resources in particular have come to the fore. Efficient water resource management requires accurate topological and hydrological parameters of the watershed. However, in hydrology these data are still processed manually or with simple operations. In this paper, we present an algorithm that extracts topological and hydrological parameters for the Sumjin watershed using GIS and RS, which saves data processing time and improves confidence in the data. The extraction procedure for the topological characteristics and hydrological parameters is as follows. First, the watershed and streams are extracted from a DEM, and the curve number is extracted by overlaying the land cover map and the soil map. Next, surface parameters such as the watershed length and the slope along the watershed length are extracted by grid computation over the watershed and streams. Finally, we give a method for extracting hydrologic parameters such as Muskingum K and sub-basin lag time by computing on the surface parameters and the extracted average SCS curve number.

Efficient Keyword Extraction from Social Big Data Based on Cohesion Scoring

  • Kim, Hyeon Gyu
    • Journal of the Korea Society of Computer and Information / v.25 no.10 / pp.87-94 / 2020
  • Social reviews such as SNS feeds and blog articles have been widely used to extract keywords reflecting opinions and complaints from the users' perspective, and they often include proper nouns or new words reflecting recent trends. These words are generally not included in a dictionary, so conventional morphological analyzers may not detect and extract them from the reviews properly. In addition, the high processing time of such analyzers makes them inadequate for providing analysis results in a timely manner. This paper presents a method for efficient keyword extraction from social reviews based on the notion of cohesion scoring. Cohesion scores can be calculated from word frequencies, so keyword extraction can be performed without a dictionary. On the other hand, their accuracy can degrade when the input data has poor spacing. To address this, an algorithm is presented that improves the existing cohesion scoring mechanism using the structure of a word tree. Our experimental results show that the proposed method took only 0.008 seconds to extract keywords from 1,000 reviews, with an error ratio of 15.5%, which is better than that of existing morphological analyzers.
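
To make the frequency-based idea concrete, here is a minimal sketch of one common formulation of a cohesion score: the geometric mean of the probabilities of each next character given the preceding prefix, which reduces to `(count(word) / count(first character)) ** (1 / (n - 1))`. This formula is an assumption for illustration. The abstract does not state the exact scoring function, and the word-tree improvement it mentions is not shown. The function names and the toy token list are hypothetical.

```python
from collections import Counter


def count_prefixes(tokens):
    """Count how often each leading substring (prefix) occurs across tokens."""
    counts = Counter()
    for token in tokens:
        for end in range(1, len(token) + 1):
            counts[token[:end]] += 1
    return counts


def cohesion(word, prefix_counts):
    """Geometric mean of P(next character | prefix) along the word.

    Assumed formulation; returns 0.0 for single characters and for
    words whose first character was never seen.
    """
    n = len(word)
    if n < 2 or prefix_counts[word[0]] == 0:
        return 0.0
    return (prefix_counts[word] / prefix_counts[word[0]]) ** (1 / (n - 1))


# Toy corpus: no dictionary is needed, only observed frequencies.
tokens = ["keyword", "keywords", "keyword", "key", "kernel"]
prefix_counts = count_prefixes(tokens)
scores = {w: cohesion(w, prefix_counts) for w in ("key", "keyword", "kernel")}
```

Candidates that score high are prefixes that tend to continue in the same way each time they appear, which is why no dictionary lookup is involved.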

A Study to Extract Landuse Information from Digital Topographic Map in Urban Area (수치지형도를 이용한 도시지역 토지이용정보 추출기법에 관한 연구)

  • Min, Sook-Joo;Kim, Kye-Hyun;Kim, Kyoung-Soon
    • Journal of Korean Society for Geospatial Information Science / v.12 no.3 s.30 / pp.13-21 / 2004
  • Land use information serves as base data for land use planning and for urban and environmental management, and demand for it is rising because of ecological considerations. However, existing methods that extract land use information from aerial photographs or field surveys consume a lot of time and money. In urban areas, where land use patterns are densely aggregated, land use information needs to be classified in detail for urban planning and management. This study therefore examines a method for extracting detailed land use information from 1:1,000 digital topographic data. The method was applied to a part of metropolitan Seoul. The results show that land use information can be extracted for all areas except forest, which needs to be described in smaller spatial units. If the method of describing forest areas is improved in the future, it will be effectively applicable to city maintenance.

Extracting Rules from Neural Networks with Continuous Attributes (연속형 속성을 갖는 인공 신경망의 규칙 추출)

  • Jagvaral, Batselem;Lee, Wan-Gon;Jeon, Myung-joong;Park, Hyun-Kyu;Park, Young-Tack
    • Journal of KIISE / v.45 no.1 / pp.22-29 / 2018
  • Over the decades, neural networks have been successfully used in numerous applications from speech recognition to image classification. However, these neural networks cannot explain their results and one needs to know how and why a specific conclusion was drawn. Most studies focus on extracting binary rules from neural networks, which is often impractical to do, since data sets used for machine learning applications contain continuous values. To fill the gap, this paper presents an algorithm to extract logic rules from a trained neural network for data with continuous attributes. It uses hyperplane-based linear classifiers to extract rules with numeric values from trained weights between input and hidden layers and then combines these classifiers with binary rules learned from hidden and output layers to form non-linear classification rules. Experiments with different datasets show that the proposed approach can accurately extract logical rules for data with nonlinear continuous attributes.
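
As a rough illustration of the two-stage idea in this abstract, the sketch below reads each hidden unit as a linear (hyperplane) rule over continuous inputs and then combines the hidden units with a binary rule. The weights, biases, feature names, and the combining rule `h1 AND NOT h2` are all made-up values for illustration. This is not the paper's algorithm, which derives such rules from a trained network.

```python
def hyperplane_rule(weights, bias, names):
    """Render a hidden unit as a readable linear inequality."""
    terms = " ".join(f"{w:+g}*{n}" for w, n in zip(weights, names))
    return f"{terms} {bias:+g} > 0"


def rule_fires(weights, bias, x):
    """Evaluate the linear rule w.x + b > 0 on one input vector."""
    return sum(w * v for w, v in zip(weights, x)) + bias > 0


# Hypothetical input-to-hidden weights for two hidden units.
hidden_units = {
    "h1": ([0.8, -1.5], 0.2),
    "h2": ([-0.4, 0.9], -0.1),
}
feature_names = ["x1", "x2"]


def classify(x):
    """Hypothetical binary rule over hidden units: h1 AND NOT h2."""
    h1 = rule_fires(*hidden_units["h1"], x)
    h2 = rule_fires(*hidden_units["h2"], x)
    return h1 and not h2


rule_text = hyperplane_rule(*hidden_units["h1"], feature_names)
```

Each linear rule splits the input space with one hyperplane. The Boolean combination of several such splits is what yields a non-linear decision region.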

A Study on the extraction of hydrologic-Model input parameter using GSIS (GSIS를 이용한 수문모형 입력매개변수 추출에 관한 연구)

  • Lee, Geung-Sang;Chae, Hyo-Seok;Park, Jeong-Nam;Cho, Gi-Sung
    • Journal of Korean Society for Geospatial Information Science / v.8 no.2 s.16 / pp.11-22 / 2000
  • Efficient water resource management requires accurate topological characteristics and hydrological parameters of the watershed. However, in hydrology these data are still processed manually or with simple operations. In this paper, we present an algorithm that extracts topological characteristics and hydrological parameters for a watershed using GSIS, which saves data processing time and improves confidence in the data. We also present a method for coupling GSIS with a hydrologic model by using the extracted parameters as input parameters of the HEC-HMS hydrologic model. The extraction procedure for the topological characteristics and hydrological parameters is as follows. First, the watershed and streams are extracted from a DEM, and the curve number is extracted by overlaying the land use map and the soil map. Next, surface parameters such as the length and the slope of the longest flow path are extracted by grid computation over the watershed and streams. Finally, we give a method for extracting hydrologic parameters such as Muskingum K and sub-basin lag time by computing on the surface parameters and the extracted average SCS curve number.
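
As an illustration of how a lag time can follow from the longest flow path, its slope, and an average curve number, the sketch below uses the standard SCS (NRCS) watershed lag equation, L = l^0.8 (S + 1)^0.7 / (1900 Y^0.5), with S = 1000/CN - 10. Whether the paper uses exactly this equation is an assumption, since the abstract does not state it. The function names and sample values are hypothetical. Units follow the usual convention for this equation: flow length in feet, slope in percent, lag in hours.

```python
def potential_retention(curve_number):
    """Potential maximum retention S in inches: S = 1000/CN - 10."""
    if not 0 < curve_number <= 100:
        raise ValueError("curve number must be in (0, 100]")
    return 1000.0 / curve_number - 10.0


def scs_lag_hours(flow_length_ft, slope_percent, curve_number):
    """SCS watershed lag in hours (assumed equation, see lead-in).

    flow_length_ft: length of the longest flow path, in feet
    slope_percent:  average slope, in percent
    curve_number:   area-weighted average SCS curve number
    """
    if flow_length_ft <= 0 or slope_percent <= 0:
        raise ValueError("flow length and slope must be positive")
    s = potential_retention(curve_number)
    return (flow_length_ft ** 0.8) * ((s + 1.0) ** 0.7) / (
        1900.0 * slope_percent ** 0.5
    )


# Hypothetical sub-basin: 10,000 ft flow path, 4% slope, CN = 75.
lag = scs_lag_hours(10_000, 4.0, 75)
```

A higher curve number means less retention and therefore a shorter lag, which is the sensitivity one would expect when the curve number comes from a land use and soil overlay.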

An improved extraction technique of executable file from physical memory by analyzing file object (파일 오브젝트 분석 기반 개선된 물리 메모리 실행 파일 추출 방법)

  • Kang, Youngbok;Hwang, Hyunuk;Kim, Kibom;Noh, Bongnam
    • Journal of the Korea Institute of Information Security & Cryptology / v.24 no.5 / pp.861-870 / 2014
  • As malicious code grows more sophisticated, extracting executable files from physical memory is emerging as an important research issue. Previous physical-memory studies on executable file extraction target running files, so the extracted files are not identical to the original files stored on disk. A method is therefore needed that can extract files identical to the originals on disk and can also analyze the file information loaded in physical memory. In this paper, we propose a method for extracting executable files by analyzing Windows kernel file object information. We also analyze, through experiments, the characteristics of file data loaded in physical memory, and we demonstrate the superiority of the proposed method, which effectively extracts more of the original file data than the existing method.

Extracting Three-Dimensional Geometric Information of Roads from Integrated Multi-sensor Data using Ground Vehicle Borne System (지상 이동체 기반의 다중 센서 통합 데이터를 활용한 도로의 3차원 기하정보 추출에 관한 연구)

  • Kim, Moon-Gie;Sung, Jung-Gon
    • Journal of the Korean Association of Geographic Information Studies / v.11 no.3 / pp.68-79 / 2008
  • RoSSAV (Road Safety Survey and Analysis Vehicle), a ground vehicle-borne system developed at KICT (Korea Institute of Construction Technology), can collect road geometric data. The system can therefore evaluate road safety and analyze deficient road sections using data collected along the roads. The purpose of this study is to extract road geometric data for 3D road modeling of dangerous road sections, for which the system should be able to provide more accurate data quickly. Various sensors (circular laser scanner, GPS, INS, CCD camera, and DMI) are installed on the moving vehicle and collect road environment data. Finally, we extract 3D road geometry (center, boundary), road facilities, and slope using the integrated multi-sensor data.

Development of Extracting Method of Horizontal Alignment in a Tunnel Using Positioning Satellite Data (측위위성자료를 활용한 터널 내 평면선형 추출기법 개발)

  • Kim, Jin-Soo;Jang, Ho-Sik;Lee, Jong-Chool
    • Journal of Korean Society for Geospatial Information Science / v.11 no.2 s.25 / pp.39-45 / 2003
  • Roads have been developed throughout the history of mankind and play a significant role among traffic facilities in the economy, politics, and culture of our lives. However, road management has not been fully scientific or systematic, because governmental policies focused on construction have led to damage to and loss of drawings for existing roads. In such cases, it is difficult to manage roads using the normal cadastre because the work is time-consuming. Moreover, when satellite surveying is applied to rapidly extract road centerlines, it is impossible to obtain data on the interior of tunnels. This study therefore extracts optimum alignment data for tunnels from satellite surveying data. By comparing the satellite data with the alignment data in existing drawings, it offers a practical contribution to the efficient management and use of alignment data and road facilities when establishing an HMS (Highway Management System) for renewing and managing road alignment data.

Data Reduction Method in Massive Data Sets

  • Namo, Gecynth Torre;Yun, Hong-Won
    • Journal of information and communication convergence engineering / v.7 no.1 / pp.35-40 / 2009
  • Many researchers seek ways to improve the performance of RFID systems, and many papers have been written to address one of the major drawbacks of this potent technology: data management. Because an RFID system captures billions of data items, problems arising from dirty data and large data volumes have caused an uproar in the RFID community, and researchers are looking for ways to address them. Effective data management is especially important for handling large volumes of data. This paper presents data reduction techniques that attempt to address these data issues. It introduces a new data reduction algorithm that might be an alternative for reducing data in RFID systems, together with a process for extracting data from the reduced database. A performance study is conducted to analyze the new data reduction algorithm, and the analysis shows the utility and feasibility of our categorization reduction algorithms.