• 제목/요약/키워드: Datasets Description

검색결과 34건 처리시간 0.024초

Pestalotiopsis kaki sp. nov., a Novel Species Isolated from Persimmon Tree (Diospyros kaki) Bark in Korea

  • Das, Kallol;Lee, Seung-Yeol;Jung, Hee-Young
    • Mycobiology
    • /
    • 제49권1호
    • /
    • pp.54-60
    • /
    • 2021
  • During the screening of Korean microflora, a fungal strain (KNU-PT-1804) belonging to the genus Pestalotiopsis was isolated from persimmon tree (Diospyros kaki) bark collected from North Gyeongsang Province, Korea. The strain, KNU-PT-1804, produced smaller conidia compared with related species P. kenyana, P. neglecta, and P. telopeae. The novelty of the strain was confirmed based on phylogenetic analysis using molecular datasets of internal transcribed spacer (ITS) regions, β-tubulin (TUB2), and translation elongation factor 1-alpha (TEF1α) genes. Molecular phylogeny strongly supports that the strain is distinct from previously known Pestalotiopsis species, and we proposed the novel species, Pestalotiopsis kaki sp. nov., and provide a detailed description and illustration.

Digital Epidemiology: Use of Digital Data Collected for Non-epidemiological Purposes in Epidemiological Studies

  • Park, Hyeoun-Ae;Jung, Hyesil;On, Jeongah;Park, Seul Ki;Kang, Hannah
    • Healthcare Informatics Research
    • /
    • 제24권4호
    • /
    • pp.253-262
    • /
    • 2018
  • Objectives: We reviewed digital epidemiological studies to characterize how researchers are using digital data by topic domain, study purpose, data source, and analytic method. Methods: We reviewed research articles published within the last decade that used digital data to answer epidemiological research questions. Data were abstracted from these articles using a data collection tool that we developed. Finally, we summarized the characteristics of the digital epidemiological studies. Results: We identified six main topic domains: infectious diseases (58.7%), non-communicable diseases (29.4%), mental health and substance use (8.3%), general population behavior (4.6%), environmental, dietary, and lifestyle (4.6%), and vital status (0.9%). We identified four categories for the study purpose: description (22.9%), exploration (34.9%), explanation (27.5%), and prediction and control (14.7%). We identified eight categories for the data sources: web search query (52.3%), social media posts (31.2%), web portal posts (11.9%), webpage access logs (7.3%), images (7.3%), mobile phone network data (1.8%), global positioning system data (1.8%), and others (2.8%). Of these, 50.5% used correlation analyses, 41.3% regression analyses, 25.6% machine learning, and 19.3% descriptive analyses. Conclusions: Digital data collected for non-epidemiological purposes are being used to study health phenomena in a variety of topic domains. Digital epidemiology requires access to large datasets and advanced analytics. Ensuring open access is clearly at odds with the desire to have as little personal data as possible in these large datasets to protect privacy. Establishment of data cooperatives with restricted access may be a solution to this dilemma.

Use of the Quantitatively Transformed Field Soil Structure Description of the US National Pedon Characterization Database to Improve Soil Pedotransfer Function

  • Yoon, Sung-Won;Gimenez, Daniel;Nemes, Attila;Chun, Hyen-Chung;Zhang, Yong-Seon;Sonn, Yeon-Kyu;Kang, Seong-Soo;Kim, Myung-Sook;Kim, Yoo-Hak;Ha, Sang-Keun
    • 한국토양비료학회지
    • /
    • 제44권5호
    • /
    • pp.944-958
    • /
    • 2011
  • Soil hydraulic properties such as hydraulic conductivity or water retention which are costly to measure can be indirectly generated by soil pedotransfer function (PTF) using easily obtainable soil data. The field soil structure description which is routinely recorded could also be used in PTF as an input to reduce the uncertainty. The purposes of this study were to use qualitative morphological soil structure descriptions and soil structural index into PTF and to evaluate their contribution in the prediction of soil hydraulic properties. We transformed categorical morphological descriptions of soil structure into quantitative values using categorical principal component analysis (CATPCA). This approach was tested with a large data set from the US National Pedon Characterization database with the aid of a categorical regression tree analysis. Six different PTFs were used to predict the saturated hydraulic conductivity and those results were averaged to quantify the uncertainty. Quantified morphological description was successively used in multiple linear regression approach to predict the averaged ensemble saturated conductivity. The selected stepwise regression model with only the transformed morphological variables and structural index as predictors predicted the $K_{sat}$ with $r^2$ = 0.48 (p = 0.018), indicating the feasibility of CATPCA approach. In a regression tree analysis, soil structure index and soil texture turned out to be important factors in the prediction of the hydraulic properties. Among structural descriptions size class turned out to be an important grouping parameter in the regression tree. Bulk density, clay content, W33 and structural index explained clusters selected by a two step clustering technique, implying the morphologically described soil structural features are closely related to soil physical as well as hydraulic properties. Although this study provided relatively new method which related soil structure description to soil structure index, the same approach should be tested using a datasets containing the actual measurement of hydraulic properties. More insight on the predictive power of soil structure index to estimate hydraulic properties would be achieved by considering measured the saturated hydraulic conductivity and the soil water retention.

맵리듀스 기반 대량 RDF 데이터셋 압축 변환 및 저장 방법 (Compression Conversion and Storing of Large RDF datasets based on MapReduce)

  • 김인아;이경하;이규철
    • 한국정보통신학회논문지
    • /
    • 제26권4호
    • /
    • pp.487-494
    • /
    • 2022
  • 최근 데이터를 활용한 분석에 대한 수요와 함께 분석 데이터인 지식 그래프의 크기는 점차 증가하여, 웹에서 수집한 데이터를 지식 그래프로 추출하였을 때 약 820억개의 엣지(Edge)를 가지는 수준까지 도달하였다. 많은 지식 그래프들은 웹 자원에 대한 메타데이터를 표현하기 위한 W3C 표준인 RDF(Resource Description Framework) 형식으로 표현되며, RDF 특성으로 인해 기존의 RDF 저장소들은 대량 RDF 데이터를 압축하고 저장할 때 처리 시간의 오버헤드가 발생하는 문제점을 가진다. 본 논문은 이러한 문제점을 개선하기 위해, 맵리듀스를 사용하여 대량 RDF 데이터를 정수 ID로 압축 변환하고, 수직 분할하여 저장하는 방법을 제안한다. 본 논문에서 제안한 방법은 RDF-3X와 비교하였을 때 최대 25.2배, H2RDF+와 비교하였을 때 최대 3.7배까지의 높은 성능 향상을 보였다.

Image classification and captioning model considering a CAM-based disagreement loss

  • Yoon, Yeo Chan;Park, So Young;Park, Soo Myoung;Lim, Heuiseok
    • ETRI Journal
    • /
    • 제42권1호
    • /
    • pp.67-77
    • /
    • 2020
  • Image captioning has received significant interest in recent years, and notable results have been achieved. Most previous approaches have focused on generating visual descriptions from images, whereas a few approaches have exploited visual descriptions for image classification. This study demonstrates that a good performance can be achieved for both description generation and image classification through an end-to-end joint learning approach with a loss function, which encourages each task to reach a consensus. When given images and visual descriptions, the proposed model learns a multimodal intermediate embedding, which can represent both the textual and visual characteristics of an object. The performance can be improved for both tasks by sharing the multimodal embedding. Through a novel loss function based on class activation mapping, which localizes the discriminative image region of a model, we achieve a higher score when the captioning and classification model reaches a consensus on the key parts of the object. Using the proposed model, we established a substantially improved performance for each task on the UCSD Birds and Oxford Flowers datasets.

Alternaria brassicifolii sp. nov. Isolated from Brassica rapa subsp. pekinensis in Korea

  • Deng, Jian Xin;Li, Mei Jia;Paul, Narayan Chandra;Oo, May Moe;Lee, Hyang Burm;Oh, Sang-Keun;Yu, Seung Hun
    • Mycobiology
    • /
    • 제46권2호
    • /
    • pp.172-176
    • /
    • 2018
  • A new species belonging to the genus Alternaria was isolated from the necrotic leaf spots of Brassica rapa subsp. pekinensis in Yuseong district, Daejeon, Korea. It is an occasional isolate, not an etiological agent, which is morphologically similar to A. broccoli-italicae, but differs in conidial size and conidiophore shape. Phylogenetic analysis using the sequence datasets of the internal transcribed spacer (ITS) region of the rDNA, glyceraldehyde-3-phosphate dehydrogenase (gpd), and plasma membrane ATPase genes showed that it is distantly related to A. broccoli-italicae and closely related to Alternaria species in the section Pseudoalternaria, which belonged to a clade basal to the section Infectoriae. Morphologically, the species is unique because it produces solitary conidia or conidial chains (two units), unlike the four members in the section Pseudoalternaria that produce conidia as short branched chains. It exhibits weak pathogenicity in the host plant. This report includes the description and illustration of A. brassicifolii as a new species.

Salient Object Detection Based on Regional Contrast and Relative Spatial Compactness

  • Xu, Dan;Tang, Zhenmin;Xu, Wei
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제7권11호
    • /
    • pp.2737-2753
    • /
    • 2013
  • In this study, we propose a novel salient object detection strategy based on regional contrast and relative spatial compactness. Our algorithm consists of four basic steps. First, we learn color names offline using the probabilistic latent semantic analysis (PLSA) model to find the mapping between basic color names and pixel values. The color names can be used for image segmentation and region description. Second, image pixels are assigned to special color names according to their values, forming different color clusters. The saliency measure for every cluster is evaluated by its spatial compactness relative to other clusters rather than by the intra variance of the cluster alone. Third, every cluster is divided into local regions that are described with color name descriptors. The regional contrast is evaluated by computing the color distance between different regions in the entire image. Last, the final saliency map is constructed by incorporating the color cluster's spatial compactness measure and the corresponding regional contrast. Experiments show that our algorithm outperforms several existing salient object detection methods with higher precision and better recall rates when evaluated using public datasets.

후방향 전진 추론을 이용한 RDF 모델의 효율적인 변경 탐지 (Efficient Change Detection between RDF Models Using Backward Chaining Strategy)

  • 임동혁;김형주
    • 한국정보과학회논문지:컴퓨팅의 실제 및 레터
    • /
    • 제15권2호
    • /
    • pp.125-133
    • /
    • 2009
  • RDF(Resource Description Framework)는 시맨틱 웹에서 메타 정보를 기술하는 온톨로지 언어로 많이 사용되고 있다. 온톨로지는 실세계에 대한 모델링을 기반으로 하기 때문에 끊임없이 갱신이 발생한다. 이런 갱신을 찾고 분석하는 일은 지식 관리 시스템에서 핵심이 된다. 기존의 RDF 모델에 대한 변경 탐지 기법들은 구조적 변경에 초점을 두었으나 RDFS 함의 규칙을 적용하여 좀 더 작은 크기의 변경 부분을 찾는 연구들이 소개되고 있다. 하지만 RDF 모델의 추론은 데이타 크기와 시간의 증가에 영향을 미친다. 본 논문에서는 RDFS 함의 규칙을 효율적으로 사용하는 변경 탐지 기법을 제안한다. 제안된 기법은 후방향 전진 추론 기반으로 모델 일부분에만 추론을 적용하여 변경 내용을 계산한다. 실제 사용하는 RDF 데이타들을 사용하여 기존의 변경 탐지 기법과의 비교 실험을 통해 성능을 향상시킬 수 있음을 보인다.

Algorithms for Multi-sensor and Multi-primitive Photogrammetric Triangulation

  • Shin, Sung-Woong;Habib, Ayman F.;Ghanma, Mwafag;Kim, Chang-Jae;Kim, Eui-Myoung
    • ETRI Journal
    • /
    • 제29권4호
    • /
    • pp.411-420
    • /
    • 2007
  • The steady evolution of mapping technology is leading to an increasing availability of multi-sensory geo-spatial datasets, such as data acquired by single-head frame cameras, multi-head frame cameras, line cameras, and light detection and ranging systems, at a reasonable cost. The complementary nature of the data collected by these systems makes their integration to obtain a complete description of the object space. However, such integration is only possible after accurate co-registration of the collected data to a common reference frame. The registration can be carried out reliably through a triangulation procedure which considers the characteristics of the involved data. This paper introduces algorithms for a multi-primitive and multi-sensory triangulation environment, which is geared towards taking advantage of the complementary characteristics of spatial data available from the above mentioned sensors. The triangulation procedure ensures the alignment of involved data to a common reference frame. The devised methodologies are tested and proven efficient through experiments using real multi-sensory data.

  • PDF

Robust Features and Accurate Inliers Detection Framework: Application to Stereo Ego-motion Estimation

  • MIN, Haigen;ZHAO, Xiangmo;XU, Zhigang;ZHANG, Licheng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권1호
    • /
    • pp.302-320
    • /
    • 2017
  • In this paper, an innovative robust feature detection and matching strategy for visual odometry based on stereo image sequence is proposed. First, a sparse multiscale 2D local invariant feature detection and description algorithm AKAZE is adopted to extract the interest points. A robust feature matching strategy is introduced to match AKAZE descriptors. In order to remove the outliers which are mismatched features or on dynamic objects, an improved random sample consensus outlier rejection scheme is presented. Thus the proposed method can be applied to dynamic environment. Then, geometric constraints are incorporated into the motion estimation without time-consuming 3-dimensional scene reconstruction. Last, an iterated sigma point Kalman Filter is adopted to refine the motion results. The presented ego-motion scheme is applied to benchmark datasets and compared with state-of-the-art approaches with data captured on campus in a considerably cluttered environment, where the superiorities are proved.