• Title/Summary/Keyword: Data-driven Research

Mutational Data Loading Routines for Human Genome Databases: the BRCA1 Case

  • Van Der Kroon, Matthijs;Ramirez, Ignacio Lereu;Levin, Ana M.;Pastor, Oscar;Brinkkemper, Sjaak
    • Journal of Computing Science and Engineering / v.4 no.4 / pp.291-312 / 2010
  • Over the last decades, a large amount of research in the genomics domain has generated, and continues to generate, terabytes, if not exabytes, of information that is stored globally in a very fragmented way. Different databases store the same data in different ways, resulting in undesired redundancy and restricted information transfer. In addition, keeping the existing databases consistent and maintaining data integrity is mainly left to human intervention, which is costly in both time and money as well as error-prone. Identifying a fixed conceptual dictionary in the form of a conceptual model thus seems crucial. This paper presents an effort to integrate the mutational data from the established genomic data source HGMD into a conceptual-model-driven database, HGDB, thereby providing useful lessons for improving the existing conceptual model of the human genome.

Predicting Arab Consumers' Preferences on the Korean Contents Distribution

  • Park, Young-Eun;Chaffar, Soumaya;Kim, Myoung-Sook;Ko, Hye-Young
    • Journal of Distribution Science / v.15 no.4 / pp.33-40 / 2017
  • Purpose - This study examines patterns in the preferences of consumers in Arab countries for Korean content, using the social media platform Facebook, given that Korean entertainment content is now distributed in the global marketplace. We then focus on developing a predictive model using a data mining technique. Research design, data and methodology - In order to understand the growth in preference for Korean content in Arab countries, we collected data from two popular Facebook pages: 'Korean movies and drama' and 'K-pop'. We then adopted a data-driven approach based on data mining techniques. Results - The results indicate that the number of likes for K-pop will increase for all North African and Middle Eastern countries. For Korean movies and drama, however, the number of likes is decreasing for Algeria, Egypt and Morocco, with Tunisia as the exception. Likewise, the number of likes for Korean movies and drama will decrease for Saudi Arabia and the United Arab Emirates, which is not the case for Iraq. Conclusions - This study notes that K-contents such as drama, movies and music are sometimes a gateway to a wider interest in Korean culture, food and brands. Moreover, the study has significant implications for developing predictive models to forecast the consumption of, and preferences for, Korean content.

Automated Derivation of Cross-sectional Numerical Information of Retaining Walls Using Point Cloud Data (점군 데이터를 활용한 옹벽의 단면 수치 정보 자동화 도출)

  • Han, Jehee;Jang, Minseo;Han, Hyungseo;Jo, Hyoungjun;Shin, Do Hyoung
    • Journal of KIBIM / v.14 no.2 / pp.1-12 / 2024
  • The paper proposes a methodology that combines the Random Sample Consensus (RANSAC) algorithm and the Point Cloud Encoder-Decoder Network (PCEDNet) algorithm to automatically extract the length of infrastructure elements from point cloud data acquired through 3D LiDAR scans of retaining walls. This methodology is expected to significantly improve time and cost efficiency compared to traditional manual measurement techniques, which are crucial for the data-driven analysis required in the precision-demanding construction sector. Additionally, the extracted positional and dimensional data can contribute to enhanced accuracy and reliability in Scan-to-BIM processes. The results of this study are anticipated to provide important insights that could accelerate the digital transformation of the construction industry. This paper provides empirical data on how the integration of digital technologies can enhance efficiency and accuracy in the construction industry, and offers directions for future research and application.

Re-approach to the Concept of Data Literacy and Its Application to Library Information Services (데이터 리터러시 개념에 대한 재접근 및 도서관 정보서비스에의 적용)

  • Lee, Jeong-Mee
    • Journal of the Korean Society for Library and Information Science / v.53 no.1 / pp.159-179 / 2019
  • The purpose of this study is to re-approach the concept of data literacy and to describe how the redefined concept differs from other literacies. It also examines why and how data literacy can be used for library and information services. The research shows that data literacy plays a central role in interacting with other literacy concepts and should be understood as a data-driven problem-solving ability that is essential for future human society. Based on these definitions, we propose applying data literacy to library information services in terms of education services and research support services. In this study, data literacy is defined as the ability to utilize the data that users need in a data-based society, and its differences from other literacies are used to explain why this ability matters to users in modern society. The paper concludes with a discussion and a proposal of which library information services can be implemented.

A Research on the PIV Algorithm Using Image Coding (영상코드화 기법을 이용한 PIV 알고리듬에 대한 연구)

  • Kim, Sung-Kyun
    • Transactions of the Korean Society of Mechanical Engineers B / v.24 no.2 / pp.153-160 / 2000
  • A Particle Image Velocimetry (PIV) algorithm is developed to analyze the whole flow field both qualitatively and quantitatively. The practical use of PIV requires fast, reliable, computer-based methods for tracking numerous particles suspended in a flow field. The TSS, NTSS, and FFT-Hybrid methods, which were developed in the area of image compression and coding, are introduced to develop a fast vector search algorithm. The numerical solution of the lid-driven cavity flow by the ADI algorithm with the Wachspress formula is used to produce synthetic data for validating the tracking algorithms. The algorithms are also applied to image data from real flow experiments. The comparisons of CPU time and mean error show that, with a small loss of accuracy, the CPU time for tracking is reduced considerably.

A Study on the Relationship between Organizational and Innovational Driven Characteristics and the Diffusion of Electronic Data Interchange (조직적 특성과 혁신유도 특성이 EDI의 확산에 미치는 영향)

  • Chung, Yoon;Noh, Young;Kang, Jae-Jung
    • Asia pacific journal of information systems / v.7 no.3 / pp.89-108 / 1997
  • This study, drawing upon research in innovation theory and information systems, investigates the relationship between organizational and innovation characteristics and the extent of internal and external diffusion of EDI in Korean firms. The data for this study were collected from 131 firms that have implemented EDI. The results of the correlation and multiple regression analyses show that elapsed time and organizational compatibility are the major predictors of EDI diffusion. Specifically, the extent of communication, elapsed time and organizational compatibility are the major predictors of internal diffusion, while centralization, organizational compatibility and elapsed time are closely related to external diffusion of EDI. The results imply that, to facilitate wide use of EDI within and beyond organizations, EDI systems should be compatible with organizational tasks, value systems and existing information systems.

Fast Motion Synthesis of Quadrupedal Animals Using a Minimum Amount of Motion Capture Data

  • Sung, Mankyu
    • ETRI Journal / v.35 no.6 / pp.1029-1037 / 2013
  • This paper introduces a novel and fast synthesizing method for 3D motions of quadrupedal animals that uses only a small set of motion capture data. Unlike human motions, animal motions are relatively difficult to capture. Also, it is a challenge to synthesize continuously changing animal motions in real time because animals have various gait types according to their speed. The algorithm proposed herein, however, is able to synthesize continuously varying motions with proper limb configuration by using only one single cyclic animal motion per gait type based on the biologically driven Froude number. During the synthesis process, each gait type is automatically determined by its speed parameter, and the transition motions, which have not been entered as input, are synthesized accordingly by the optimized asynchronous motion blending technique. At the start time, given the user's control input, the motion path and spinal joints for turning are adjusted first and then the motion is stitched at any speed with proper transition motions to synthesize a long stream of motions.
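    The speed-driven gait selection mentioned in this abstract can be illustrated with a minimal sketch. It assumes the common dimensionless form of the Froude number, Fr = v² / (g · L), where v is the forward speed and L is a characteristic leg (hip) length. The function names and the two threshold values are hypothetical and chosen only for illustration; they are not taken from the paper.

    ```python
    G = 9.81  # gravitational acceleration in m/s^2


    def froude_number(speed, leg_length):
        """Return the dimensionless Froude number v^2 / (g * L)."""
        if leg_length <= 0:
            raise ValueError("leg_length must be positive")
        return speed ** 2 / (G * leg_length)


    def select_gait(speed, leg_length, walk_max=0.5, trot_max=2.5):
        """Pick a gait label from the Froude number.

        walk_max and trot_max are illustrative assumptions, not values
        reported in the paper.
        """
        fr = froude_number(speed, leg_length)
        if fr < walk_max:
            return "walk"
        if fr < trot_max:
            return "trot"
        return "gallop"


    if __name__ == "__main__":
        # Sweep speeds for a hypothetical animal with 0.5 m leg length.
        for v in (1.0, 3.0, 5.0):
            print(v, round(froude_number(v, 0.5), 3), select_gait(v, 0.5))
    ```

    Because the Froude number normalizes speed by leg length, the same thresholds can in principle be reused across animals of different sizes, which is one reason a size-independent parameter is convenient for choosing which cyclic motion clip to play.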

Optimal Datum Unit Definition for Diagnostics of Journal Bearing System (저널베어링 상태 진단을 위한 최적의 데이터 분석 기준 설정)

  • Youn, Byeng D.;Jung, Joonha;Jeon, Byungchul;Kim, Yeon-Whan;Bae, Yong-Chae
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2014.10a / pp.84-89 / 2014
  • Data-driven methods for fault diagnostics systems often use machine learning techniques. To use such techniques, proper signal processing should be implemented, such as time synchronous averaging (TSA) for ball bearing systems. However, journal bearing diagnostics systems have received little research attention, and a proper signal processing method for them has not yet been studied. Therefore, this research suggests an optimal datum unit for a reliable journal bearing diagnostics system, along with an angular resampling process. Before time- and frequency-domain features are extracted, angular resampling is applied to each cycle of vibration data. To preserve the characteristics of the vibration signal, the averaging method is replaced by finding the optimal datum unit, which strengthens the statistical characteristics of the vibration signal. Twenty features were then extracted for various cases and evaluated by two criteria: separability and classification accuracy.

Sustainability Considerations and Satisfaction with Online Food-Delivery Services During Covid-19 Pandemic

  • CHAE, Myoung-Jin
    • Asian Journal of Business Environment / v.12 no.4 / pp.13-24 / 2022
  • Purpose: Motivated by the expedited growth and distribution of Online Food-Delivery (OFD) services, especially during the recent Covid-19 pandemic, this research aims to explore 1) how consumers' sustainability considerations are associated with satisfaction with the services via opt-out cutlery options and 2) the role of the pandemic in the relationships between sustainability considerations, attitudes toward opt-out cutlery options, and satisfaction with the OFD services. Data and Methodology: Survey data from 434 consumers in the United States, recruited from Amazon M-Turk, were analyzed using structural equation modeling. Results: Findings suggest that consumers' environmental, health, and ethical considerations are positively related to their attitudes toward opt-out cutlery options. Furthermore, attitudes toward opt-out cutlery options are positively related to satisfaction with the OFD services only when consumers feel connected with the environment, driven by perceived threats of an infectious disease (i.e., Covid-19). Conclusion: The study findings provide new insights to managers in the OFD service industry on how to promote sustainable consumption during the pandemic.

An Filtering Automatic Technique of LiDAR Data by Multiple Linear Regression Analysis (다중선형 회귀분석에 의한 LiDAR 자료의 필터링 자동화 기법)

  • Choi, Seung-Pil;Cho, Ji-Hyun;Kim, Jun-Seong
    • Journal of Korean Society for Geospatial Information Science / v.19 no.4 / pp.109-118 / 2011
  • This research estimated the accuracy of whole-area filtering, in which the plane equation is derived from the whole data set, and of regional filtering, in which a plane equation is derived for each virtual grid. Both estimates were compared against whole-area filtering in which the plane equation was deduced by multiple linear regression analysis of the ground data set. Relative to this ground-data baseline, the average accuracy of whole-area filtering using the whole data set dropped by about 2~3%, while the accuracy of regional filtering based on the virtual grid dropped by 2~4%. Moreover, a virtual grid set to 3~4 cm differed in accuracy from the standard data by about 2%. This leads to the conclusion that, for virtual grid filtering, a grid size 3~4 times larger than the LiDAR scan gap is more appropriate. Because the average accuracy differed between the approaches applied, further research on real topography is needed to improve filtering accuracy.
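
    The plane-equation filtering described in this abstract can be illustrated with a minimal sketch. It assumes the plane is modeled as z = a·x + b·y + c, fitted by ordinary least squares (multiple linear regression of z on x and y), and that points whose vertical residual exceeds a tolerance are discarded as non-ground. The function names, the tolerance and the sample coordinates are hypothetical; the paper's actual procedure and parameters may differ.

    ```python
    def _det3(m):
        """Determinant of a 3x3 matrix given as nested lists."""
        return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
                - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
                + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


    def fit_plane(points):
        """Least-squares fit of z = a*x + b*y + c to (x, y, z) points."""
        n = float(len(points))
        sx = sum(x for x, _, _ in points)
        sy = sum(y for _, y, _ in points)
        sz = sum(z for _, _, z in points)
        sxx = sum(x * x for x, _, _ in points)
        syy = sum(y * y for _, y, _ in points)
        sxy = sum(x * y for x, y, _ in points)
        sxz = sum(x * z for x, _, z in points)
        syz = sum(y * z for _, y, z in points)
        # Normal equations A * [a, b, c]^T = rhs, solved by Cramer's rule.
        A = [[sxx, sxy, sx], [sxy, syy, sy], [sx, sy, n]]
        rhs = [sxz, syz, sz]
        det = _det3(A)
        if abs(det) < 1e-12:
            raise ValueError("points are degenerate; plane is not unique")
        coeffs = []
        for col in range(3):
            m = [row[:] for row in A]
            for r in range(3):
                m[r][col] = rhs[r]
            coeffs.append(_det3(m) / det)
        return tuple(coeffs)


    def filter_ground(points, plane, tol):
        """Keep points whose vertical distance to the plane is within tol."""
        a, b, c = plane
        return [p for p in points if abs(p[2] - (a * p[0] + b * p[1] + c)) <= tol]


    if __name__ == "__main__":
        # Hypothetical ground samples lying on z = 2x + 3y + 1.
        ground = [(0.0, 0.0, 1.0), (1.0, 0.0, 3.0), (0.0, 1.0, 4.0), (1.0, 1.0, 6.0)]
        plane = fit_plane(ground)
        cloud = ground + [(0.5, 0.5, 10.0)]  # one elevated, non-ground point
        print(plane, filter_ground(cloud, plane, tol=0.1))
    ```

    Under the same assumptions, regional filtering would call `fit_plane` once per virtual grid cell on the points falling inside that cell, rather than once for the whole area.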