• Title/Summary/Keyword: Data Heterogeneity

Search Result 613, Processing Time 0.026 seconds

Capacity aware Scalable Video Coding in P2P on Demand Streaming Systems

  • Xing, Changyou;Chen, Ming;Hu, Chao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.7 no.9
    • /
    • pp.2268-2283
    • /
    • 2013
  • Scalable video coding can handle peer heterogeneity of P2P streaming applications, but there is still a lack of comprehensive studies on how to use it to improve video playback quality. In this paper we propose a capacity aware scalable video coding mechanism for P2P on demand streaming system. The proposed mechanism includes capacity based neighbor selection, adaptive data scheduling and streaming layer adjustment, and can enable each peer to select appropriate streaming layers and acquire streaming chunks with proper sequence, along with choosing specific peers to provide them. Simulation results show that the presented mechanism can decrease the system's startup and playback delay, and increase the video playback quality as well as playback continuity, and thus it provides a better quality of experience for users.

Bayesian Modeling of Random Effects Covariance Matrix for Generalized Linear Mixed Models

  • Lee, Keunbaik
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.3
    • /
    • pp.235-240
    • /
    • 2013
  • Generalized linear mixed models(GLMMs) are frequently used for the analysis of longitudinal categorical data when the subject-specific effects is of interest. In GLMMs, the structure of the random effects covariance matrix is important for the estimation of fixed effects and to explain subject and time variations. The estimation of the matrix is not simple because of the high dimension and the positive definiteness; subsequently, we practically use the simple structure of the covariance matrix such as AR(1). However, this strong assumption can result in biased estimates of the fixed effects. In this paper, we introduce Bayesian modeling approaches for the random effects covariance matrix using a modified Cholesky decomposition. The modified Cholesky decomposition approach has been used to explain a heterogenous random effects covariance matrix and the subsequent estimated covariance matrix will be positive definite. We analyze metabolic syndrome data from a Korean Genomic Epidemiology Study using these methods.

An Empirical Research on Relation between FDI and Technology Diffusion: Using Nonstationary Panel Data (외국인 직접투자의 기술확산 효과에 대한 실증분석 : 비안정적 패널자료를 이용하여)

  • Kim Hong-Kee;Kim Jong-Woon
    • Journal of Korea Technology Innovation Society
    • /
    • v.8 no.3
    • /
    • pp.1225-1249
    • /
    • 2005
  • This study aims at investigating whether foreign direct investment plays a role as a channel of international technology diffusion. We used the annual panel data from 1980 to 2002. The nonstationary panel techniques, in particular group mean panel FMOLS(fully modified OLS) was exploited as an empirical methodology in order to tackle the heterogeneity between members and low frequency. The empirical results show that inflow direct investments lead to an increase in total factor productivity and economic growth. Also outflow direct investments contribute to an higher total factor productivity and economic growth. These results confirms that both inflow and outflow direct investments are important channels for international technology diffusion or spillover.

  • PDF

Nonparametric analysis of income distributions among different regions based on energy distance with applications to China Health and Nutrition Survey data

  • Ma, Zhihua;Xue, Yishu;Hu, Guanyu
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.1
    • /
    • pp.57-67
    • /
    • 2019
  • Income distribution is a major concern in economic theory. In regional economics, it is often of interest to compare income distributions in different regions. Traditional methods often compare the income inequality of different regions by assuming parametric forms of the income distributions, or using summary statistics like the Gini coefficient. In this paper, we propose a nonparametric procedure to test for heterogeneity in income distributions among different regions, and a K-means clustering procedure for clustering income distributions based on energy distance. In simulation studies, it is shown that the energy distance based method has competitive results with other common methods in hypothesis testing, and the energy distance based clustering method performs well in the clustering problem. The proposed approaches are applied in analyzing data from China Health and Nutrition Survey 2011. The results indicate that there are significant differences among income distributions of the 12 provinces in the dataset. After applying a 4-means clustering algorithm, we obtained the clustering results of the income distributions in the 12 provinces.

Significance and Research Challenges of Defensive and Offensive Cybersecurity in Smart Grid

  • Hana, Mujlid
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.12
    • /
    • pp.29-36
    • /
    • 2022
  • Smart grid (SG) software platforms and communication networks that run and manage the entire grid are increasingly concerned about cyber security. Characteristics of the smart grid networks, including heterogeneity, time restrictions, bandwidth, scalability, and other factors make it difficult to secure. The age-old strategy of "building bigger walls" is no longer sufficient given the rise in the quantity and size of cyberattacks as well as the sophisticated methods threat actor uses to hide their actions. Cyber security experts utilize technologies and procedures to defend IT systems and data from intruders. The primary objective of every organization's cybersecurity team is to safeguard data and information technology (IT) infrastructure. Consequently, further research is required to create guidelines and methods that are compatible with smart grid security. In this study, we have discussed objectives of of smart grid security, challenges of smart grid security, defensive cybersecurity techniques, offensive cybersecurity techniques and open research challenges of cybersecurity.

Comparison of performance of automatic detection model of GPR signal considering the heterogeneous ground (지반의 불균질성을 고려한 GPR 신호의 자동탐지모델 성능 비교)

  • Lee, Sang Yun;Song, Ki-Il;Kang, Kyung Nam;Ryu, Hee Hwan
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.24 no.4
    • /
    • pp.341-353
    • /
    • 2022
  • Pipelines are buried in urban area, and the position (depth and orientation) of buried pipeline should be clearly identified before ground excavation. Although various geophysical methods can be used to detect the buried pipeline, it is not easy to identify the exact information of pipeline due to heterogeneous ground condition. Among various non-destructive geo-exploration methods, ground penetration radar (GPR) can explore the ground subsurface rapidly with relatively low cost compared to other exploration methods. However, the exploration data obtained from GPR requires considerable experiences because interpretation is not intuitive. Recently, researches on automated detection technology for GPR data using deep learning have been conducted. However, the lack of GPR data which is essential for training makes it difficult to build up the reliable detection model. To overcome this problem, we conducted a preliminary study to improve the performance of the detection model using finite difference time domain (FDTD)-based numerical analysis. Firstly, numerical analysis was performed with homogeneous soil media having single permittivity. In case of heterogeneous ground, numerical analysis was performed considering the ground heterogeneity using fractal technique. Secondly, deep learning was carried out using convolutional neural network. Detection Model-A is trained with data set obtained from homogeneous ground. And, detection Model-B is trained with data set obtained from homogeneous ground and heterogeneous ground. As a result, it is found that the detection Model-B which is trained including heterogeneous ground shows better performance than detection Model-A. It indicates the ground heterogeneity should be considered to increase the performance of automated detection model for GPR exploration.

Classification and Prediction of Highway Accident Characteristics Using Vehicle Black Box Data (블랙박스 영상 기반 고속도로 사고유형 분류 및 사고 심각도 예측 평가)

  • Junhan Cho;Sungjun Lee;Seongmin Park;Juneyoung Park
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.21 no.6
    • /
    • pp.132-145
    • /
    • 2022
  • This study was based on the black box images of traffic accidents on highways, cluster analysis and prediction model comparisons were carried out. As analysis data, vehicle driving behavior and road surface conditions that can grasp road and traffic conditions just before the accident were used as explanatory variables. Considering that traffic accident data is affected by many factors, cluster analysis reflecting data heterogeneity is used. Each cluster classified by cluster analysis was divided based on the ratio of the severity level of the accident, and then an accident prediction evaluation was performed. As a result of applying the Logit model, the accident prediction model showed excellent predictive ability when classifying groups by cluster analysis and predicting them rather than analyzing the entire data. It is judged that it is more effective to predict accidents by reflecting the characteristics of accidents by group and the severity of accidents. In addition, it was found that a collision accident during stopping such as a secondary accident and a side collision accident during lane change act as important driving behavior variables.

Design of Falling Recognition Application System using Deep Learning

  • Kwon, TaeWoo;Lee, Jong-Yong;Jung, Kye-Dong
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.12 no.2
    • /
    • pp.120-126
    • /
    • 2020
  • Studies are being conducted regarding falling recognition using sensors on smartphonesto recognize falling in human daily life. These studies use a number of sensors, mostly acceleration sensors, gyro sensors, motion sensors, etc. Falling recognition system processes the values of sensor data by using a falling recognition algorithm and classifies behavior based on thresholds. If the threshold is ambiguous, the accuracy will be reduced. To solve this problem, Deep learning was introduced in the behavioral recognition system. Deep learning is a kind of machine learning technique that computers process and categorize input data rather than processing it by man-made algorithms. Thus, in this paper, we propose a falling recognition application system using deep learning based on smartphones. The proposed system is powered by apps on smartphones. It also consists of three layers and uses DataBase as a Service (DBaaS) to handle big data and address data heterogeneity. The proposed system uses deep learning to recognize the user's behavior, it can expect higher accuracy compared to the system in the general rule base.

A Study on the Application of Natural Language Processing in Health Care Big Data: Focusing on Word Embedding Methods (보건의료 빅데이터에서의 자연어처리기법 적용방안 연구: 단어임베딩 방법을 중심으로)

  • Kim, Hansang;Chung, Yeojin
    • Health Policy and Management
    • /
    • v.30 no.1
    • /
    • pp.15-25
    • /
    • 2020
  • While healthcare data sets include extensive information about patients, many researchers have limitations in analyzing them due to their intrinsic characteristics such as heterogeneity, longitudinal irregularity, and noise. In particular, since the majority of medical history information is recorded in text codes, the use of such information has been limited due to the high dimensionality of explanatory variables. To address this problem, recent studies applied word embedding techniques, originally developed for natural language processing, and derived positive results in terms of dimensional reduction and accuracy of the prediction model. This paper reviews the deep learning-based natural language processing techniques (word embedding) and summarizes research cases that have used those techniques in the health care field. Then we finally propose a research framework for applying deep learning-based natural language process in the analysis of domestic health insurance data.

Geomorphologic Nash Model with Variable Width Function

  • Thuy, Nguyen Thi Phuong;Kim, Joo-Cheol;Jung, Kwansue
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2015.05a
    • /
    • pp.212-212
    • /
    • 2015
  • So far, geomorphologic dispersion due to the heterogeneity characteristics of flow paths in a basin has been demonstrated as a major factor affecting to the hydrologic response function of a catchment. This effect has considered by many previous studies taking into account flow path length factors, especially in the application of width function. Based upon the analysis of topographic index, another important geomorphologic factor extracted from DEM data, this work presents a new factor named saturation to evaluate its effects to the formation of the well-known instantaneous unit hydrograph (IUH) in Nash model and drainage structure in a river basin. First, the geomorphologic parameters corresponding to different saturation conditions are computed from DEM data with the support of GIS software. Then, in the combination of hydrologic and geomorphologic data, effective rainfall in each saturation degree and the Nash parameters are calculated using excel. Finally, the verification process with direct runoff data is conducted using Fortran programming. This process is applied to five sub-watersheds in Bocheong catchment ($485.21km^2$) in Korea where the necessary data are available and believable. The results from this approach will improve researchers and students'understandings about the relationship between rainfall and runoff and its relation with drainage structure within a catchment.

  • PDF