• Title/Summary/Keyword: Data inconsistency

Incremental Batch Update of Spatial Data Cube with Multi-dimensional Concept Hierarchies (다차원 개념 계층을 지원하는 공간 데이터 큐브의 점진적 일괄 갱신 기법)

  • Ok, Geun-Hyoung;Lee, Dong-Wook;You, Byeong-Seob;Lee, Jae-Dong;Bae, Hae-Young
    • Journal of Korea Multimedia Society / v.9 no.11 / pp.1395-1409 / 2006
  • A spatial data warehouse has a spatial data cube composed of multi-dimensional data for efficient OLAP (On-Line Analytical Processing) operations. A spatial data cube supporting concept hierarchies holds a huge amount of data, so many studies have investigated incremental update methods that minimize modification of the cube. However, a cube compressed by eliminating prefix and suffix redundancy has coalescing paths that cause update inconsistencies, because some updates can affect the aggregate value of a coalesced cell that has no relationship with the update. In this paper, we propose an incremental batch update method for a spatial data cube. The proposed method uses duplicated nodes and an extended node structure to avoid update inconsistencies. If a collision is detected during the update procedure, the shared node is duplicated and the duplicate is updated. As a result, a compressed spatial data cube that includes concept hierarchies can be updated incrementally with no inconsistency. In the performance evaluation, we show that the proposed method is more efficient than naive update methods.
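
The "duplicate the shared node, then update the duplicate" step described in this abstract resembles generic copy-on-write. The following is a minimal sketch of that idea only, not the paper's algorithm or its extended node structure: the `Node` class, the `refs` counter, and the `link` and `update` helpers are all hypothetical names introduced for illustration.

```python
class Node:
    """Hypothetical cube node: one aggregate value plus child links."""
    def __init__(self, value=0):
        self.value = value      # aggregate stored at this cell
        self.children = {}      # dimension key -> Node
        self.refs = 0           # number of parents pointing at this node


def link(parent, key, child):
    """Attach child under parent and count the new reference."""
    parent.children[key] = child
    child.refs += 1


def update(root, path, delta):
    """Add delta to each node reached along path (assumed scheme).

    A node with more than one parent is treated as a collision:
    it is duplicated first, so the other paths keep the old aggregate.
    """
    node = root
    for key in path:
        child = node.children[key]
        if child.refs > 1:                      # shared via a coalesced path
            dup = Node(child.value)
            for k, grandchild in child.children.items():
                link(dup, k, grandchild)        # the copy shares the subtree below
            child.refs -= 1                     # this parent lets go of the original
            node.children[key] = dup
            dup.refs = 1
            child = dup
        child.value += delta
        node = child
    return root


# Two paths ("a" and "b") coalesce onto one shared node.
root = Node()
shared = Node(5)
link(root, "a", shared)
link(root, "b", shared)
update(root, ["a"], 3)
```

After the call, the node under `"a"` is a fresh copy holding the updated aggregate, while the node under `"b"` is the original and keeps its old value, which is the inconsistency the abstract says duplication avoids.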

F_MixBERT: Sentiment Analysis Model using Focal Loss for Imbalanced E-commerce Reviews

  • Fengqian Pang;Xi Chen;Letong Li;Xin Xu;Zhiqiang Xing
    • KSII Transactions on Internet and Information Systems (TIIS) / v.18 no.2 / pp.263-283 / 2024
  • Users' comments after online shopping are critical to product reputation and business improvement. These comments, sometimes known as e-commerce reviews, influence other customers' purchasing decisions. To cope with the large volume of e-commerce reviews, automatic analysis based on machine learning and deep learning is drawing increasing attention. A core task therein is sentiment analysis. However, e-commerce reviews exhibit the following characteristics: (1) inconsistency between comment content and the star rating; (2) a large amount of unlabeled data, i.e., comments without a star rating; and (3) data imbalance caused by sparse negative comments. This paper employs Bidirectional Encoder Representations from Transformers (BERT), one of the best natural language processing models, as the base model. Based on the above data characteristics, we propose the F_MixBERT framework to use inconsistent, low-quality, and unlabeled data more effectively and to resolve the problem of data imbalance. In the framework, the proposed MixBERT incorporates the MixMatch approach into BERT's high-dimensional vectors to train on the unlabeled and low-quality data with generated pseudo labels. Meanwhile, data imbalance is resolved by focal loss, which reduces the contribution of abundant and easily identifiable data to the total loss. Comparative experiments demonstrate that the proposed framework outperforms BERT and MixBERT for sentiment analysis of e-commerce comments.
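
To make the role of focal loss concrete, here is a minimal sketch of the standard binary form, FL = -alpha_t * (1 - p_t)^gamma * log(p_t), compared with plain cross-entropy. It is not the F_MixBERT implementation: the function names are hypothetical, and `gamma=2.0` and `alpha=0.25` are illustrative defaults rather than values taken from this paper.

```python
import math


def cross_entropy(p, y):
    """Binary cross-entropy for one example.

    p: predicted probability of the positive class, y: label (0 or 1).
    """
    p_t = p if y == 1 else 1.0 - p
    return -math.log(p_t)


def focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Binary focal loss for one example (illustrative defaults).

    The factor (1 - p_t) ** gamma shrinks the loss of examples the
    model already classifies confidently; alpha_t reweights the classes.
    """
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


# Fraction of the cross-entropy loss that survives under focal loss.
easy = focal_loss(0.9, 1) / cross_entropy(0.9, 1)   # confident, correct
hard = focal_loss(0.1, 1) / cross_entropy(0.1, 1)   # confident, wrong
```

With these settings the easy example keeps a far smaller share of its cross-entropy loss than the hard one, so training is driven mostly by examples that are still misclassified.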

A State-of-the-Art Review on Debonding Failures of FRP Laminates Externally Adhered to Concrete

  • Kang, Thomas H.K.;Howell, Joe;Kim, Sang-Hee;Lee, Dong-Joo
    • International Journal of Concrete Structures and Materials / v.6 no.2 / pp.123-134 / 2012
  • There is significant concern in the engineering community regarding the safety and effectiveness of fiber-reinforced polymer (FRP) strengthening of RC structures because of the potential for brittle debonding failures. In this paper, previous research programs conducted by other researchers were reviewed in terms of the debonding failure of FRP laminates externally attached to concrete. This review article also discusses the influences on bond strength and failure modes as well as the existing experimental research and developed equations. Based on the review, several important conclusions were re-emphasized, including the finding that the bond transfer strength is proportional to the concrete compressive strength; that there is a certain bond development length that has to be exceeded; and that thinner adhesive layers in fact lower the chances of a concrete-adhesive interface failure. It is also found that there exist uncertainty and inaccuracy in the available models when compared with the experimental data and inconsistency among the models. This demonstrates the need for continuing research and compilation of data on the topic of FRP's bond strength.

Alternative Labor Shortage Statistical Measures for Small and Medium Enterprises in Korea (한국의 중소 제조업체 노동력 부족의 개념과 측정)

  • Seol Dong-Hoon
    • Korea journal of population studies / v.27 no.1 / pp.121-146 / 2004
  • Despite the fact that there were about 435,000 unemployed youth in 2003, small and medium manufacturing companies in South Korea experience a shortage of labor. The Korean government has released statistical data on labor shortage as well as unemployment. However, there is an inconsistency in the labor shortage statistics for the small and medium business sector released by two different government bodies: the Labor Demand Survey by the Ministry of Labor (MOL), and the Manpower Survey for the Small and Medium Business by the Small and Medium Business Administration (SMBA). This paper analyzes the causes of the differences in the conceptualization and measurement of labor shortage and in the data collection methods. This paper also suggests alternative statistical indicators to overcome the confusion.

SSR (Simple Sector Remapper) the fault tolerant FTL algorithm for NAND flash memory

  • Lee, Gui-Young;Kim, Bumsoo;Kim, Shin-han;Byungsoo Jung
    • Proceedings of the IEEK Conference / 2002.07b / pp.932-935 / 2002
  • In this paper, we introduce a new FTL (Flash Translation Layer) driver algorithm that tolerates power-off errors. An FTL driver is software that provides a block device interface to upper-layer software, such as file systems or application programs, that uses flash memory as block-device storage. Flash memory is commonly used as the storage device of mobile systems because of its low power consumption and small form factor. In a mobile system, the power supply is not stable, because it relies on a small battery with limited capacity. A sudden power-off failure can therefore occur while data is being read from or written to flash memory. During a write operation, a power-off failure may leave the write incomplete, and an incomplete write leaves the data in flash memory inconsistent. To provide stable storage with flash memory in a mobile system, the FTL should provide fault tolerance against power-off failure. SSR (Simple Sector Remapper) is a fault-tolerant FTL driver that provides a block device interface and tolerates power-off errors.

Characteristics of deconstruction expressed in the contemporary knit fashion (현대 니트패션에 나타난 해체주의 특성)

  • Lee, Yoon Mee
    • The Research Journal of the Costume Culture / v.26 no.4 / pp.583-597 / 2018
  • The purpose of this study is to classify and analyze the deconstruction phenomena expressed in contemporary knit fashion design, and to analyze the inner meaning of deconstruction based on certain characteristics. As a method of study, literature data for theoretical backgrounds, prior studies, and internet data were analyzed. The scope of this study was restricted to knitwear presented in the world's four major collections (Milan, Paris, New York and London) from 2014 F/W to 2018 S/S. Based on prior studies, four concepts of deconstruction were derived: "Différance", "Intertextuality", "Intermeaning of Meaning", and "Dis De Phenomenon". The results of the study were as follows. First, "Différance" refers to a transcendence of time and space. These expressions are discursive, unrealistic, and convey freedom through intent that deviates from rules and norms. Second, "Intertextuality" indicates a mixture of different texts, such as styles, materials, and items. These expressions deliver novelty with amusement, and can be entertaining depending on audience expectations. Third, "Intermeaning of Meaning" is an accidental category that depends on how the wearer wears the clothing; accordingly, free and spontaneous creativity is an emerging trend in fashion. Fourth, the clothing was expressed in deformed and distorted form by the construction and destruction of the structure, a technique we describe as the "Dis De Phenomenon". In this concept, a sense of free design and youthful emotion appears along with a sense of purity and shock due to intentional inconsistency.

Image Tracking Algorithm using Template Matching and PSNF-m

  • Bae, Jong-Sue;Song, Taek-Lyul
    • International Journal of Control, Automation, and Systems / v.6 no.3 / pp.413-423 / 2008
  • The template matching method is used as a simple method to track objects or patterns that we want to search for in the input image data from image sensors. It recognizes a segment with the highest correlation as a target. The concept of this method is similar to that of SNF (Strongest Neighbor Filter) that regards the measurement with the highest signal intensity as target-originated among other measurements. The SNF assumes that the strongest neighbor (SN) measurement in the validation gate originates from the target of interest and the SNF utilizes the SN in the update step of a standard Kalman filter (SKF). The SNF is widely used along with the nearest neighbor filter (NNF), due to computational simplicity in spite of its inconsistency of handling the SN as if it is the true target. Probabilistic Strongest Neighbor Filter for m validated measurements (PSNF-m) accounts for the probability that the SN in the validation gate originates from the target while the SNF assumes at any time that the SN measurement is target-originated. It is known that the PSNF-m is superior to the SNF in performance at a cost of increased computational load. In this paper, we suggest an image tracking algorithm that combines the template matching and the PSNF-m to estimate the states of a tracked target. Computer simulation results are included to demonstrate the performance of the proposed algorithm in comparison with other algorithms.
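
The template-matching step alone ("recognizes a segment with the highest correlation as a target") can be sketched as an exhaustive search over image windows. This is a minimal pure-Python illustration that assumes normalized cross-correlation as the similarity score; it does not cover the PSNF-m filtering stage, and the function names `ncc` and `match_template` are hypothetical.

```python
import math


def ncc(patch, template):
    """Normalized cross-correlation of two equally sized 2-D lists."""
    a = [v for row in patch for v in row]
    b = [v for row in template for v in row]
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a)
                    * sum((y - mean_b) ** 2 for y in b))
    return num / den if den > 0 else 0.0   # flat windows score 0


def match_template(image, template):
    """Return ((row, col), score) of the window most correlated with template."""
    th, tw = len(template), len(template[0])
    best_pos, best_score = None, -2.0      # NCC is always >= -1
    for r in range(len(image) - th + 1):
        for c in range(len(image[0]) - tw + 1):
            patch = [row[c:c + tw] for row in image[r:r + th]]
            score = ncc(patch, template)
            if score > best_score:
                best_pos, best_score = (r, c), score
    return best_pos, best_score


image = [
    [0, 0, 0, 0, 0],
    [0, 0, 1, 2, 0],
    [0, 0, 3, 4, 0],
    [0, 0, 0, 0, 0],
]
template = [[1, 2], [3, 4]]
position, score = match_template(image, template)
```

In a tracker of the kind the abstract describes, the position of the best window would serve as the measurement passed to the filter's update step; here the search simply returns the top-left corner of the window that matches the template.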

Elementary Students' Epistemological Views on the Nature of Scientific Measurement (측정의 본성에 대한 초등학생들의 인식론적 견해)

  • Yang, Chan-Ho;Lee, Ji-Hyeon;Kim, Young-Hoon;Noh, Tae-Hee
    • Journal of Korean Elementary Science Education / v.30 no.4 / pp.430-441 / 2011
  • We investigated elementary students' epistemological views on the nature of scientific measurement. The Views About Scientific Measurement (Ibrahim, 2005) was administered to 117 sixth graders. The analyses of the results indicated that there was an inconsistency in their epistemological views depending on the context of the measurement. They also had some difficulties in understanding the distribution of data, which is needed to understand the necessity of repeating measurements, choosing a best representative value, and comparing data sets. They were found to have some naive views on scientific measurement, which negatively influenced the fostering of modern epistemological views on the nature of scientific measurement. The results suggest that the nature of scientific measurement should be emphasized explicitly in the national curriculum, and that an effective method for improving elementary students' epistemological views on the nature of scientific measurement should also be developed.

Factors Influencing Uncertainty in Dialysis Patient by Duration of Dialysis (투석기간에 따른 투석 환자의 불확실성 요인)

  • Yun, Su Jung;Lee, Young Hee
    • Korean Journal of Adult Nursing / v.24 no.6 / pp.597-606 / 2012
  • Purpose: This study aimed to describe uncertainty, depression, physical symptoms, and family support among patients undergoing dialysis. The factors that affect uncertainty were also examined. Methods: A convenience sample of 145 patients who received dialysis was selected. A descriptive correlation study was conducted. Data were collected using structured questionnaires and analyzed using descriptive statistics and multiple regression analysis. Results: Patients who had received dialysis for more than five years reported higher levels on the inconsistency aspect of uncertainty than patients with less than five years. The latter patients' reported uncertainty was positively correlated with depression, whereas patients' family support was correlated with uncertainty. For the group with less than five years of dialysis, about 13% of the variance in uncertainty was explained. In contrast, for patients with more than five years of dialysis, education level, family support, and monthly income were predictors of uncertainty and explained 33% of the variance. Conclusion: These results can inform nursing interventions that help reduce uncertainty. To provide dialysis period-sensitive nursing interventions for uncertainty among dialysis patients, depression should be considered for patients with less than five years of dialysis, while factors such as education level, family support, and monthly income should be taken into account for patients with more than five years.

Methods to System Integration in Distributed Heterogeneous Environments (분산 이기종 환경에서의 메시지미들웨어(MOM) 시스템 통합방안 연구)

  • Kim Jong-Bae;Song Jae-Young;Rhew Sung-Yul
    • Journal of Digital Contents Society / v.6 no.3 / pp.163-168 / 2005
  • Computing infrastructures and technologies are moving into distributed environments. Due to the increase in M&A and outsourcing, and the growth or development of various systems within an organization, various problems are arising, such as difficulties in maintenance and repair, duplication or inconsistency of data, a lack of interconnection between different systems, and difficulties in financing and selecting an adequate solution. This study presents a method to integrate systems that adopts message middleware as an efficient alternative for integrating applications and data between different models in distributed system environments. We expect that the integration method presented in this study, which adopts message middleware between systems, will be an efficient alternative for building interfaces between small systems in terms of expense and efficiency.
